Junli Gu
Junli Gu
Tesla autopilot
Verified email at
Cited by
Cited by
Heterogeneous system coherence for integrated CPU-GPU systems
J Power, A Basu, J Gu, S Puthoor, BM Beckmann, MD Hill, SK Reinhardt, ...
Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013
PPEP: Online performance, power, and energy prediction framework and DVFS space exploration
B Su, J Gu, L Shen, W Huang, JL Greathouse, Z Wang
2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 445-457, 2014
WADE: Writeback-aware dynamic cache management for NVM-based main memory system
Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez
ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013
Implementing a leading loads performance predictor on commodity processors
B Su, JL Greathouse, J Gu, M Boyer, L Shen, Z Wang
2014 USENIX Annual Technical Conference (USENIX ATC 14), 2014
Opencl caffe: Accelerating and enabling a cross platform machine learning framework
J Gu, Y Liu, Y Gao, M Zhu
Proceedings of the 4th International Workshop on OpenCL, 1-5, 2016
Implementation and evaluation of deep neural networks (DNN) on mainstream heterogeneous systems
J Gu, M Zhu, Z Zhou, F Zhang, Z Lin, Q Zhang, M Breternitz
Proceedings of 5th Asia-Pacific Workshop on Systems, 1-7, 2014
NAIS: Neural architecture and implementation search and its applications in autonomous driving
C Hao, Y Chen, X Liu, A Sarwari, D Sew, A Dhar, B Wu, D Fu, J Xiong, ...
2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2019
A hybrid GPU+ FPGA system design for autonomous driving cars
C Hao, A Sarwari, Z Jin, H Abu-Haimed, D Sew, Y Li, X Liu, B Wu, D Fu, ...
2019 IEEE International Workshop on Signal Processing Systems (SiPS), 121-126, 2019
Self-supervised learning of depth and ego-motion with differentiable bundle adjustment
Y Shi, J Zhu, Y Fang, K Lien, J Gu
arXiv preprint arXiv:1909.13163, 2019
Moving data between caches in a heterogeneous processor system
J Gu, BM Beckmann, Y Xie
US Patent 9,652,390, 2017
Optimizing a parallel video encoder with message passing and a shared memory architecture
J Gu, Y Sun
Tsinghua Science and Technology 16 (4), 393-398, 2011
Structure-attentioned memory network for monocular depth estimation
J Zhu, Y Shi, M Ren, Y Fang, KC Lien, J Gu
arXiv preprint arXiv:1909.04594, 2019
MOPED: Orchestrating interprocess message data on CMPs
J Gu, SS Lumetta, R Kumar, Y Sun
2011 IEEE 17th International Symposium on High Performance Computer …, 2011
iCHAT: inter-cache hardware-assistant data transfer for heterogeneous chip multiprocessors
J Gu, BM Beckmann, T Cao, Y Hu
2014 9th IEEE International Conference on Networking, Architecture, and …, 2014
Accelerating data movement on future chip multi-processors
J Gu, R Kumar, SS Lumetta, Y Sun
Proceedings of the Second International Forum on Next-Generation Multicore …, 2010
MOPED: Accelerating data communication on future cmps
J Gu, Y Sun, SS Lumetta, R Kumar
IEEE Micro 31 (4), 42-50, 2011
Enhancing lifetime of non-volatile cache by injecting random replacement policy
Z Wang, Y Xie, Y Xu, J Gu, T Cao
US Patent 9,792,228, 2017
Enhancing lifetime of non-volatile cache by reducing intra-block write variation
Z Wang, Y Xie, Y Xu, J Gu, T Cao
US Patent 9,767,043, 2017
Thermal-aware compiler for parallel instruction execution in processors
Y Xie, J Gu
US Patent 9,639,359, 2017
Method and apparatus related to cache memory
Z Wang, J Gu, Y Xu
US Patent 9,552,301, 2017
The system can't perform the operation now. Try again later.
Articles 1–20