Junli Gu

Cited by

	All	Since 2019
Citations	594	298
h-index	10	10
i10-index	11	10

201120122013201420152016201720182019202020212022202320242 2 1 13 40 78 73 72 60 63 66 47 51 10

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mark D. HillUniversity of Wisconsin-Madison Professor EmeritusVerified email at cs.wisc.edu

Junli Gu

Tesla autopilot

Verified email at tesla.com

Machine learning Heterogeneous systems


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Heterogeneous system coherence for integrated CPU-GPU systems J Power, A Basu, J Gu, S Puthoor, BM Beckmann, MD Hill, SK Reinhardt, ... Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013	202	2013
PPEP: Online performance, power, and energy prediction framework and DVFS space exploration B Su, J Gu, L Shen, W Huang, JL Greathouse, Z Wang 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 445-457, 2014	111	2014
WADE: Writeback-aware dynamic cache management for NVM-based main memory system Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013	58	2013
Implementing a leading loads performance predictor on commodity processors B Su, JL Greathouse, J Gu, M Boyer, L Shen, Z Wang 2014 USENIX Annual Technical Conference (USENIX ATC 14), 2014	47	2014
Opencl caffe: Accelerating and enabling a cross platform machine learning framework J Gu, Y Liu, Y Gao, M Zhu Proceedings of the 4th International Workshop on OpenCL, 1-5, 2016	40	2016
Implementation and evaluation of deep neural networks (DNN) on mainstream heterogeneous systems J Gu, M Zhu, Z Zhou, F Zhang, Z Lin, Q Zhang, M Breternitz Proceedings of 5th Asia-Pacific Workshop on Systems, 1-7, 2014	34	2014
NAIS: Neural architecture and implementation search and its applications in autonomous driving C Hao, Y Chen, X Liu, A Sarwari, D Sew, A Dhar, B Wu, D Fu, J Xiong, ... 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2019	24	2019
A hybrid GPU+ FPGA system design for autonomous driving cars C Hao, A Sarwari, Z Jin, H Abu-Haimed, D Sew, Y Li, X Liu, B Wu, D Fu, ... 2019 IEEE International Workshop on Signal Processing Systems (SiPS), 121-126, 2019	19	2019
Methods and apparatus related to data processors and caches incorporated in data processors Z Wang, X Yuan, J Gu, Y Xu, SC Shan, S Mu, T Cao US Patent 9,317,448, 2016	13	2016
Self-supervised learning of depth and ego-motion with differentiable bundle adjustment Y Shi, J Zhu, Y Fang, K Lien, J Gu arXiv preprint arXiv:1909.13163, 2019	11	2019
Moving data between caches in a heterogeneous processor system J Gu, BM Beckmann, Y Xie US Patent 9,652,390, 2017	10	2017
Optimizing a parallel video encoder with message passing and a shared memory architecture J Gu, Y Sun Tsinghua Science and Technology 16 (4), 393-398, 2011	9	2011
Structure-attentioned memory network for monocular depth estimation J Zhu, Y Shi, M Ren, Y Fang, KC Lien, J Gu arXiv preprint arXiv:1909.04594, 2019	5	2019
MOPED: Orchestrating interprocess message data on CMPs J Gu, SS Lumetta, R Kumar, Y Sun 2011 IEEE 17th International Symposium on High Performance Computer …, 2011	4	2011
Enhancing lifetime of non-volatile cache by reducing intra-block write variation Z Wang, Y Xie, Y Xu, J Gu, T Cao US Patent 9,767,043, 2017	2	2017
iCHAT: inter-cache hardware-assistant data transfer for heterogeneous chip multiprocessors J Gu, BM Beckmann, T Cao, Y Hu 2014 9th IEEE International Conference on Networking, Architecture, and …, 2014	2	2014
Accelerating data movement on future chip multi-processors J Gu, R Kumar, SS Lumetta, Y Sun Proceedings of the Second International Forum on Next-Generation Multicore …, 2010	2	2010
MOPED: Accelerating data communication on future cmps J Gu, Y Sun, SS Lumetta, R Kumar IEEE Micro 31 (4), 42-50, 2011	1	2011
Enhancing lifetime of non-volatile cache by injecting random replacement policy Z Wang, Y Xie, Y Xu, J Gu, T Cao US Patent 9,792,228, 2017		2017
Thermal-aware compiler for parallel instruction execution in processors Y Xie, J Gu US Patent 9,639,359, 2017		2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors