Follow
Yaohui Cai
Title
Cited by
Cited by
Year
Zeroq: A novel zero shot quantization framework
Y Cai, Z Yao, Z Dong, A Gholami, MW Mahoney, K Keutzer
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
3672020
Hawq-v2: Hessian aware trace-weighted quantization of neural networks
KK Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami, Michael ...
arXiv preprint arXiv:1911.03852, 2019
226*2019
Codenet: Efficient deployment of input-adaptive object detection on embedded fpgas
Q Huang, D Wang, Z Dong, Y Gao, Y Cai, T Li, B Wu, K Keutzer, ...
The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays …, 2021
502021
Quip: 2-bit quantization of large language models with guarantees
J Chee, Y Cai, V Kuleshov, CM De Sa
Advances in Neural Information Processing Systems 36, 2024
292024
Structured Pruning of CNNs at Initialization
Y Cai, W Hua, H Chen, F Li, GE Suh, C De Sa, Z Zhang
11*2022
Spade: A spectral method for black-box adversarial robustness evaluation
W Cheng, C Deng, Z Zhao, Y Cai, Z Zhang, Z Feng
International Conference on Machine Learning, 1814-1824, 2021
82021
Algorithm-hardware co-design for deformable convolution
Q Huang, D Wang, Y Gao, Y Cai, Z Dong, B Wu, K Keutzer, J Wawrzynek
2019 Fifth Workshop on Energy Efficient Machine Learning and Cognitive …, 2019
82019
Understanding the potential of fpga-based spatial acceleration for large language model inference
H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang
arXiv preprint arXiv:2312.15159, 2023
12023
Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs
D Dai, Y Zhang, J Zhang, Z Hu, Y Cai, Q Sun, Z Zhang
arXiv preprint arXiv:2401.17544, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–9