A second look at exponential and cosine step sizes: Simplicity, adaptivity, and performance | X Li, Z Zhuang, F Orabona | International Conference on Machine Learning, 6553-6564, 2021 | 16* | 2021 |
No-regret non-convex online meta-learning | Z Zhuang, Y Wang, K Yu, S Lu | ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
Online meta-learning on non-convex setting | Z Zhuang, Y Wang, K Yu, S Lu | arXiv preprint arXiv:1910.10196 3, 2019 | 4 | 2019 |
Surrogate losses for online learning of stepsizes in stochastic non-convex optimization | Z Zhuang, A Cutkosky, F Orabona | Proceedings of the 36th International Conference on Machine Learning 97 …, 2019 | 1 | 2019 |
A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks | M Liu, Z Zhuang, Y Lei, C Liao | arXiv preprint arXiv:2205.05040, 2022 | | 2022 |
Understanding AdamW through Proximal Methods and Scale-Freeness | Z Zhuang, M Liu, A Cutkosky, F Orabona | arXiv preprint arXiv:2202.00089, 2022 | | 2022 |