Fast federated learning in the presence of arbitrary device unavailability X Gu, K Huang, J Zhang, L Huang Advances in Neural Information Processing Systems 34, 12052-12064, 2021 | 93 | 2021 |
Why (and When) does Local SGD Generalize Better than SGD? X Gu, K Lyu, L Huang, S Arora 2023 International Conference on Learning Representations (ICLR 2023), 2023 | 24 | 2023 |
Keeping llms aligned after fine-tuning: The crucial role of prompt templates K Lyu, H Zhao, X Gu, D Yu, A Goyal, S Arora arXiv preprint arXiv:2402.18540, 2024 | 15 | 2024 |
A Quadratic Synchronization Rule for Distributed Deep Learning X Gu, K Lyu, S Arora, J Zhang, L Huang 2024 International Conference on Learning Representations (ICLR 2024), 2023 | 1 | 2023 |