Aurick Qiao
Snowflake AI Research
Verified email at snowflake.com
Title
Cited by
Year
Pollux: Co-adaptive cluster scheduling for goodput-optimized deep learning
A Qiao, SK Choe, SJ Subramanya, W Neiswanger, Q Ho, H Zhang, ...
15th USENIX Symposium on Operating Systems Design and Implementation …, 2021
Cited by 184, 2021
Managed communication and consistency for fast data-parallel iterative analytics
J Wei, W Dai, A Qiao, Q Ho, H Cui, GR Ganger, PB Gibbons, GA Gibson, ...
Proceedings of the Sixth ACM Symposium on Cloud Computing, 381-394, 2015
Cited by 149, 2015
Litz: An Elastic Framework for High-Performance Distributed Machine Learning
A Qiao, A Aghayev, W Yu, H Chen, Q Ho, GA Gibson, EP Xing
Cited by 66, 2017
Multi-pivot quicksort: Theory and experiments
S Kushagra, A López-Ortiz, A Qiao, JI Munro
2014 Proceedings of the Sixteenth Workshop on Algorithm Engineering and …, 2014
Cited by 63, 2014
Fault tolerance in iterative-convergent machine learning
A Qiao, B Aragam, B Zhang, E Xing
International Conference on Machine Learning, 5220-5230, 2019
Cited by 52, 2019
LLM360: Towards fully transparent open-source LLMs
Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ...
arXiv preprint arXiv:2312.06550, 2023
Cited by 46, 2023
Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling
S Jayaram Subramanya, D Arfeen, S Lin, A Qiao, Z Jia, GR Ganger
Proceedings of the 29th Symposium on Operating Systems Principles, 642-657, 2023
Cited by 37, 2023
Elastic management of machine learning computing
A Qiao, Q Ho, E Xing
US Patent 10,649,806, 2020
Cited by 15, 2020
Operating system for distributed enterprise artificial intelligence programs on data centers and the clouds
W Dai, W Yu, EP Xing, A Qiao, Q Ho
US Patent 10,782,988, 2020
Cited by 8, 2020
Scaling HDBSCAN Clustering with kNN Graph Approximation
J Jackson, A Qiao, EP Xing
Cited by 2, 2018
SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference
G Oliaro, Z Jia, D Campos, A Qiao
arXiv preprint arXiv:2411.04975, 2024
2024
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
A Qiao, Z Yao, S Rajbhandari, Y He
arXiv preprint arXiv:2410.03960, 2024
2024
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
J Lee, A Qiao, DF Campos, Z Yao, Y He
arXiv preprint arXiv:2409.06211, 2024
2024
Efficient LLM Scheduling by Learning to Rank
Y Fu, S Zhu, R Su, A Qiao, I Stoica, H Zhang
arXiv preprint arXiv:2408.15792, 2024
2024
Elastic Machine Learning Systems with Co-adaptation
A Qiao
Carnegie Mellon University, 2021
2021
AutoDist: A Composable and Automated Synchronization System for Distributed Deep Learning
H Zhang, P Wu, Z Deng, C Li, Q Ho, A Qiao, Z Wang, EP Xing
Training Larger Models on TensorFlow without Additional GPU
J Wei, A Qiao, A Jayarajan, G Gibson, V Vasudevan, E Xing