Sustainable serverless computing with cold-start optimization and automatic workflow resource scheduling S Pan, H Zhao, Z Cai, D Li, R Ma, H Guan IEEE Transactions on Sustainable Computing, 2023 | 18 | 2023 |
faaShark: An end-to-end network traffic analysis system atop serverless computing platforms H Zhao, S Pan, Z Cai, X Chen, L Jin, H Gao, S Wan, R Ma, H Guan IEEE Transactions on Network Science and Engineering, 2023 | 8 | 2023 |
GUARDIAN: A Hardware-Assisted Distributed Framework to Enhance Deep Learning Security Z Cai, B Ren, R Ma, H Guan, M Tian, Y Wang IEEE Transactions on Computational Social Systems, 2023 | 8 | 2023 |
SMSS: Stateful Model Serving in Metaverse With Serverless Computing and GPU Sharing Z Cai, Z Chen, R Ma, H Guan IEEE Journal on Selected Areas in Communications, 2023 | 5 | 2023 |
Themis: A Fair Evaluation Platform for Computer Vision Competitions. Z Cai, J Yuan, Y Hua, T Song, H Wang, Z Xue, N Hu, J Ding, R Ma, ... IJCAI, 599-605, 2021 | 3 | 2021 |
RIDIC: Real-Time Intelligent Transportation System With Dispersed Computing Z Cai, Z Chen, Z Liu, Q Xie, R Ma, H Guan IEEE Transactions on Intelligent Transportation Systems, 2023 | 2 | 2023 |
Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach J Yuan, J Zhang, Z Cai, J Yan Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 2 | 2023 |
Slob: Suboptimal load balancing scheduling in local heterogeneous gpu clusters for large language model inference P Jiang, H Wang, Z Cai, L Gao, W Zhang, R Ma, X Zhou IEEE Transactions on Computational Social Systems, 2024 | 1 | 2024 |
Deep Convolutional Linear Precoder Neural Network for Rate Splitting Strategy of Aerial Computing Networks Z Wang, R Ma, H Shi, Z Cai, L Lin, H Guan IEEE Transactions on Network Science and Engineering, 2024 | 1 | 2024 |
MemoriaNova: Optimizing Memory-Aware Model Inference for Edge Computing R Zhang, T Zhang, Z Cai, D Li, R Ma, B Rajkumar ACM Transactions on Architecture and Code Optimization, 2024 | | 2024 |
FasDL: An Efficient Serverless-Based Training Architecture with Communication Optimization and Resource Configuration X Chen, Z Cai, H Zhang, R Ma, R Buyya IEEE Transactions on Computers, 1-14, 2024 | | 2024 |
Hermes: Memory-Efficient Pipeline Inference for Large Models on Edge Devices X Han, Z Cai, Y Zhang, C Fan, J Liu, R Ma, R Buyya The 42nd IEEE International Conference on Computer Design (ICCD 2024), 2024 | | 2024 |
Serving Large Language Models on Trusted Serverless Computing Platforms Z Cai, R Ma, Y Fu, W Zhang, R Ma, H Guan IEEE Transactions on Artificial Intelligence, 2024 | | 2024 |
SPSC: Stream Processing Framework Atop Serverless Computing for Industrial Big Data Z Cai, Z Chen, X Chen, R Ma, H Guan, R Buyya IEEE Transactions on Cybernetics, 2024 | | 2024 |