Follow
Zinuo Cai
Title
Cited by
Cited by
Year
Sustainable serverless computing with cold-start optimization and automatic workflow resource scheduling
S Pan, H Zhao, Z Cai, D Li, R Ma, H Guan
IEEE Transactions on Sustainable Computing, 2023
182023
faaShark: An end-to-end network traffic analysis system atop serverless computing platforms
H Zhao, S Pan, Z Cai, X Chen, L Jin, H Gao, S Wan, R Ma, H Guan
IEEE Transactions on Network Science and Engineering, 2023
82023
GUARDIAN: A Hardware-Assisted Distributed Framework to Enhance Deep Learning Security
Z Cai, B Ren, R Ma, H Guan, M Tian, Y Wang
IEEE Transactions on Computational Social Systems, 2023
82023
SMSS: Stateful Model Serving in Metaverse With Serverless Computing and GPU Sharing
Z Cai, Z Chen, R Ma, H Guan
IEEE Journal on Selected Areas in Communications, 2023
52023
Themis: A Fair Evaluation Platform for Computer Vision Competitions.
Z Cai, J Yuan, Y Hua, T Song, H Wang, Z Xue, N Hu, J Ding, R Ma, ...
IJCAI, 599-605, 2021
32021
RIDIC: Real-Time Intelligent Transportation System With Dispersed Computing
Z Cai, Z Chen, Z Liu, Q Xie, R Ma, H Guan
IEEE Transactions on Intelligent Transportation Systems, 2023
22023
Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach
J Yuan, J Zhang, Z Cai, J Yan
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
22023
Slob: Suboptimal load balancing scheduling in local heterogeneous gpu clusters for large language model inference
P Jiang, H Wang, Z Cai, L Gao, W Zhang, R Ma, X Zhou
IEEE Transactions on Computational Social Systems, 2024
12024
Deep Convolutional Linear Precoder Neural Network for Rate Splitting Strategy of Aerial Computing Networks
Z Wang, R Ma, H Shi, Z Cai, L Lin, H Guan
IEEE Transactions on Network Science and Engineering, 2024
12024
MemoriaNova: Optimizing Memory-Aware Model Inference for Edge Computing
R Zhang, T Zhang, Z Cai, D Li, R Ma, B Rajkumar
ACM Transactions on Architecture and Code Optimization, 2024
2024
FasDL: An Efficient Serverless-Based Training Architecture with Communication Optimization and Resource Configuration
X Chen, Z Cai, H Zhang, R Ma, R Buyya
IEEE Transactions on Computers, 1-14, 2024
2024
Hermes: Memory-Efficient Pipeline Inference for Large Models on Edge Devices
X Han, Z Cai, Y Zhang, C Fan, J Liu, R Ma, R Buyya
The 42nd IEEE International Conference on Computer Design (ICCD 2024), 2024
2024
Serving Large Language Models on Trusted Serverless Computing Platforms
Z Cai, R Ma, Y Fu, W Zhang, R Ma, H Guan
IEEE Transactions on Artificial Intelligence, 2024
2024
SPSC: Stream Processing Framework Atop Serverless Computing for Industrial Big Data
Z Cai, Z Chen, X Chen, R Ma, H Guan, R Buyya
IEEE Transactions on Cybernetics, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–14