Publications

2026

[C5] FedCompass: Federated Clustered and Periodic Aggregation Framework for Hybrid Classical-Quantum Models (CCF-B)
Yueheng Wang, Xing He, Zinuo Cai, Rui Zhang, Ruhui Ma, Yuan Liu and Rajkumar Buyya
in Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2026.

2025

[C4] AARC: Automated Affinity-aware Resource Configuration for Serverless Workflows (CCF-A)
Lingxiao Jin#, Zinuo Cai#, Zebin Chen, Hongyu Zhao and Ruhui Ma
in Proc. of the Design Automation Conference (DAC), 2025.

[J18] QFI-Opt: Communication-Efficient Quantum Federated Learning via Quantum Fisher Information (CCF-B)
Rui Zhang, Yucheng Wang, Xing He, Zinuo Cai, Yicheng Di, Jiayu Bao, Jiansong Fan, Zhongle Qu
in Software: Practice and Experience (SPE), 2025.

[J17] FedACL: A Collaborative Federated Fine-Tuning Framework for Large Language Models with AWLoRA and Contrastive Learning
Zijie Zhao#, Zhenshuo Zhang#, Zinuo Cai, Yiming Qiang, Tianqi Wu, Baoheng Zhang, Ruhui Ma, Yuan Liu
in IEEE Transactions on Computational Social Systems (TCSS), 2025.

[J16] HeShare: Energy-Aware and Efficient Multi-Task GPU Sharing in Heterogeneous GPU-based Computing Systems (CCF-A)
Zhuolong Jiang, Zinuo Cai, Hongyu Zhao, Baoheng Zhang, Tianqi Wu, Yiming Qiang, Ruhui Ma, Haibing Guan, Rajkumar Buyya
in IEEE Transactions on Computers (TC), 2025.

[J15] Ephemera: Accelerating I/O-Intensive Serverless Workloads with a Harvested In-memory File System (CCF-A)
Lingxiao Jin#, Zinuo Cai#, Haoxin Wang, Zongpu Zhang, Ruhui Ma, Haibing Guan, Yuan Liu, Rajkumar Buyya
in ACM Transactions on Architecture and Code Optimization (TACO), 2025.

[J14] SMORE: Enhancing GPU Utilization in Deep Learning Clusters by Serverless-based Co-location Scheduling (CCF-A)
Junhan Liu#, Zinuo Cai#, Yumou Liu, Hao Li, Zongpu Zhang, Ruhui Ma, Rajkumar Buyya
in IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025.

[J13] HWDSQP: A Historical Weighted and Dynamic Scheduling Quantum Protocol to Enhance Communication Reliability (CCF-A)
Liwei Lin#, Rongbo Ma#, Zejian Wang, Zinuo Cai, Haochen Xu, Baoheng Zhang, Ruhui Ma, Rajkumar Buyya
in IEEE Journal on Selected Areas in Communications (JSAC), 2025.

[J12] HyperTuneFaaS: A Serverless Framework for Hyperparameter Tuning in Image Processing Models
Jiantao Zhang, Bojun Ren, Yicheng Fu, Rongbo Ma, Zinuo Cai, Weishan Zhang, Ruhui Ma, Jinshan Sun
in Displays, 2025.

2024

[J11] FasDL: An Efficient Serverless-Based Training Architecture with Communication Optimization and Resource Configuration (CCF-A)
Xinglei Chen#, Zinuo Cai#, Hanwen Zhang, Ruhui Ma, Rajkumar Buyya
in IEEE Transactions on Computers (TC), 2024.

[J10] MemoriaNova: Optimizing Memory-Aware Model Inference on Edge Devices (CCF-A)
Renjun Zhang, Tianming Zhang, Zinuo Cai, Dongmei Li, Ruhui Ma, Rajkumar Buyya
in ACM Transactions on Architecture and Code Optimization (TACO), 2024.

[J9] Slob: Suboptimal Load Balancing Scheduling in Local Heterogeneous GPU Clusters for Large Language Model Inference
Peiwen Jiang, Haoxin Wang, Zinuo Cai, Lintao Gao, Weishan Zhang, Ruhui Ma, Xiaokang Zhou
in IEEE Transactions on Computational Social Systems (TCSS), 2024.

[J8] LLMaaS: Serving Large Language Models on Trusted Serverless Computing Platforms
Zinuo Cai, Rongbo Ma, Yicheng Fu, Weishan Zhang, Ruhui Ma, Haibing Guan
in IEEE Transactions on Artificial Intelligence (TAI), 2024.

[J7] SPSC: Stream Processing Framework Atop Serverless Computing for Industrial Big Data
Zinuo Cai, Zebin Chen, Xinglei Chen, Ruhui Ma, Haibing Guan, Rajkumar Buyya
in IEEE Transactions on Cybernetics (TCYB), 2024.

[J6] Deep Convolutional Linear Precoder Neural Network for Rate Splitting Strategy of Aerial Computing Networks
Zhijie Wang, Ruhui Ma, Hongjian Shi, Zinuo Cai, Liwei Lin, Haibing Guan
in IEEE Transactions on Network Science and Engineering (TNSE), 2024.

2023

[C3] Hermes: Memory-Efficient Pipeline Inference for Large Models on Edge Devices (CCF-B)
Xueyuan Han, Zinuo Cai, Yichu Zhang, Chongxin Fan, Junhan Liu, Ruhui Ma, Rajkumar Buyya
in Proc. of the IEEE International Conference on Computer Design (ICCD), 2024.

[C2] Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach (CCF-A)
Jianyong Yuan, Jiayi Zhang, Zinuo Cai, Junchi Yan
in Proc. of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024.

[J5] SMSS: Stateful Model Serving in Metaverse With Serverless Computing and GPU Sharing (CCF-A)
Zinuo Cai, Zebin Chen, Ruhui Ma, Haibing Guan
in IEEE Journal on Selected Areas in Communications (JSAC), 2023.

[J4] Sustainable Serverless Computing with Cold-Start Optimization and Automatic Workflow Resource Scheduling
Shanxing Pan, Hongyu Zhao, Zinuo Cai, Dongmei Li, Ruhui Ma, Haibing Guan
in IEEE Transactions on Sustainable Computing (TSUSC), 2023.

[J3] RIDIC: Real-Time Intelligent Transportation System With Dispersed Computing
Zinuo Cai, Zebin Chen, Zihan Liu, Quanmin Xie, Ruhui Ma, Haibing Guan
in IEEE Transactions on Intelligent Transportation Systems (TITS), 2023.

[J2] faaShark: An End-to-End Network Traffic Analysis System Atop Serverless Computing Platforms
Hongyu Zhao, Shanxing Pan, Zinuo Cai, Xinglei Chen, Lingxiao Jin, Honghao Gao, Shaohua Wan, Ruhui Ma, Haibing Guan
in IEEE Transactions on Network Science and Engineering (TNSE), 2023.

[J1] GUARDIAN: A Hardware-Assisted Distributed Framework to Enhance Deep Learning Security
Zinuo Cai, Bojun Ren, Ruhui Ma, Haibing Guan, Mengke Tian, Yong Wang
in IEEE Transactions on Computational Social Systems (TCSS), 2023.

2021

[C1] Themis: A Fair Evaluation Platform for Computer Vision Competitions (CCF-A)
Zinuo Cai#, Jianyong Yuan#, Yang Hua, Tao Song, Hao Wang, Zhengui Xue, Ningxin Hu, Jonathan Ding, Ruhui Ma, Mohammad Reza Haghighat, Haibing Guan
in Proc. of the International Joint Conference on Artificial Intelligence (IJCAI), 2021.