I am fortunate to work with Prof. Guoliang Xing at CUHK. Before joining CUHK, I was advised by Prof. Xiang-Yang Li and Prof. Lan Zhang at USTC. My research interests lie in designing theory-backed algorithms and building innovative systems for AI workloads.
STIP: Three-Party Privacy-Preserving and Lossless Inference for Large Transformers in Production
Mu Yuan, Lan Zhang, Yihang Cheng, Miao-Hui Song, Guoliang Xing, Xiang-Yang Li
Accepted by NDSS 2026
Code
| Official MindSpore Support
| Qwen2.5 Usage Example
SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving
Mu Yuan, Lan Zhang, Liekang Zeng, Siyang Jiang, Bufang Yang, Di Duan, Guoliang Xing
Accepted by SIGCOMM 2025
Code
| Bilibili
| YouTube
Myo-Trainer: A Vision-based Muscle-Aware Motion Feedback System for In-Home Resistance Training
Yuting He, Xinyan Wang, Mu Yuan, Bufang Yang, Siyang Jiang, Yihua Huang, Doris S. F. Yu, Guoliang Xing, Hongkai Chen
Accepted by MobiCom 2025
RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Yihang Cheng, Lan Zhang, Junyang Wang, Mu Yuan, Yunhao Yao
ACL 2025
Cite
A-VL: Adaptive Attention for Large Vision-Language Models
Junyang Zhang, Mu Yuan, Ruiguang Zhong, Puhan Luo, Huiyou Zhan, Ningkang Zhang, Chengchen Hu, Xiangyang Li
AAAI 2025
PDF
Grape: Efficient Spatiotemporal Prediction Services with Stale Sensing Streams
Liekang Zeng, Yunchao Liu, Shengyuan Ye, Mu Yuan, Di Duan, Xu Chen, Guoliang Xing
IEEE Real-Time Systems Symposium (RTSS) 2025
Argus: Multi-View Egocentric Human Mesh Reconstruction Based on Stripped-Down Wearable mmWave Add-on
Di Duan, Shengzhe Lyu, Mu Yuan, Hongfei Xue, Tianxing Li, Weitao Xu, Kaishun Wu, Guoliang Xing
SenSys 2025 (🏅 Best Paper Honorable Mention Award)
PDF
PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
Mu Yuan, Lan Zhang, Xuanke You, Xiang-Yang Li
ACM SIGCOMM 2023 (SIG Grant Award)
PDF
| Slides
| Code
| Cite
InFi: End-to-end Learnable Input Filter for Resource-efficient Mobile-centric Inference
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Xiang-Yang Li
ACM MobiCom 2022
PDF
| Slides
| Code
| Bilibili
| Cite
MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li
IEEE TPAMI 2023
PDF
| Cite
Efficient Deep Ensemble Inference via Query Difficulty-dependent Task Scheduling
Zichong Li, Lan Zhang, Mu Yuan, Miao-Hui Song, Qi Song
IEEE ICDE 2023
MLink: Linking Black-box Models for Collaborative Multi-model Inference
Mu Yuan, Lan Zhang, Xiang-Yang Li
AAAI 2022 (Oral 4.5%)
PDF
| Slides (20min version)
| Slides (1min version)
| Code
| Bilibili
| Cite
Adaptive Model Scheduling for Resource-efficient Data Labeling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Lin-Zhuo Yang, Hui Xiong
ACM TKDD 2022
PDF
| Cite
Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong
IEEE ICDE 2020
PDF
| Official Video
| Bilibili
| Slides
| Cite
Heterogeneous Collaborative Model Inference
Mu Yuan
CCF Doctoral Dissertation Award
CCF TCIoT Doctoral Dissertation Award
USTC Doctoral Dissertation Award
PDF
[The 3rd International Conference on the Frontiers of Robotics and Software Engineering (FRSE2025)]
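[2025.8.9] Zhangjiajie, Hunan - Enhancing AI System Performance and Security via Device-Cloud Collaboration
[The 15th CCF Outstanding Doctoral Dissertation Forum]
[2025.8.4] Wuzhen, Zhejiang - Device-Cloud Collaborative Intelligence for Efficient, Secure, and Personalized Model Serving
[Invited Talk at the University of Science and Technology of China (USTC)]
[2025.5.6] Hefei, Anhui - Device-Cloud Collaboration Paradigm Enabling Efficient Confidential Inference for Large Models
[CUHK Technical Salon on Large Model Reliability]
[2025.2.21] Hong Kong - Model Inference-Native AIoT Systems
[2024 Huawei Young Scholars Forum (Data Communication Session)]
[2024.11.30] Suzhou, Jiangsu - Heterogeneous Collaborative Model Inference
[The 18th China Conference on Internet of Things (CWSN 2024)]
[2024.9.21] Taiyuan, Shanxi - Outstanding Doctoral Dissertation Talk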
Reviewer of ACM IMWUT, IEEE INFOCOM, IEEE TMC, IEEE IoTJ, AAAI, NeurIPS
Co-chair of ANAI Workshop 2025 (co-located with ACM MobiCom 2025)
2024-2025 National Natural Science Foundation of China, Grant No. 623B2093, RMB 300,000
2024 CCF Doctoral Dissertation Award (10 Nationwide) [Link]
2024 CCF TCIoT Doctoral Dissertation Award (4 Nationwide) [Link]
2024 USTC Doctoral Dissertation Award [Link]
2024 CAS President Award [Link]
2023 ByteDance Scholars Award (13 Nationwide) [Link]
2018 SenseTime Scholarship (22 Nationwide) [Link]
2018 Grand Prize (1 out of 1530 teams) of the 4th National University Cloud Computing Contest [Link]