Logo

Mu Yuan (袁牧)

毋自欺,以诚意正心

The Chinese University of Hong Kong

Email: ym0813 at mail.ustc.edu.cn
muyuan at cuhk.edu.hk

Google Scholar
GitHub
ORCID
Complete CV

I am fortunate to work with Prof. Guoliang Xing at CUHK. Before joining CUHK, I was advised by Prof. Xiang-Yang Li and Prof. Lan Zhang at USTC. My research interest is designing theory-backed algorithms and building innovative systems for AI workloads.

📌 News

  • [Aug 2025] Our paper "STIP: Three-Party Privacy-Preserving and Lossless Inference for Large Transformers in Production" has been accepted by NDSS 2026!
  • [Jun 2025] Our two papers "Myo-Trainer" and "Llambda" have been accepted by MobiCom 2025!
  • [May 2025] Our paper "RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service" has been accepted by ACL 2025!
  • [May 2025] Our paper "Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmwave add-on" received the Best Paper Honorable Mention Award at SenSys 2025!
  • [Mar 2025] Our paper "SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving" has been accepted by SIGCOMM 2025!

Selected Publications

Large Language Models (LLMs)

STIP: Three-Party Privacy-Preserving and Lossless Inference for Large Transformers in Production
Mu Yuan, Lan Zhang, Yihang Cheng, Miao-Hui Song, Guoliang Xing, Xiang-Yang Li
Accepted by NDSS 2026
Code | Official MindSpore Support | Qwen2.5 Usage Example


SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving
Mu Yuan, Lan Zhang, Liekang Zeng, Siyang Jiang, Bufang Yang, Di Duan, Guoliang Xing
Accepted by SIGCOMM 2025
Code


LLM-Driven Low-Resolution Vision System for On-Device Human Behavior Understanding
Siyang Jiang, Bufang Yang, Lilin Xu, Mu Yuan, Yeerzhati Abudunuer, Kaiwei Liu, Liekang Zeng, Hongkai Chen, Xiaofan Jiang, Zhenyu Yan, Guoliang Xing
Accepted by MobiCom 2025


Myo-Trainer: A Vision-based Muscle-Aware Motion Feedback System for In-Home Resistance Training
Yuting He, Xinyan Wang, Mu Yuan, Bufang Yang, Siyang Jiang, Yihua Huang, Doris S. F. Yu, Guoliang Xing, Hongkai Chen
Accepted by MobiCom 2025


RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Yihang Cheng, Lan Zhang, Junyang Wang, Mu Yuan, Yunhao Yao
ACL 2025
Cite


A-VL: Adaptive Attention for Large Vision-Language Models
Junyang Zhang, Mu Yuan, Ruiguang Zhong, Puhan Luo, Huiyou Zhan, Ningkang Zhang, Chengchen Hu, Xiangyang Li
AAAI 2025
PDF


Mobile/Edge Intelligence

Grape: Efficient Spatiotemporal Prediction Services with Stale Sensing Streams
Liekang Zeng, Yunchao Liu, Shengyuan Ye, Mu Yuan, Di Duan, Xu Chen, Guoliang Xing
IEEE Real-Time Systems Symposium (RTSS) 2025


Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmwave add-on
Di Duan, Shengzhe Lyu, Mu Yuan, Hongfei Xue, Tianxing Li, Weitao Xu, Kaishun Wu, Guoliang Xing
SenSys 2025 (🏅 Best Paper Honorable Mention Award)
PDF


PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
Mu Yuan, Lan Zhang, Xuanke You, Xiang-Yang Li
ACM SIGCOMM 2023 (SIG Grant Award)
PDF | Slides | Code | Cite


InFi: End-to-end Learnable Input Filter for Resource-efficient Mobile-centric Inference
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Xiang-Yang Li
ACM MobiCom 2022
PDF | Slides | Code | Bilibili | Cite

Model Scheduling

MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li
IEEE TPAMI 2023
PDF | Cite


Efficient Deep Ensemble Inference via Query Difficulty-dependent Task Scheduling
Zichong Li, Lan Zhang, Mu Yuan, Miao-Hui Song, Qi Song
IEEE ICDE 2023


MLink: Linking Black-box Models for Collaborative Multi-model Inference
Mu Yuan, Lan Zhang, Xiang-Yang Li
AAAI 2022 (Oral 4.5%)
PDF | Slides (20min version) | Slides (1min version) | Code | Bilibili | Cite


Adaptive Model Scheduling for Resource-efficient Data Labeling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Lin-Zhuo Yang, Hui Xiong
ACM TKDD 2022
PDF | Cite


Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong
IEEE ICDE 2020
PDF | Official Video | Bilibili | Slides | Cite

🎓 Doctoral Dissertation

异构协同模型推理 (Heterogeneous Collaborative Model Inference)
袁牧 (Mu Yuan)
CCF 全国优博 (CCF Doctoral Dissertation Award)
CCF 物联网专委优博 (CCF TCIoT Doctoral Dissertation Award)
中国科学技术大学校优博 (USTC Doctoral Dissertation Award)
PDF

Invited Talks

[The 3rd International Conference on the Frontiers of Robotics and Software Engineering (FRSE2025)]
[2025.8.9] 张家界,湖南 - Enhancing AI System Performance and Security via Device-Cloud Collaboration

[第十五届中国计算机学会优博论坛]
[2025.8.4] 乌镇,浙江 - 端云协同智能面向高效、安全、个性化的模型服务

[中国科学技术大学 邀请报告]
[2025.5.6] 合肥,安徽 - 端云协同范式赋能大模型高效机密推理

[港中文大模型可靠性技术沙龙]
[2025.2.21] 香港 - 模型推理原生的智能物联网系统 Model Inference-Native AIoT Systems

[2024年华为青年学者交流会(数通专场)]
[2024.11.30] 苏州,江苏 - 异构协同模型推理

[第十八届中国物联网学术会议 CWSN 2024]
[2024.9.21] 太原,山西 - 优秀博士论文报告


Services

Reviewer of ACM IMWUT, IEEE INFOCOM, IEEE TMC, IEEE IoTJ, AAAI, NeurIPS
Co-chair of ANAI Workshop 2025 (co-located with ACM MobiCom 2025)


Funds and Awards

2024-2025 National Natural Science Foundation of China, Grant No.623B2093, RMB 300,000
2024 CCF Doctoral Dissertation Award (10 Nationwide) [Link]
2024 CCF TCIoT Doctoral Dissertation Award (4 Nationwide) [Link]
2024 USTC Doctoral Dissertation Award [Link]
2024 CAS President Award [Link]
2023 ByteDance Scholars Award (13 Nationwide) [Link]
2018 SenseTime Scholarship (22 Nationwide) [Link]
2018 Grand Price (1 out of 1530 teams) of the 4th National University Cloud Computing Contest [Link]