Logo

Mu Yuan (袁牧), PhD

毋自欺,以诚意正心

The Chinese University of Hong Kong

Email: ym0813 at mail.ustc.edu.cn
muyuan at cuhk.edu.hk

Google Scholar
GitHub
ORCID
Complete CV

Currently, I am a postdoctoral fellow at CUHK, working with Prof. Guoliang Xing.
I received my PhD degree from USTC in 2024, advised by Prof. Xiang-Yang Li and Prof. Lan Zhang.
I received my Bachelor's degree from USTC in 2019, as a member of the Hua-Xia Talent Class (华夏英才班).
My research interests lie in designing theory-backed algorithms and building innovative systems for AI workloads.

📌 News

  • [Dec 2025] We released the TopoSense-Bench dataset, a large-scale benchmark for semantic-spatial sensor scheduling used in our IoT-Brain work [MobiCom '26]!
  • [Dec 2025] Our paper "Venus" and "BeeKeeper" have been accepted by INFOCOM 2026! Congratulations to Shengyuan and Weizheng!
  • [Nov 2025] Our paper "IoT-Brain: Grounding LLMs for Semantic-Spatial Sensor Scheduling" has been accepted by ACM MobiCom 2026! Congratulations to Zhaomeng and Junyang!
  • [Nov 2025] The agenda for the ANAI workshop is out! Check out our keynote, panels, industry experience sharing, and banquet (link)! Looking forward to seeing you all in Hong Kong on November 8th.
  • [Oct 2025] Our paper "TopFGL: A Topology-Aware and Distribution-Agnostic Federated Learning Framework Tackling Topological Heterogeneity on Graph Data" has been accepted by ICDE 2026! Congratulations to Junyang Wang!
  • [Sep 2025] The ANAI Workshop @ MobiCom 2025 has released the accepted papers! Congratulations to all authors!
  • [Aug 2025] Our paper "STIP: Three-Party Privacy-Preserving and Lossless Inference for Large Transformers in Production" has been accepted by NDSS 2026!
  • [Jun 2025] Our paper "Myo-Trainer" has been accepted by MobiCom 2025!
  • [May 2025] Our paper "Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmwave add-on" received the Best Paper Honorable Mention Award at SenSys 2025!
For a complete list of publications, please refer to my Google Scholar profile.

Large Language Models (LLMs)

STIP: Three-Party Privacy-Preserving and Lossless Inference for Large Transformers in Production
Mu Yuan, Lan Zhang, Yihang Cheng, Miao-Hui Song, Guoliang Xing, Xiang-Yang Li
NDSS 2026
Code Official MindSpore Support Qwen2.5 Usage Example


SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving
Mu Yuan, Lan Zhang, Liekang Zeng, Siyang Jiang, Bufang Yang, Di Duan, Guoliang Xing
SIGCOMM 2025
Code


Myo-Trainer: A Vision-based Muscle-Aware Motion Feedback System for In-Home Resistance Training
Yuting He, Xinyan Wang, Mu Yuan, Bufang Yang, Siyang Jiang, Yihua Huang, Doris S. F. Yu, Guoliang Xing, Hongkai Chen
MobiCom 2025 (🏅 ACM SenSys '24 Best Demo Runner-up Award)


RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Yihang Cheng, Lan Zhang, Junyang Wang, Mu Yuan, Yunhao Yao
ACL 2025


A-VL: Adaptive Attention for Large Vision-Language Models
Junyang Zhang, Mu Yuan, Ruiguang Zhong, Puhan Luo, Huiyou Zhan, Ningkang Zhang, Chengchen Hu, Xiangyang Li
AAAI 2025

Mobile/Edge Intelligence

Venus: An Efficient Edge Memory-and-Retrieval System for VLM-based Online Video Understanding
Shengyuan Ye, Bei Ouyang, Tianyi Qian, Liekang Zeng, Mu Yuan, Xiaowen Chu, Weijie Hong, Xu Chen
IEEE INFOCOM 2026


IoT-Brain: Grounding LLMs for Semantic-Spatial Sensor Scheduling
Zhaomeng Zhou, Lan Zhang, Junyang Wang, Mu Yuan, Junda Lin, Jinke Song
ACM MobiCom 2026
Dataset


Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmwave add-on
Di Duan, Shengzhe Lyu, Mu Yuan, Hongfei Xue, Tianxing Li, Weitao Xu, Kaishun Wu, Guoliang Xing
SenSys 2025 (🏅 Best Paper Honorable Mention Award)


PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
Mu Yuan, Lan Zhang, Xuanke You, Xiang-Yang Li
ACM SIGCOMM 2023 (SIG Grant Award)
PDF Code


InFi: End-to-end Learnable Input Filter for Resource-efficient Mobile-centric Inference
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Xiang-Yang Li
ACM MobiCom 2022
PDF Code

Multi-Model Scheduling

Mitigating Tail Latency for on-Device Inference with Load-Balanced Heterogeneous Models
Mu Yuan, Lan Zhang, Di Duan, Liekang Zeng, Miao-Hui Song, Zichong Li, Guoliang Xing, Xiang-Yang Li
IEEE TMC 2025


MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li
IEEE TPAMI 2023
PDF


MLink: Linking Black-box Models for Collaborative Multi-model Inference
Mu Yuan, Lan Zhang, Xiang-Yang Li
AAAI 2022 (Oral 4.5%)
PDF Code


Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong
IEEE ICDE 2020
PDF

🎓 Doctoral Dissertation

异构协同模型推理 (Heterogeneous Collaborative Model Inference)
袁牧 (Mu Yuan)
CCF 全国优博 (CCF Doctoral Dissertation Award)
CCF 物联网专委优博 (CCF TCIoT Doctoral Dissertation Award)
中国科学技术大学校优博 (USTC Doctoral Dissertation Award)

🎤 Invited Talks

Keynote: Co-Designing the Edge and Cloud for Scalable and Secure AI Inference [Slides], ECCAI @ CoNEXT 2025, 2025.12.1

端云协同:驱动大模型高效与安全服务的系统研究, ACM 中国图灵大会 ACM TURC 2025, 2025.10.11

Enhancing AI System Performance and Security via Device-Cloud Collaboration [Best Oral Presentation Award], The 3rd International Conference on the Frontiers of Robotics and Software Engineering (FRSE2025), 2025.8.9

端云协同智能面向高效、安全、个性化的模型服务, 第十五届中国计算机学会优博论坛, 2025.8.4

端云协同范式赋能大模型高效机密推理, 中国科学技术大学专题报告, 2025.5.6

模型推理原生的智能物联网系统 Model Inference-Native AIoT Systems, 香港中文大学大模型可靠性技术沙龙, 2025.2.21

优秀博士论文报告, 第十八届中国物联网学术会议 CWSN 2024, 2024.9.21


📊 Conference Paper Presentations

SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving [YouTube] [Bilibili], ACM SIGCOMM 2025, 2025.9.9

PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale [Bilibili] [Slides], ACM SIGCOMM 2023, 2023.9.13

InFi: End-to-end Learnable Input Filter for Resource-efficient Mobile-centric Inference [Bilibili] [Slides], ACM MobiCom 2022, 2022.10.18

MLink: Linking Black-box Models for Collaborative Multi-model Inference [Bilibili] [Slides (20min)] [Slides (1min)], AAAI 2022

Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling [Bilibili] [Slides], IEEE ICDE 2020

Reviewing Editor of Springer Nature (2025 - present)
Co-chair of ANAI Workshop 2025 (co-located with ACM MobiCom 2025)
Reviewer of:
ACM IMWUT 2024
IEEE INFOCOM 2025
IEEE TMC 2024, 2025
AAAI 2023, 2024, 2025
NeurIPS 2025
2025 ACM SenSys '25 Best Paper Honorable Mention Award [Link]
2024 ACM SenSys '24 Best Demo Runner-up Award [Link]
2024 CCF Doctoral Dissertation Award (10 Nationwide) [Link]
2024 CCF TCIoT Doctoral Dissertation Award (4 Nationwide) [Link]
2024 USTC Doctoral Dissertation Award [Link]
2024 CAS President Award [Link]
2023 ByteDance Scholars Award (13 Nationwide) [Link]
2023/2022/2020 National Scholarship
2018 SenseTime Scholarship (22 Nationwide) [Link]
2018 Grand Prize (1 out of 1530 teams) of the 4th National University Cloud Computing Contest [Link]