I am fortunate to work with Prof. Guoliang Xing at CUHK.
Before joining CUHK, I was advised by Prof. Xiang-Yang Li and Prof. Lan Zhang at USTC.
My research focuses on designing theory-backed algorithms and building innovative systems for AI workloads.
News
- [Jun 2025] Our two papers "Myo-Trainer" and "Llambda" have been accepted by MobiCom 2025!
- [May 2025] Our paper "RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service" has been accepted by ACL 2025!
- [May 2025] Our paper "Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmWave add-on" received the Best Paper Honorable Mention Award at SenSys 2025!
- [Mar 2025] Our paper "SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving" has been accepted by SIGCOMM 2025!
Selected Publications
Large Language Models (LLMs)
SCX: Stateless KV-Cache Encoding for Cloud-Scale Confidential Transformer Serving
Mu Yuan, Lan Zhang, Liekang Zeng, Siyang Jiang, Bufang Yang, Di Duan, Guoliang Xing
Accepted by SIGCOMM 2025
LLM-Driven Low-Resolution Vision System for On-Device Human Behavior Understanding
Siyang Jiang, Bufang Yang, Lilin Xu, Mu Yuan, Yeerzhati Abudunuer, Kaiwei Liu, Liekang Zeng, Hongkai Chen, Xiaofan Jiang, Zhenyu Yan, Guoliang Xing
Accepted by MobiCom 2025
Myo-Trainer: A Vision-based Muscle-Aware Motion Feedback System for In-Home Resistance Training
Yuting He, Xinyan Wang, Mu Yuan, Bufang Yang, Siyang Jiang, Yihua Huang, Doris S. F. Yu, Guoliang Xing, Hongkai Chen
Accepted by MobiCom 2025
RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service
Yihang Cheng, Lan Zhang, Junyang Wang, Mu Yuan, Yunhao Yao
ACL 2025
Cite
@inproceedings{cheng-2025-remoterag,
title = "{R}emote{RAG}: A Privacy-Preserving {LLM} Cloud {RAG} Service",
author = "Cheng, Yihang and Zhang, Lan and Wang, Junyang and Yuan, Mu and Yao, Yunhao",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-acl.197/",
pages = "3820--3837",
ISBN = "979-8-89176-256-5",
}
A-VL: Adaptive Attention for Large Vision-Language Models
Junyang Zhang, Mu Yuan, Ruiguang Zhong, Puhan Luo, Huiyou Zhan, Ningkang Zhang, Chengchen Hu, Xiangyang Li
AAAI 2025
PDF
Mobile/Edge Intelligence
Argus: Multi-view egocentric human mesh reconstruction based on stripped-down wearable mmWave add-on
Di Duan, Shengzhe Lyu, Mu Yuan, Hongfei Xue, Tianxing Li, Weitao Xu, Kaishun Wu, Guoliang Xing
SenSys 2025 (🏅 Best Paper Honorable Mention Award)
PDF
PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale
Mu Yuan, Lan Zhang, Xuanke You, Xiang-Yang Li
ACM SIGCOMM 2023 (SIG Grant Award)
PDF | Slides | Code | Cite
Mu Yuan, Lan Zhang, Xuanke You, and Xiang-Yang Li. 2023. PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale. In Proceedings of the ACM SIGCOMM 2023 Conference (ACM SIGCOMM '23). Association for Computing Machinery, New York, NY, USA, 724–737. https://doi.org/10.1145/3603269.3604825
@inproceedings{yuan-packetgame,
author = {Yuan, Mu and Zhang, Lan and You, Xuanke and Li, Xiang-Yang},
title = {PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale},
year = {2023},
isbn = {9798400702365},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3603269.3604825},
doi = {10.1145/3603269.3604825},
booktitle = {Proceedings of the ACM SIGCOMM 2023 Conference},
pages = {724--737},
numpages = {14},
location = {New York, NY, USA},
series = {ACM SIGCOMM '23}
}
InFi: End-to-end Learnable Input Filter for Resource-efficient Mobile-centric Inference
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Xiang-Yang Li
ACM MobiCom 2022
PDF | Slides | Code | Bilibili | Cite
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, and Xiang-Yang Li. 2022. InFi: end-to-end learnable input filter for resource-efficient mobile-centric inference. In Proceedings of the 28th Annual International Conference on Mobile Computing And Networking (MobiCom '22). Association for Computing Machinery, New York, NY, USA, 228–241. https://doi.org/10.1145/3495243.3517016.
@inproceedings{yuan-infi,
author = {Yuan, Mu and Zhang, Lan and He, Fengxiang and Tong, Xueting and Li, Xiang-Yang},
title = {InFi: End-to-End Learnable Input Filter for Resource-Efficient Mobile-Centric Inference},
year = {2022},
isbn = {9781450391818},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3495243.3517016},
doi = {10.1145/3495243.3517016},
booktitle = {Proceedings of the 28th Annual International Conference on Mobile Computing And Networking},
pages = {228--241},
numpages = {14},
location = {Sydney, NSW, Australia},
series = {MobiCom '22}
}
Model Scheduling
MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li
IEEE TPAMI 2023
PDF | Cite
M. Yuan, L. Zhang, Z. Zheng, Y. -N. Zhang and X. -Y. Li, "MLink: Linking Black-Box Models From Multiple Domains for Collaborative Inference," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 10, pp. 12085-12097, Oct. 2023, doi: 10.1109/TPAMI.2023.3283780.
@ARTICLE{yuan-mlink-tpami,
author={Yuan, Mu and Zhang, Lan and Zheng, Zimu and Zhang, Yi-Nan and Li, Xiang-Yang},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
title={MLink: Linking Black-Box Models From Multiple Domains for Collaborative Inference},
year={2023},
volume={45},
number={10},
pages={12085--12097},
doi={10.1109/TPAMI.2023.3283780}}
Efficient Deep Ensemble Inference via Query Difficulty-dependent Task Scheduling
Zichong Li, Lan Zhang, Mu Yuan, Miao-Hui Song, Qi Song
IEEE ICDE 2023
MLink: Linking Black-box Models for Collaborative Multi-model Inference
Mu Yuan, Lan Zhang, Xiang-Yang Li
AAAI 2022 (Oral 4.5%)
PDF | Slides (20min version) | Slides (1min version) | Code | Bilibili | Cite
Mu Yuan, Lan Zhang, and Xiang-Yang Li. 2022. "MLink: Linking Black-Box Models for Collaborative Multi-Model Inference". Proceedings of the AAAI Conference on Artificial Intelligence 36 (9):9475-83. https://doi.org/10.1609/aaai.v36i9.21180.
@article{yuan-mlink,
title={MLink: Linking Black-Box Models for Collaborative Multi-Model Inference},
volume={36},
url={https://ojs.aaai.org/index.php/AAAI/article/view/21180},
DOI={10.1609/aaai.v36i9.21180},
number={9},
journal={Proceedings of the AAAI Conference on Artificial Intelligence},
author={Yuan, Mu and Zhang, Lan and Li, Xiang-Yang},
year={2022},
month={Jun.},
pages={9475--9483}
}
Adaptive Model Scheduling for Resource-efficient Data Labeling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Lin-Zhuo Yang, Hui Xiong
ACM TKDD 2022
PDF | Cite
Mu Yuan, Lan Zhang, Xiang-Yang Li, Lin-Zhuo Yang, and Hui Xiong. 2022. Adaptive Model Scheduling for Resource-efficient Data Labeling. ACM Trans. Knowl. Discov. Data 16, 4, Article 71 (August 2022), 22 pages. https://doi.org/10.1145/3494559.
@article{yuan-adms-tkdd,
author = {Yuan, Mu and Zhang, Lan and Li, Xiang-Yang and Yang, Lin-Zhuo and Xiong, Hui},
title = {Adaptive Model Scheduling for Resource-Efficient Data Labeling},
year = {2022},
issue_date = {August 2022},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
volume = {16},
number = {4},
issn = {1556-4681},
url = {https://doi.org/10.1145/3494559},
doi = {10.1145/3494559},
journal = {ACM Trans. Knowl. Discov. Data},
month = {jan},
articleno = {71},
numpages = {22},
keywords = {Model scheduling, reinforcement learning, data labeling}
}
Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong
IEEE ICDE 2020
PDF | Official Video | Bilibili | Slides | Cite
Mu Yuan, Lan Zhang, Xiang-Yang Li and Hui Xiong, "Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling," 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 2020, pp. 1858-1861, doi: 10.1109/ICDE48307.2020.00188.
@INPROCEEDINGS{yuan-adms-icde,
author={Yuan, Mu and Zhang, Lan and Li, Xiang-Yang and Xiong, Hui},
booktitle={2020 IEEE 36th International Conference on Data Engineering (ICDE)},
title={Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling},
year={2020},
volume={},
number={},
pages={1858--1861},
doi={10.1109/ICDE48307.2020.00188}
}
🎓 Doctoral Dissertation
Heterogeneous Collaborative Model Inference (异构协同模型推理)
Mu Yuan (袁牧)
CCF Doctoral Dissertation Award
CCF TCIoT Doctoral Dissertation Award
USTC Doctoral Dissertation Award
PDF
Services
Reviewer for ACM IMWUT, IEEE INFOCOM, IEEE TMC, IEEE IoTJ, AAAI, NeurIPS
Co-chair of ANAI Workshop 2025 (co-located with ACM MobiCom 2025)
Funds and Awards
2024-2025 National Natural Science Foundation of China, Grant No. 623B2093, RMB 300,000
2024 CCF Doctoral Dissertation Award (10 Nationwide) [Link]
2024 CCF TCIoT Doctoral Dissertation Award (4 Nationwide) [Link]
2024 USTC Doctoral Dissertation Award [Link]
2024 CAS President Award [Link]
2023 ByteDance Scholars Award (13 Nationwide) [Link]
2018 SenseTime Scholarship (22 Nationwide) [Link]
2018 Grand Prize (1 out of 1530 teams) of the 4th National University Cloud Computing Contest [Link]