Hi there!
I am Xuanlei Zhao, a third-year PhD student in Computer Science at the National University of Singapore (where I also completed my master’s studies), advised by Yang You. I obtained my bachelor’s degree in CS & EE from Huazhong University of Science and Technology. Previously, I interned at Tencent Hunyuan with Kai Wang, Adobe Research with Yan Kang and Yuanjun Xiong, Pika with Chenlin Meng, and Colossal-AI with Jiarui Fang.
My current research mainly focuses on efficient AI, including:
- Efficient parameter generation, for scaling and customizing foundation models.
- Efficient diffusion and autoregressive models, e.g., for video generation.
- Efficient machine learning systems, with parallelism and low-level optimization.
- Co-optimization of algorithm and infrastructure.
📝 Selected Publications (all)
🕹️ Efficient Parameter Generation
- HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
  Tencent HY Team
- NeurIPS 2025. Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
  Zhiyuan Liang*, Dongwen Tang, Yuhao Zhou, Xuanlei Zhao, Mingjia Shi, Wangbo Zhao, Zekai Li, Peihao Wang, Konstantin Schürholt, Damian Borth, Michael M. Bronstein, Yang You, Zhangyang Wang*, Kai Wang*
🎬 Efficient Video Generation
- ICLR 2025. Real-Time Video Generation with Pyramid Attention Broadcast
  Xuanlei Zhao*, Xiaolong Jin*, Kai Wang*†, Yang You†
- ICML 2025. DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
  Xuanlei Zhao, Shenggan Cheng, Chang Chen, Zangwei Zheng, Ziming Liu, Zheming Yang, Yang You
- Training Variable Sequences with Data-Centric Parallel
  Geng Zhang*, Xuanlei Zhao*, Kai Wang†, Yang You†
⚙️ Efficient System Optimization
- ICLR 2024. AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference
  Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You
- MLSys 2024. HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices
  Xuanlei Zhao*, Bin Jia*, Haotian Zhou*, Ziming Liu, Shenggan Cheng, Yang You
- PPoPP 2024. FastFold: Optimizing AlphaFold Training and Inference on GPU Clusters
  Shenggan Cheng, Xuanlei Zhao, Guangyang Lu, Jiarui Fang, Tian Zheng, Ruidong Wu, Xiwen Zhang, Jian Peng, Yang You
💡 Open-Source Projects
- HY-WU (Lead for algo and infra): An Extensible Functional Neural Memory Framework
- VideoSys (Project Lead): An Easy and Efficient System for Video Generation
- Colossal-AI (Top Contributor): Making large AI models cheaper, faster and more accessible
- FastFold (Top Contributor): Optimizing AlphaFold Training and Inference on GPU Clusters
💻 Internships
- 2025.12 - 2026.03, Tencent Hunyuan with Kai Wang.
- 2025.07 - 2025.11, Adobe Research with Yan Kang and Yuanjun Xiong.
- 2024, Pika with Chenlin Meng.
- 2022.07 - 2023.12, Colossal-AI with Jiarui Fang and Shenggui Li.
📖 Education
- 2024.01 - now, PhD in Computer Science, National University of Singapore
- 2022.08 - 2023.12, Master's in Computer Science, National University of Singapore
- 2018.09 - 2022.06, Bachelor's in Computer Science & Electrical Information, Huazhong University of Science and Technology
💬 Invited Talks
- 2024.07, Real-Time Video Generation with Pyramid Attention Broadcast, Ventures [video]
- 2024.07, Speedup for Video Generation, ByteDance internal talk