here is a photo of me

Bio

Dr. Jiaao He receieved his PhD degree from the Department of Comperter Science and Technology, Tsinghua University in 2025, where he also received his bachelor degree in 2020. His research interests include designing and accelerating distributed systems for tensors with sparsity. He has developed FastMoE, the world’s first open-source distributed training framework for Mixture-of-Experts models based on PyTorch. He has a few papers published on ASPLOS, PPoPP, and other conferences. He used to be the team leader of the champion-winning Tsinghua Student Cluster Competition Team.

Industrial

ByteDance Inc., March 2025 - Present

Research Scientist on Infrastructure.

Internships

PAI, Alibaba Group, July-Aug 2020

Research Intern.

DPS, Sensetime Research, April 2018 - April 2019

Intern Researcher

Cloud Beaver Corporation, Oct. 2016 - Dec. 2016

Intern full-stack web developer.

Academic

PACMAN Lab, Dept. CS, Tsinghua Univ., Oct. 2017 - June, 2025

Advisor: Prof. Jidong Zhai.

  • Research Intern (2017-2020)
  • Member / Team Leader of the Student Cluster Competition Team (2018-2019)
  • PhD Student (2020-2025)

ALCHEM Lab, Dept. ECE, USC, July 2019 - Sep 2019

Summer intern advised by Prof. Xuehai Qian.

Publications

Main Contributions

  • Jiaao He, Shengqi Chen, Kezhao Huang, Jidong Zhai, HypeReca: Distributed Heterogeneous In-Memory Embedding Database for Training Recommender Models (ATC'25)

  • Jiaao He, Kezhao Huang, Jidong Zhai, FastDecode: High-Throughput LLM Serving through Disaggregating Attention Computation (ICML'24 Workshop LCFM)

  • Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, et al., FasterMoE: Modeling and Optimizing Training of Large-Scale Dynamic Pre-Trained Models (PPoPP'22)

  • Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, et al., BAGUALU: Targeting Brain Scale Pretrained Models with over 37 Million Cores (PPoPP'22)

  • Qinyi Luo, Jiaao He (co-first), Youwei Zhuo, Xuehai Qian, Prague: High-Performance Heterogeneity-Aware Asynchronous Decentralized Training (ASPLOS'20)

Works Involved

  • Mingyu Xu, Tenglong Ao, Jiaao He, Jianqiao Lu, Guang Shi, Shu Zhong, DeltaFormer: Unlock the state space of Transformer(NeurIPS'25)

  • Haoyu Yang, Zan Zong, Yuyang Jin, Kinman Lei, Jiaao He, Qigang Yang, Jidong Zhai, UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling(SC'25)

  • Kezhao Huang, Siqi Zhu, Mingshu Zhai, Liyan Zheng, Kinman Lei, Jiaao He, Yuyang Jin, Jidong Zhai, mTuner: Accelerating Parameter-Efficient Fine-Tuning on Multi-GPU Servers with Elastic Tensor (ATC'25)

  • Mingshu Zhai, Jiaao He, Zixuan Ma, Zan Zong, Runqing Zhang, Jidong Zhai, SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization (ATC'23)

  • Zixuan Ma, Haojie Wang, Guanyu Feng, Chen Zhang, Lei Xie, Jiaao He, Shengqi Chen, Jidong Zhai, Efficiently emulating high-bitwidth computation with low-bitwidth hardware (ICS'22)

  • Chen Zhang, Chenggang Zhao, Jiaao He, et.al., Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From Tsinghua University (ITPDS'21)

  • Jiaao He, Chenggang Zhao, et.al., Student Cluster Competition 2018, Team Tsinghua University: Reproducing the SeisSol Optimization on Intel Skylake Architecture (PARCO'19)

Preprints & Posters

  • Team Seedance, Seedance 2.0: Advancing Video Generation for World Complexity(2604.14148)

  • Team Seedream, Seedream 4.0: Toward next-generation multimodal image generation (2509.20427)

  • Jiaao He, Shengqi Chen, Jidong Zhai, POSTER: Pattern-Aware Sparse Communication for Scalable Recommendation Model Training (PPoPP'24)

  • Jiaao He, Jidong Zhai, FastDecode: High-Throughput GPU-Efficient LLM Serving using Heterogeneous Pipelines (2403.11421)

  • Sha Yuan, …, Jiaao He, et al., A Roadmap for Big Model (2203.14101) (Shame on the plagiarism of certain co-authors in this paper)

  • Jiaao He, Jiezhong Qiu, et. al., FastMoE: A Fast Mixture-of-Expert Training System (2103.13262)

Teaching

  • Data structure and algorithm, Prof. Yuchun Ma, TA, spring 2023
  • Advanced Programming, Prof. Yuchun Ma, TA, fall 2022
  • Data structure and algorithm, Prof. Yuchun Ma, TA, spring 2022
  • Data structure and algorithm, Prof. Yuchun Ma, TA, spring 2021
  • Data structure, Prof. Hong Wang and Dr. Wentao Han, TA, spring 2021
  • Introduction to HPC, Prof. Jidong Zhai, TA, spring 2021

Misc

QBXT Education Corporation, Aug. 2016 - Oct. 2019

As a lecturer for Olympiad Informatics, give more than 20 lectures (more than 100 hours) to high school students in programming language, algorithm and data structure in OI contests.

Student Association of Algorithm and Contest, Dept. CS, Tsinghua, Sep. 2016 - Sep. 2017

Vice President, Head of the Platform and System Group. Designer and developer of the TUOJ online judge system.

Awards and Honors

  • Champion in Student Cluster Competition at SC'19 as the team leader, Denver, CO. 2019.

  • CCF Elite Colligate Award, China Computer Foundation, 2019.

  • Scholarship for techniques, CS Dept. Tsinghua. 2019.

  • Second place in Student Cluster Competiton at ISC'19 as the team leader, Frankfurt, Germany. 2019.

  • Second place in ASC'19 as the team leader, Dalian, China. 2019.

  • Outstanding Intern Reward in Sensetime Research, Beijing, China, 2018.

  • Champion in Student Cluster Competition at SC'18, Dallas, TX. 2018.

  • Scholarship for techniques, CS Dept. Tsinghua. 2018.

  • Champion in Student Cluster Competiton at ISC'18, Frankfurt, Germany. 2018.

  • Champion in ASC'18, Nanchang, China, 2018.

  • Star of 9#, Dept. CS, Tsinghua, Beijing, China, 2017.

  • Scholarship for techniques, CS Dept. Tsinghua. 2017.

  • 2nd place, Lanqiao International Programming Competition, Princeton, NJ, 2017.

  • Gold Medal (2nd place) in ACM-ICPC Asia Regional Contest, Qingdao, China, 2016.

  • Silver Medal in National Olympiad Informatics (NOI), Hangzhou, China, 2015.

  • Gold Medal in Asia-Pacific Informatics Olympiad (APIO), Beijing, China, 2015.

Social Services

  • President of the Students’ Cycling Club, Tsinghua University. Aug.2018 - June.2019

  • International Volunteer for Turtle protection, Sri Lanka, Aug.2017

Hobbies

Jiaao has been obsessed with high performance throughout his entire life. As an avid cyclist since 2017, the current activities focus on road and gravel cycling in the mountains, as well as trail running. He formerly competed in triathlons and downhill mountain biking, notably winning 3rd place in his age group at the 2018 Weihai Triathlon World Cup. Beyond that, you may also find Jiaao pushing limits on the racetrack—whether driving race cars, racing go-karts, or tearing up rally courses on the simulator.