XUESHEN-SMI 1.0 Thu, May 14, 2026

Research Runtime

Xueshen Liu

刘学深

Xueshen Liu profile photo
Driver version PhD-0.4.9
Mode Elastic LLM infra
Namespace UMICH CSE
Location Ann Arbor, MI

I build systems for cost efficient LLM training, inference, and reinforcement learning, focusing on designing elastic infrastructure to harvest heterogeneous resources.

LLM Large Language Model Infra Infrastructure Efficient Efficiency Elastic Elasticity Heter Heterogenity
PROCESS / GPU0 Education 3 workloads

GPU0 Education

academic milestones

workload
0 25 50 75 100 Aug 2018 Jan 2020 Jan 2021 Jan 2022 Jan 2023 Jan 2024 Jan 2025 May 2026 SJTU UMich UMich ICLR ICLR COLM

GPU1 Work

internships and industry experiences

hours
0 25 50 75 100 Apr 2024 Jan 2025 Jan 2026 Jun 2026 GM UMich Google Google CitSec

GPU2 Research

projects and publications

engagement
0 25 50 75 100 Apr 2022 Jan 2023 Jan 2024 Jan 2025 Jan 2026 May 2026 MM2-gb Plato LTE BCB CAKE NIPS HeterMoE RLBoost ICML COLM Foundry NSDI

GPU3 Blogs

sharing my learnings

posts
0 1 2 3 4 5 6 7 8 9 Feb 2024 Jan 2025 Jan 2026 May 2026 CUDA CUDA Triton CUDA CUDA PyTorch NCCL Triton Claude

Host Processes

16 May 2026 TALK Invited talk for Amazon Rufus AI lab Excited to give an invited talk, Towards Instantaneous Elasticity in LLM Infrastructure: From Harvesting Preemptible Resources to a General Cold-Start-Free Serving Stack.
15 May 2026 CONF Present RLBoost at NSDI'26 Excited to present RLBoost on NSDI'26 and meet talented researchers working on network and systems! 14 Apr 2026 PUB Foundry paper and code release Super excited to share Foundry! It is our recent work for fast LLM serving cold start via template-based CUDA graph context materialization. 13 Feb 2026 ROLE Incoming Citadel Securities QR internship Happy to share that I will work at Citadel Securities as Quantitative Researcher Intern for summer 2026 in Miami. 12 Dec 2025 PUB RLBoost accepted to NSDI'26 Happy to share that RLBoost has been accepted to NSDI'26. Wrapped up Student Researcher work with Systems Research @ Google! Huge thanks to all my collaborators. 11 Jul 2025 PUB Plato accepted to COLM'25 Happy to share that Plato has been accepted to COLM'25. This is our work on planning and parallel decoding for efficient LLM inference. 10 May 2025 PUB CAKE accepted to ICML'25 Excited to share that CAKE has been accepted to ICML'25! It is our work on reducing long-context prefill latency by overlapping KV-cache computation and loading. 09 May 2025 ROLE Systems Research @ Google Student Researcher Internship Excited to join Systems Research @ Google as a Student Researcher in Seattle, working on distributed RL systems for LLMs. 08 Dec 2024 CONF LTE NeurIPS spotlight Honored to share that LTE was presented as a Spotlight at NeurIPS'24. This work explores structured sparsity and efficient sparse FFN kernels for LLMs. 07 Oct 2024 CONF mm2-gb ACM BCB oral Happy to share that mm2-gb was selected as an Oral at ACM BCB'24. It is our work on GPU-accelerated minimap2 for long-read DNA mapping.
06 Sep 2024 ROLE CSE 589 Graduate Student Instructor Happy to start as Graduate Student Instructor for CSE 589 Advanced Computer Networks at the University of Michigan.
05 Aug 2024 TALK Invited talk for General Motors Research Excited to give an invited talk, Scalable & Latency-tolerant Edge/Cloud Computing via Deep Factor Graph.
04 May 2024 ROLE General Motors CAV Lab Research Intern Excited to intern in the Connected Autonomous Vehicle Lab at General Motors, working on latency-tolerant edge/cloud positioning systems.
03 May 2024 TALK Invited talk at AMD HPC Apps Knowledge Sync Excited to give an invited talk on Minimap2-gigabases (mm2-gb) at AMD HPC Apps Knowledge Sync.
02 Aug 2021 AWARD Roger King Scholarship Honored to receive the Roger King Scholarship from the College of Engineering at the University of Michigan.
01 Aug 2019 AWARD Robomaster Final Competition Happy to share that our team won Runner-up Team and Grand Prize at the Robomaster Final Competition.