Siyuan Chen (陈思元)

I am a second year Ph.D. student at computer science department at Carnegie Mellon University, happily advised by Phil Gibbons and Heather Miller, and I am closely working with Ben Titzer. I am a member of PDL and Catalyst at CMU. I finished my undergraduate at Turning Class of Peking University, where I was fortunately advised by Yun (Eric) Liang.

My research interests broadly focus on developing system support for machine learning. Specifically, I am interested in exploring system solutions for the efficient execution of irregular machine learning tasks. This includes optimizing the batching of dynamic neural networks (ED-Batch, ICML23’) and automating scheduling techniques for large language model (LLM) serving. Additionally, I am concentrating on machine learning at the edge, which involves memory-efficient and high-precision fine-tuning on commodity hardware through compressed offloading (LSP-Offload, Under Submission), as well as enabling machine learning deployment in the browser via WebGPU.

Previously, I am deeply engaged in hardware-aware optimizations for machine learning. I worked on performance modeling of data movement for tensor programs (Chimera, HPCA23’), mapping mechanism from DNN to SoC (COMB, DAC23’), and analytical-based simulator for fused program on general hardware (TileFlow, MICRO23’).

News

  • Jun. 2024 I begin the internship as student researcher in SystemResearch@Google (SRG) working on LLM serving, hosted by Samira Khan.
  • Sept. 2023 I am starting CS PhD at Carnegie Mellon University, co-advised by Phil Gibbons and Heather Miller. Hope for a stimulating and fruitful journey at Pittsburgh!
  • Aug. 2023 TileFlow is publicly available!
  • Apr. 2023 ED-Batch is accepted to ICML23’, thanks for the mentor and professors!

Publications

(*Equal Contribution)

  • TileFlow: A Framework for Modeling Fusion Dataflow via Tree-based Analysis. Size Zheng, Siyuan Chen, Siyuan Gao, Liancheng Jia, Guangyu Sun, Runsheng Wang, Yun Liang. MICRO 2023. PDF

  • ED-Batch: Efficient Automatic Batching of Dynamic Deep Neural Networks via Finite State Machine. Siyuan Chen, Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. ICML 23’. PDF Code Poster Video.

  • Memory and Computation Coordinated Mapping of DNNs onto Complex Heterogeneous SoC. Size Zheng, Siyuan Chen, Yun Liang. The 60th Design Automation Conference (DAC), July 2023. PDF

  • Chimera: An Analytical Optimizing Framework for Effective Compute-intensive Operators Fusion. Size Zheng*, Siyuan Chen*, Pedi Song, Renze Chen, Xiuhong Li, Shengen Yan, Dahua Lin, Jingwen Leng, Yun Liang. 29th international symposium on High Performance Computer Architecture (HPCA), February 2023. PDF.

Under Submission

  • Practical Offloading for Fine-Tuning LLM on Commodity GPU via Learned Sparse Projectors Siyuan Chen, Zhuofeng Wang, Zelong Guan, Yudong Liu, Phillip B. Gibbons. PDF