Lijie (Derrick) Yang

School of Computer Science, Carnegie Mellon University

prof_pic.jpg

I am a senior Computer Science student at Carnegie Mellon University, with double concentrations in computer systems and machine learning. Currently, I’m a research assistant at the Catalyst Lab, where I’m fortunate to work under the guidance of Prof. Zhihao Jia and Prof. Tianqi Chen.

Before this, I conducted research with Prof. Ye-Qiong Song at LORIA lab on real-time networks, and with Prof. Umut Acar on developing language model serving systems. Additionally, I’ve contributed to open-source project at MIT’s HAN Lab, focusing on on-device LLM inference optimization.

Research Interests: My work lies at the intersection of builing efficient deep learning systems with algorithmic innovations. I’m particularly passionate about accelerating large language models, optimizing on-device AI, and enhancing network reliability for real-time applications.

news

Jan 20, 2025 TidalDecode is accepted to ICLR 2025, see you in Singapore :tada:!
Nov 14, 2024 Gave a talk at CMU Catalyst Lab on TidalDecode
Oct 08, 2024 Our project on position-persistent sparse attention (PPSA), TidalDecode, is on ArXiv!
Aug 24, 2024 Honored to be an early inductee into Phi Beta Kappa (ΦΒΚ) of Class 2025 :tada:!
Jul 03, 2024 BWE is accepted to LCN 2024 :tada:!
May 01, 2024 RalmSpec is accepted to ICML 2024 :tada:!
Mar 03, 2024 SpecInfer is accepted to ASPLOS 2024 :tada:!

selected publications

  1. ICLR
    TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
    Lijie Yang*, Zhihao Zhang*, Zhuofu Chen, Zikun Li, and Zhihao Jia
    to appear at International Conference on Learning Representations, 2025
  2. LCN
    Blocking-Waived Estimation: Improving the Worst-Case End-To-End Delay Analysis in Switched Ethernet
    Lijie Yang, Théo Docquier, Ludovic Thomas, and Ye-Qiong Song
    In Proceedings of Local Computer Networks, 2024
  3. ICML
    Accelerating Retrieval-augmented Language Model Serving with Speculation
    Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, and Zhihao Jia
    In Proceedings of International Conference on Machine Learning, 2024
  4. ASPLOS
    SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification
    Xupeng Miao*, Gabriele Oliaro*, Zhihao Zhang*, Xinhao Cheng*, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, and 6 more authors
    In Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, Apr 2024