Lijie (Derrick) Yang

I am a senior Computer Science student at Carnegie Mellon University, with double concentrations in computer systems and machine learning. Currently, I’m a research assistant at the Catalyst Lab, where I’m fortunate to work under the guidance of Prof. Zhihao Jia and Prof. Tianqi Chen.
Before this, I conducted research with Prof. Ye-Qiong Song at LORIA lab on real-time networks, and with Prof. Umut Acar on developing language model serving systems. Additionally, I’ve contributed to open-source project at MIT’s HAN Lab, focusing on on-device LLM inference optimization.
Research Interests: My work lies at the intersection of builing efficient deep learning systems with algorithmic innovations. I’m particularly passionate about accelerating large language models, optimizing on-device AI, and enhancing network reliability for real-time applications.
news
Jan 20, 2025 | TidalDecode is accepted to ICLR 2025, see you in Singapore ![]() |
---|---|
Nov 14, 2024 | Gave a talk at CMU Catalyst Lab on TidalDecode |
Oct 08, 2024 | Our project on position-persistent sparse attention (PPSA), TidalDecode, is on ArXiv! |
Aug 24, 2024 | Honored to be an early inductee into Phi Beta Kappa (ΦΒΚ) of Class 2025 ![]() |
Jul 03, 2024 | BWE is accepted to LCN 2024 ![]() |
May 01, 2024 | RalmSpec is accepted to ICML 2024 ![]() |
Mar 03, 2024 | SpecInfer is accepted to ASPLOS 2024 ![]() |