Lijie (Derrick) Yang

I am an incoming CS PhD student at Princeton University. I obtained my bachelor degree in Computer Science from Carnegie Mellon. At CMU, I’m fortunate to be advised by Prof. Zhihao Jia and work closely with Prof. Tianqi Chen.
Before this, I worked with Prof. Ye-Qiong Song at LORIA lab on real-time networks, and with Prof. Umut Acar on developing language model serving systems. Additionally, I’ve contributed to open-source project at MIT’s HAN Lab, focusing on on-device LLM inference optimization.
Research Interests: My work lies at the intersection of builing efficient deep learning systems with both algorithmic and compiler optimizations. I’m particularly passionate about exploring and expanding the potential (both efficiency- and accuracy-wise) of state-of-the-art AI models like language models in reasoning and long-context tasks.
news
Jan 20, 2025 | TidalDecode is accepted to ICLR 2025, see you in Singapore ![]() |
---|---|
Nov 14, 2024 | Gave a talk at CMU Catalyst Lab on TidalDecode |
Oct 08, 2024 | Our project on position-persistent sparse attention (PPSA), TidalDecode, is on ArXiv! |
Aug 24, 2024 | Honored to be an early inductee into Phi Beta Kappa (ΦΒΚ) of Class 2025 ![]() |
Jul 03, 2024 | BWE is accepted to LCN 2024 ![]() |
May 01, 2024 | RalmSpec is accepted to ICML 2024 ![]() |
Mar 03, 2024 | SpecInfer is accepted to ASPLOS 2024 ![]() |