- SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting, accepted at ISCA 2025
- DVD-Quant: Data-free Video Diffusion Transformers Quantization, under review
Research Experience
Worked closely with Prof. Guohao Dai and Prof. Yulun Zhang on research projects.
Education
Undergraduate student in the Department of EE, Shanghai Jiao Tong University (SJTU), 2023.9 - Present. Advisors: Prof. Guohao Dai, Prof. Yulun Zhang.
Background
Research Interests: Acceleration and compression of LLMs, DiTs, and dLLMs through techniques such as quantization, caching mechanisms, and specialized algorithmic methods.