Scholar
Yingfa Chen
Google Scholar ID: IgPWvEQAAAAJ
PhD at Tsinghua University
machine learning
long-context modeling
language modeling
Homepage
Google Scholar
Citations & Impact (all-time)
Citations: 434
H-index: 8
i10-index: 7
Publications: 14
Co-authors: 0
Contact
Twitter
GitHub
Publications
8 items
Student-in-the-Loop Chain-of-Thought Distillation via Generation-Time Selection (2026) · Cited: 0
MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling (2026) · Cited: 0
Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts (2026) · Cited: 1
StateX: Enhancing RNN Recall via Post-training State Expansion (2025) · Cited: 0
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity (2025) · Cited: 0
Cost-Optimal Grouped-Query Attention for Long-Context LLMs (2025) · Cited: 0
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity (arXiv.org, 2024) · Cited: 1
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling (arXiv.org, 2024) · Cited: 2
Resume (English only)
Academic Achievements
Contributed to the COLING 2024 paper 'Robust and Scalable Model Editing for Large Language Models'.
Research Experience
Currently working on large language models, long-context modeling, and continual learning.
Education
PhD student at Tsinghua University, advised by Prof. Zhiyuan Liu.
Background
NLP PhD student at Tsinghua University, interested in LLM architectures, long-context modeling, and continual learning.
Miscellany
Sports: Badminton, jogging, etc.
Co-authors
0 total (list not available)