Scholar

Dongyang Fan

Google Scholar ID: U7yzfCkAAAAJ

EPFL

machine learningLLMs

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

36

H-index

3

i10-index

2

Publications

11

Co-authors

10

list available

Contact

Emaildongyang.fan@epfl.ch CVOpen ↗TwitterOpen ↗GitHubOpen ↗LinkedInOpen ↗

Publications

7 items

HalluHard: A Hard Multi-Turn Hallucination Benchmark

2026

Cited

0

Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining

2025

Cited

0

TiMoE: Time-Aware Mixture of Language Experts

2025

Cited

0

URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training

2025

Cited

0

Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs

2025

Cited

0

From Fairness to Truthfulness: Rethinking Data Valuation Design

2025

Cited

0

On-device Collaborative Language Modeling via a Mixture of Generalists and Specialists

arXiv.org · 2024

Cited

1

Resume (English only)

Academic Achievements

Published multiple papers such as 'Apertus: Democratizing Open and Compliant LLMs for Global Language Environments', 'TiMoE: Time-Aware Mixture of Language Experts', 'URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training', 'Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs', etc. Won oral presentation award at COLM 2025.

Research Experience

Involved in multiple research projects, including Apertus team's pretraining work, TiMoE: Time-Aware Mixture of Language Experts, URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training, etc.

Education

4th-year PhD student at Machine Learning and Optimization Lab at EPFL, supervised by Prof. Martin Jaggi.

Background

Research interests include: Data-Efficient Language Modeling, Mixture-of-Experts architectures, Decentralized training methods, Accelerating LLM pretraining through metadata conditioning, Responsible Language Modeling, Data-compliant pretraining by respecting owners’ opt-out choices, Designing compensation frameworks for data contributors, Understanding and mitigating model hallucinations.

Miscellany

Likes arts and cultural stuff, enjoys outdoor activities like hiking, skiing, and sailing. Also paints from hiking trips.

Co-authors

10 total

Bettina Messmer

Celestine Mendler-Dünner

ELLIS Institute & Max Planck Institute for Intelligent Systems, Tübingen

PhD Student, EPFL

Sai Praneeth Karimireddy

Antoine Bosselut

Matin Ansaripour