Scholar

Haoyu Wang

Google Scholar ID: l_QMtXYAAAAJ

Tsinghua University

Reinforcement LearningLarge Language ModelSafety Alignment

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

3 items

2026

Cited

2026

Cited

2026

Cited

Resume (English only)

Academic Achievements

- 2025 NeurIPS Paper: 'Lifelong Safety Alignment for Language Models'
- 2025 ICML Paper: 'Safety Reasoning with Guidelines'
- 2025 ICML Paper: 'Mastering Massive Multi-Task Reinforcement Learning via MoE Decision Transformer'
- 2024 ICML MHFAIA Workshop Paper: 'Step-on-feet Tuning: Scaling Self-alignment of LLMs via Bootstrapping'
- TMLR Journal Paper: 'Are large language models really robust to word-level perturbations?'
- 2023 NeurIPS Paper: 'Learning better with less: Effective augmentation for sample-efficient visual reinforcement learning'
- Honors and Awards: XJTU Excellent Student Scholarship, Tsinghua Comprehensive Excellence Scholarship, Tsinghua Big Data Practice Scholarship

Research Experience

- 2023.09 - 2024.10, Internship at Tencent, Advised by: Peilin Zhao
- 2024.10 - 2025.07, Associate Member at Sea AI Lab, Collaborated with: Tianyu Pang, Li Shen, Dacheng Tao

Education

Background

Research Interest: AI alignment, especially ensuring the safety of LLM and using synthetic data to help LLM self-improve. Professional Field: Electrical Engineering and Automation.

Miscellany