Scholar

Yutao Mou

Google Scholar ID: f71f5YkAAAAJ

Peking University

AI SafetyLLM Alignment

Citations & Impact

All-time

Citations

270

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

5 items

2026

Cited

2025

Cited

2025

Cited

2025

Cited

2025

Cited

Resume (English only)

Academic Achievements

- Publications:
1. SaRO: Enhancing LLM Safety through Reasoning-based Alignment
2. Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective
3. SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
4. UEGP: Unified Expert-Guided Pre-training for Knowledge Rekindle
5. Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery

Research Experience

- KCL Lab, Peking University, PhD student, supervised by Prof. Wei Ye and Prof. Shikun Zhang

Education

Background

- Research Interests: Building safe, reliable, and scalable artificial intelligence systems
- Main Research Areas:
1. Safety Evaluation and Red Teaming of Large Language Models (LLMs)
2. Post-Training and Safety Alignment of LLMs

Miscellany