Yutao Mou
Scholar

Yutao Mou

Google Scholar ID: f71f5YkAAAAJ
Peking University
AI SafetyLLM Alignment
Citations & Impact
All-time
Citations
270
 
H-index
9
 
i10-index
9
 
Publications
20
 
Co-authors
11
list available
Resume (English only)
Academic Achievements
  • - Publications:
  • 1. SaRO: Enhancing LLM Safety through Reasoning-based Alignment
  • 2. Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective
  • 3. SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
  • 4. UEGP: Unified Expert-Guided Pre-training for Knowledge Rekindle
  • 5. Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery
Research Experience
  • - KCL Lab, Peking University, PhD student, supervised by Prof. Wei Ye and Prof. Shikun Zhang
Education
  • - Beijing University of Posts and Telecommunications, Master's, 2024
  • - Beijing University of Posts and Telecommunications, Bachelor's, 2021
  • - Supervisors: Prof. Wei Ye and Prof. Shikun Zhang
Background
  • - Research Interests: Building safe, reliable, and scalable artificial intelligence systems
  • - Main Research Areas:
  • 1. Safety Evaluation and Red Teaming of Large Language Models (LLMs)
  • 2. Post-Training and Safety Alignment of LLMs
Miscellany
  • Contact: Email / Scholar / Github