- Distributionally Robust Optimization For Language Modeling
- Optimizing Language Models for Human Preferences is a Causal Inference Problem
- Token-level Direct Preference Optimization
- SimPO: Simple Preference Optimization with a Reference-Free Reward
- KL Divergence: Forward vs Reverse?
Research Experience
Research internship at Microsoft Research Asia (MSRA).
Education
Master's student at Zhejiang University, advised by Assoc. Prof. Kun Kuang and Prof. Fei Wu.
Background
My research interests include model compression (data-free knowledge distillation, out-of-domain knowledge distillation, etc.), domain adaptation, and large-small model collaboration. I currently focus on large language models (LLMs), especially reinforcement learning for LLMs.
Miscellany
The personal website uses the Chirpy theme for Jekyll.