Scholar

Zhihang Yuan

Google Scholar ID: iipYHLoAAAAJ

Bytedance

Efficient AIModel CompressionMLLM

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

2,001

H-index

21

i10-index

27

Publications

20

Co-authors

26

list available

Contact

No contact links provided.

Publications

33 items

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

2026

Cited

0

AltTS: A Dual-Path Framework with Alternating Optimization for Multivariate Time Series Forecasting

2026

Cited

0

Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning

2026

Cited

0

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

2025

Cited

0

OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs

2025

Cited

0

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

2025

Cited

0

LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs

2025

Cited

0

Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning

2025

Cited

0

Resume (English only)

Co-authors

26 total

School of Integrated Circuits, Peking University

Assistant Professor at University of Central Florida

PKU Math-PKU CS-Tencent AI Lab-Shenzhen University

Yu Wang (汪玉)

Department of Electronic Engineering, Tsinghua University, China

Southeast University

University of Illinois Chicago

Associate Professor of Shanghai Jiao Tong University