Scholar
Zhihang Yuan
Google Scholar ID: iipYHLoAAAAJ
Bytedance
Efficient AI
Model Compression
MLLM
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,001
H-index
21
i10-index
27
Publications
20
Co-authors
26
list available
Contact
No contact links provided.
Publications
33 items
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models
2026
Cited
0
AltTS: A Dual-Path Framework with Alternating Optimization for Multivariate Time Series Forecasting
2026
Cited
0
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning
2026
Cited
0
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
2025
Cited
0
OTARo: Once Tuning for All Precisions toward Robust On-Device LLMs
2025
Cited
0
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
2025
Cited
0
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
2025
Cited
0
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
2025
Cited
0
Load more
Resume (English only)
Co-authors
26 total
Guangyu Sun
School of Integrated Circuits, Peking University
Yuzhang Shang
Assistant Professor at University of Central Florida
Dawei Yang
Houmo AI
Bingzhe Wu
PKU Math-PKU CS-Tencent AI Lab-Shenzhen University
Yu Wang (汪玉)
Department of Electronic Engineering, Tsinghua University, China
Sifan Zhou
Southeast University
Yan Yan
University of Illinois Chicago
Guohao Dai
Associate Professor of Shanghai Jiao Tong University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up