Publications
Has published numerous papers at top conferences such as CVPR, ICML, and NeurIPS, including 'Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts' and 'Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference'.
Research Experience
Worked as a research intern at Google Mobile Vision on neural architecture search (MobileDets); also completed research internships at Facebook AI and Amazon Lab126.
Education
Ph.D. in Computer Sciences from the University of Wisconsin-Madison, advised by Vikas Singh.
Background
Currently a senior research scientist at Meta Reality Labs. Research interests include language modeling, efficient Transformers, and neural architecture search, with a particular focus on efficient AI.