Scholar
Xianhang Li
Google Scholar ID: YKpFz4YAAAAJ
Ph.D. in UCSC
Computer Vision
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,273
H-index
15
i10-index
17
Publications
20
Co-authors
14
list available
Contact
Email
xianhang710@gmail.com
CV
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
7 items
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
2026
Cited
0
Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers
2025
Cited
0
A New Benchmark for Evaluating Code Translation with Third-Party Libraries
2025
Cited
0
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning
2025
Cited
0
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
2025
Cited
0
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
arXiv.org · 2024
Cited
16
Scaling White-Box Transformers for Vision
arXiv.org · 2024
Cited
3
Resume (English only)
Academic Achievements
Paper accepted at NeurIPS 2024: 'Scaling White-Box Transformers for Vision'
Paper accepted at TMLR 2024: 'Unleashing the Power of Visual Prompting At the Pixel Level'
Two papers accepted at CVPR 2024: 'Revisiting Adversarial Training at Scale' and 'Learning to Bootstrap for Combating Label Noise'
Paper accepted at NeurIPS 2023: 'An Inverse Scaling Law for CLIP Training'
Paper accepted at ECCV 2022: 'In Defense of Image Pre-Training for Spatiotemporal Recognition'
Paper accepted at WACV 2022: 'Pose-guided Generative Adversarial Net for Novel View Action Synthesis'
Paper accepted at ICLR 2021: 'CT-Net: Channel Tensorization Network for Video Classification'
Paper accepted at CVPR 2020: 'SmallBigNet: Integrating Core and Contextual Views for Video Classification'
Additional publications at ICLR 2021, NeurIPS ML Safety Workshop 2022, etc.
Awarded the Jack Baskin and Peggy Downes-Baskin Fellowship in 2024 (sole recipient)
Released Recap-DataComp-1B in 2024, recaptioning 1.3 billion images from DataComp-1B using an LLaMA-3-powered LLaVA model
Co-authors
14 total
Yuyin Zhou
Assistant Professor, Computer Science and Engineering, Genomics Institute, UC Santa Cruz
Cihang Xie
Assistant Professor, University of California, Santa Cruz
Jieru Mei
Google
Alan Yuille
Professor of Cognitive Science and Computer Science, Johns Hopkins University
Zeyu Wang
PhD Student, University of California, Santa Cruz
Yali Wang
Professor, Shenzhen Institutes of Advanced Technology,Chinese Academy of Sciences
Yu Qiao
Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS
Chen Wei
Assistant Professor, Rice University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up