Scholar
Cong Huang
Google Scholar ID: My24T6cAAAAJ
University of Science and Technology of China
Image/Video processing
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
82
H-index
4
i10-index
2
Publications
6
Co-authors
6
list available
Contact
No contact links provided.
Publications
10 items
3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models
2026
Cited
0
CyboRacket: A Perception-to-Action Framework for Humanoid Racket Sports
2026
Cited
0
Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation
2026
Cited
0
ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning
2026
Cited
0
A$^2$-LLM: An End-to-end Conversational Audio Avatar Large Language Model
2026
Cited
0
LangForce: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries
2026
Cited
2
TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers
2026
Cited
2
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence
2025
Cited
0
Load more
Resume (English only)
Co-authors
6 total
Yan Lu
Microsoft Research Asia
Dong Liu
University of Science and Technology of China
Jiahao Li
Microsoft Research Asia
Bin Li
Microsoft Research
Tao Hu
Research Scientist, PICO,Bytedance
Xiulian Peng
Researcher at Microsoft Research Asia
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up