Scholar
Zhifeng Kong
Google Scholar ID: jAOD1dsAAAAJ
Senior Research Scientist, NVIDIA
Deep Generative Models
Diffusion Models
Audio Foundation Models
Audio LM
Trustworthy ML
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
3,053
H-index
15
i10-index
16
Publications
20
Co-authors
22
list available
Contact
No contact links provided.
Publications
12 items
Benchmarking Single-Factor Physical Video-to-Audio Generation
2026
Cited
0
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
2026
Cited
0
Music Flamingo: Scaling Music Understanding in Audio Language Models
2025
Cited
0
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
2025
Cited
0
Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding
2025
Cited
0
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
2025
Cited
0
Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge
2025
Cited
0
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
2025
Cited
0
Load more
Resume (English only)
Co-authors
22 total
Bryan Catanzaro
NVIDIA
Wei Ping
Distinguished Research Scientist, NVIDIA
Rafael Valle
NVIDIA, UC Berkeley, CNMAT
Co-author 4
Kamalika Chaudhuri
FAIR @ Meta
Arushi Goel
Research Scientist, NVIDIA
Co-author 7
Dahua Lin
The Chinese University of Hong Kong
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up