
Zhifeng Kong

Google Scholar ID: jAOD1dsAAAAJ
Senior Research Scientist, NVIDIA
Deep Generative Models, Diffusion Models, Audio Foundation Models, Audio LM, Trustworthy ML
Citations & Impact
All-time
Citations: 3,053
H-index: 15
i10-index: 16
Publications: 20
Co-authors: 22 (list available)
Publications
12 items
Benchmarking Single-Factor Physical Video-to-Audio Generation (2026). Cited: 0
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music (2026). Cited: 0
Music Flamingo: Scaling Music Understanding in Audio Language Models (2025). Cited: 0
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning (2025). Cited: 0
Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding (2025). Cited: 0
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models (2025). Cited: 0
Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge (2025). Cited: 0
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities (2025). Cited: 0
Co-authors
22 total
Bryan Catanzaro (NVIDIA)
Wei Ping (Distinguished Research Scientist, NVIDIA)
Rafael Valle (NVIDIA, UC Berkeley, CNMAT)
Co-author 4
Kamalika Chaudhuri (FAIR @ Meta)
Arushi Goel (Research Scientist, NVIDIA)
Co-author 7
Dahua Lin (The Chinese University of Hong Kong)
