Scholar

Zhifeng Kong

Google Scholar ID: jAOD1dsAAAAJ

Senior Research Scientist, NVIDIA

Deep Generative Models, Diffusion Models, Audio Foundation Models, Audio LM, Trustworthy ML

Citations & Impact

All-time

Citations: 3,053
H-index: 15
i10-index: 16
Publications: 20
Co-authors: 22 (list available)

Publications

13 items

Unified Audio Intelligence Without Regressing on Text Intelligence
Year: 2026, Cited: 0

Benchmarking Single-Factor Physical Video-to-Audio Generation
Year: 2026, Cited: 0

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
Year: 2026, Cited: 0

Music Flamingo: Scaling Music Understanding in Audio Language Models
Year: 2025, Cited: 0

UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
Year: 2025, Cited: 0

Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding
Year: 2025, Cited: 0

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Year: 2025, Cited: 0

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge
Year: 2025, Cited: 0

Co-authors

22 total

Bryan Catanzaro

Distinguished Research Scientist, NVIDIA

NVIDIA, UC Berkeley, CNMAT

Kamalika Chaudhuri

Research Scientist, NVIDIA

The Chinese University of Hong Kong