Scholar

Zhi Zhong

Google Scholar ID: iRVT3A8AAAAJ

Sony

Audio, Representation Learning, Music Technology, AI-based Contents Creation, Deep Generative Models


Citations & Impact

All-time

Citations: 128
H-index: 6
i10-index: 4
Publications: 19
Co-authors: 39


Publications

10 items

MusTBENCH: Benchmarking and Advancing Temporal Grounding in Music LLMs
2026. Citations: 0

Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis
2026. Citations: 0

Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
2026. Citations: 0

Do Foundational Audio Encoders Understand Music Structure?
2025. Citations: 0

FoleyBench: A Benchmark For Video-to-Audio Models
2025. Citations: 0

Studies for : A Human-AI Co-Creative Sound Artwork Using a Real-time Multi-channel Sound Generation Model
2025. Citations: 0

SoundReactor: Frame-level Online Video-to-Audio Generation
2025. Citations: 0

TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models
2025. Citations: 0


Co-authors

39 total

Distinguished Engineer, Sony

Takashi Shibuya

Shusuke Takahashi

Sony Group Corporation

SB Intuitions / Softbank

Sony Research Inc.