Scholar
Zhi Zhong
Google Scholar ID: iRVT3A8AAAAJ
Sony
Audio
Representation Learning
Music Technology
AI-based Contents Creation
Deep Generative Models
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
128
H-index
6
i10-index
4
Publications
19
Co-authors
39
list available
Contact
No contact links provided.
Publications
8 items
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
2026
Cited
0
Do Foundational Audio Encoders Understand Music Structure?
2025
Cited
0
FoleyBench: A Benchmark For Video-to-Audio Models
2025
Cited
0
Studies for : A Human-AI Co-Creative Sound Artwork Using a Real-time Multi-channel Sound Generation Model
2025
Cited
0
SoundReactor: Frame-level Online Video-to-Audio Generation
2025
Cited
0
TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models
2025
Cited
0
SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet
2025
Cited
0
Cross-Modal Learning for Music-to-Music-Video Description Generation
2025
Cited
0
Resume (English only)
Co-authors
39 total
Yuki Mitsufuji
Distinguished Engineer, Sony
Takashi Shibuya
Sony
Shusuke Takahashi
Sony Group Corporation
Kazuki Shimada
Sony
Mengjie Zhao
SB Intuitions / Softbank
WeiHsiang Liao
Sony Research Inc.
Yuhta Takida
Sony AI
Koichi Saito
Sony AI
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up