Scholar
Ruohao Guo
Google Scholar ID: hMWIp6MAAAAJ
Peking University
Multi-Modal Learning
Computer Vision
Video Generation
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
879
H-index
11
i10-index
11
Publications
20
Co-authors
0
Contact
No contact links provided.
Publications
15 items
SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation
2026
Cited
0
vLinear: A Powerful Linear Model for Multivariate Time Series Forecasting
2026
Cited
0
Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks
2025
Cited
0
SimToken: A Simple Baseline for Referring Audio-Visual Segmentation
2025
Cited
0
Teacher-Guided Pseudo Supervision and Cross-Modal Alignment for Audio-Visual Video Parsing
2025
Cited
0
TEn-CATS: Text-Enriched Audio-Visual Video Parsing with Multi-Scale Category-Aware Temporal Graph
2025
Cited
0
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers
2025
Cited
0
Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up