Scholar
Roman Bachmann
Google Scholar ID: -KHAy7kAAAAJ
PhD Student at EPFL
Multimodality
Scalable foundation models
World modeling
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,282
H-index
10
i10-index
10
Publications
14
Co-authors
14
list available
Contact
No contact links provided.
Publications
5 items
Weblica: Scalable and Reproducible Training Environments for Visual Web Agents
2026
Cited
0
(1D) Ordered Tokens Enable Efficient Test-Time Search
2026
Cited
0
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
2026
Cited
0
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks
2025
Cited
0
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
2025
Cited
0
Resume (English only)
Co-authors
14 total
Amir Zamir
Professor of Computer Science, EPFL
David Mizrahi
Apple
Oğuzhan Fatih Kár
Apple
Andrei Atanov
EPFL
Ainaz Eftekhar
PhD Student, University of Washington
Afshin Dehghan
AI/ML @Apple
Alexander (Sasha) Sax
FAIR (Meta AI)
Mingfei Gao
Apple Inc.
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up