Scholar
Roman Bachmann
Google Scholar ID: -KHAy7kAAAAJ
PhD Student at EPFL
Multimodality
Scalable foundation models
World modeling
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,282
H-index
10
i10-index
10
Publications
14
Co-authors
14
list available
Contact
Email
roman.bachmann@epfl.ch
GitHub
Open ↗
Publications
2 items
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks
2025
Cited
0
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
2025
Cited
0
Resume (English only)
Academic Achievements
Published multiple papers at top-tier venues including NeurIPS, ICML, ECCV, CVPR, SIGGRAPH, ICCV, and 3DV
Best Paper Award at SIGGRAPH 2022 (CLIPasso)
Best Presentation and 3rd Best Paper at Central European Seminar on Computer Graphics 2019
1st Place in Young Investigator Award competition at 8th Int. Congress on Science and Skiing (2019)
Several papers selected for oral presentations (e.g., 3DV 2019, NeurIPS 2023 Spotlight)
Co-authors
14 total
Amir Zamir
Professor of Computer Science, EPFL
David Mizrahi
Apple
Oğuzhan Fatih Kár
Apple
Andrei Atanov
EPFL
Ainaz Eftekhar
PhD Student, University of Washington
Afshin Dehghan
AI/ML @Apple
Alexander (Sasha) Sax
FAIR (Meta AI)
Mingfei Gao
Apple Inc.
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up