Scholar

Tobias Weyand

Google Scholar ID: US56Kw8AAAAJ

Google Reserach

Computer VisionMachine Learning

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

3,155

H-index

i10-index

Publications

Co-authors

Contact

TwitterOpen ↗GitHubOpen ↗LinkedInOpen ↗

Publications

6 items

Minerva-Ego: Spatiotemporal Hints for Egocentric Video Understanding

2026

Cited

CURVE: A Benchmark for Cultural and Multilingual Long Video Reasoning

2026

Cited

CAViAR: Critic-Augmented Video Agentic Reasoning

2025

Cited

MINERVA: Evaluating Complex Video Reasoning

2025

Cited

Neptune: The Long Orbit to Benchmarking Long Video Understanding

arXiv.org · 2024

Cited

VideoPrism: A Foundational Visual Encoder for Video Understanding

International Conference on Machine Learning · 2024

Cited

Resume (English only)

Academic Achievements

Published several papers such as 'Minerva: Evaluating Complex Video Reasoning' (2025), 'Neptune: The Long Orbit to Benchmarking Long Video Understanding' (2024), 'Extending Video Masked Autoencoders to 128 frames' (2024), 'VideoPrism: A Foundational Visual Encoder for Video Understanding' (2024), 'VideoGLUE: Video General Understanding Evaluation of Foundation Models' (2024), and 'Google Landmarks Dataset v2 - A Large-Scale Benchmark for Instance-Level Recognition and Retrieval'.

Research Experience

Works at Google DeepMind on video understanding; previously worked on some Nintendo DS homebrew projects.

Education

PhD from RWTH Aachen Vision Lab, supervised by Bastian Leibe; Diploma (MS) in Computer Science from RWTH Aachen University.

Background

A Computer Vision researcher focusing on video understanding, particularly long video representations. Research interests include landmark recognition, image localization, large-scale image retrieval, and clustering.

Miscellany

Interested in Nintendo DS homebrew projects, which can be found on his GitHub page.

Co-authors

0 total

Co-authors: 0 (list not available)