Scholar

Vighnesh Birodkar

Google Scholar ID: whm4LjQAAAAJ

Google

Computer VisionMachine Learning

Citations & Impact

All-time

Citations

3,634

H-index

i10-index

Publications

Co-authors

Contact

Publications

1 items

arXiv.org · 2024

Cited

Resume (English only)

Academic Achievements

- Trained auto encoders with diffusion loss (without GANs) and achieved both better compression and higher quality generation.
- Contributed to Google's flagship AI model, Gemini 2.5 and Veo.
- Trained an LLM with discrete video tokens for zero-shot video generation.
- Fixed a critical flaw in Region Proposal networks to achieve strong mask generation across classes.
- Showed that 10% of ImageNet can be discarded without any loss in evaluation accuracy.

Research Experience

Works in the video pre-training team at Luma AI; contributed to the Gemini 2.5 and Veo projects by adding video data and metrics to the Gemini codebase and training a prototype diffusion transformer that became Veo 1.

Background

A machine learning researcher in the space of video generative models. Deeply interested in compression, real-time, and long context generation. Has been generating videos since 2017. Currently works in the video pre-training team at Luma AI.

Miscellany

Co-authors

0 total

Co-authors: 0 (list not available)