Published multiple papers, including 'Taming Generative Video Models for Zero-shot Optical Flow Extraction', 'Discovering and Using Spelke Segments', and 'Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals', among others. Contributed to projects such as the BabyView Dataset and ZeroShape: Regression-based Zero-shot Shape Reconstruction.
Research Experience
Currently an applied scientist at Amazon, focusing on video understanding. Previously worked in the Stanford Vision and Learning Lab and the Stanford NeuroAILab, developing video foundation models and fine-grained object representations.
Education
Received a PhD from the Georgia Institute of Technology, advised by James Rehg, and completed postdoctoral research at Stanford University, advised by Jiajun Wu and Dan Yamins.
Background
Research interests span computer vision and machine learning, particularly 3D vision, self-supervision, data synthesis, and video-based learning. The overarching goal is to develop systems that efficiently learn rich, fine-grained representations of the physical world.