Scholar
Aishwarya Agrawal
Google Scholar ID: znH6xJ8AAAAJ
University of Montreal, Mila, Google DeepMind
Artificial Intelligence
Multimodal Vision-Language
Computer Vision
NLP
Deep Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
8,237
H-index
15
i10-index
23
Publications
20
Co-authors
0
Contact
No contact links provided.
Publications
16 items
How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning
2026
Cited
0
RiT: Vanilla Diffusion Transformers Suffice in Representation Space
2026
Cited
0
From Where Things Are to What They Are For: Benchmarking Spatial-Functional Intelligence in Multimodal LLMs
2026
Cited
0
Discovering Failure Modes in Vision-Language Models using RL
2026
Cited
0
Communicating about Space: Language-Mediated Spatial Integration Across Partial Views
2026
Cited
0
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
2025
Cited
0
Controlling Multimodal LLMs via Reward-guided Decoding
2025
Cited
0
The Promise of RL for Autoregressive Image Editing
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up