Scholar
Ritesh Thawkar
Google Scholar ID: 9-2AnjQAAAAJ
Master of Science in Computer Vision, MBZUAI
Computer Vision
LLMs/VLMs
LLM Agents
RAG
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
101
H-index
3
i10-index
2
Publications
6
Co-authors
0
Contact
No contact links provided.
Publications
8 items
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
2026
Cited
0
EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards
2025
Cited
0
How Good are Foundation Models in Step-by-Step Embodied Reasoning?
2025
Cited
0
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
2025
Cited
0
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs
2025
Cited
0
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
2025
Cited
0
Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts
2025
Cited
0
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
2025
Cited
0
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up