Federico Cocchi
Google Scholar ID: BRG3e1EAAAAJ
PhD student, University of Modena and Reggio Emilia
Computer Vision
Citations & Impact
All-time
Citations: 346
H-index: 7
i10-index: 7
Publications: 13
Co-authors: 10
Contact
Email: federico.cocchi.97@gmail.com
Publications
4 items
ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
2025 · Cited: 0
RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors
2025 · Cited: 0
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
2025 · Cited: 0
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
arXiv.org · 2024 · Cited: 0
Resume (English only)
Academic Achievements
Paper 'Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering' accepted at CVPR 2025
Paper 'The (R)Evolution of Multimodal Large Language Models: A Survey' accepted at ACL Findings 2024
Paper 'Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs' accepted at CVPR Workshop 2024
Published research at top-tier conferences including CVPR, ECCV, ACL, ICCV Workshop, and ICPR
Proposed the ReflectiVA model, which integrates external knowledge via reflective tokens for improved knowledge-based VQA
Developed Wiki-LLaVA, which uses a hierarchical retrieval pipeline to augment MLLMs with external knowledge without compromising standard benchmark performance
Background
Final-year PhD student in the Italian National PhD Program in Artificial Intelligence
Research focuses on Multimodal LLMs, especially at the intersection of vision and language
Aims to enhance reasoning and understanding capabilities of models
Explores post-training techniques to enrich models with retrieval and reranking using multimodal data
Uses HPC systems for multi-GPU/multi-node foundation model training through collaboration with CINECA
Research interests include Generative AI, Computer Vision, and Natural Language Processing
Co-authors
10 total
Rita Cucchiara
Università degli Studi di Modena e Reggio Emilia, Italia
Marcella Cornia
Associate Professor, University of Modena and Reggio Emilia
Lorenzo Baraldi
Associate Professor, University of Modena and Reggio Emilia
Davide Caffagni
PhD student, AImageLab
Nicholas Moratelli
Applied Scientist Intern @ Amazon | PhD Student @ University of Modena and Reggio Emilia
Sara Sarto
University of Modena and Reggio Emilia
Samuele Poppi
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), previously GenAI at Meta