Peng Tang

Google Scholar ID: h_oYR-IAAAAJ
Meta
Multi-modal LLM, Vision Language, Computer Vision
Citations & Impact
All-time
  • Citations: 2,901
  • H-index: 20
  • i10-index: 22
  • Publications: 20
  • Co-authors: 0
Resume (English only)
Academic Achievements
  • Two papers on GUI/web agents were accepted to ACL 2025 Findings.
  • One paper on transformer decoder inference acceleration was accepted to NAACL 2024 Findings.
  • One paper on text-based VQA was accepted to the NAACL 2024 Industry Track.
  • One paper on reasoning-based chart VQA was accepted to CVPR 2024.
  • One paper on document understanding and one paper on VL model knowledge distillation were accepted to AAAI 2024.
Research Experience
  • Currently a Research Scientist at Meta.
  • Worked as an intern in the Internet Media group at Microsoft Research Asia, advised by Chunyu Wang and Jingdong Wang.
  • Worked as a visiting student in the CCVL group at Johns Hopkins University, advised by Prof. Alan Yuille.
  • Worked as an intern at Tencent AI Lab, advised by Lin Ma and Zequn Jie.
  • Worked as an Applied Scientist at Amazon AWS AI Labs and Salesforce Research.
Background
  • Research interests: Computer Vision and Multi-modal LLMs.
Co-authors
  • 0 total (list not available)