Scholar
Armin Mustafa
Google Scholar ID: 0xOHqkMAAAAJ
Associate Professor in Computer Vision and AI, CVSSP, University of Surrey
Computer Vision
AI
3D/4D Vision
Scene Understanding
Machine Perception
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
998
H-index
14
i10-index
21
Publications
20
Co-authors
12
list available
Contact
Email
a.mustafa@surrey.ac.uk
CV
Open ↗
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
5 items
Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints
2026
Cited
0
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
International Conference on Learning Representations · 2025
Cited
0
PAL: Probing Audio Encoders via LLMs -- A Study of Information Transfer from Audio Encoders to LLMs
2025
Cited
0
Joint Reconstruction of Spatially-Coherent and Realistic Clothed Humans and Objects from a Single Image
2025
Cited
0
Deconstruct Complexity (DeComplex): A Novel Perspective on Tackling Dense Action Detection
2025
Cited
0
Resume (English only)
Academic Achievements
Published multiple high-impact papers in computer vision and AI, including:
- 'CAD - Contextual Multi-modal Alignment for Dynamic AVQA', WACV 2024
- 'DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification', AAAI 2024
- 'UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer', ICCVW 2023
- 'PAT: Position-Aware Transformer for Dense Multi-Label Action Detection', ICCVW 2023
- 'SEM-POS: Grammatically and Semantically Correct Video Captioning', CVPRW 2023
- 'KPE: Keypoint Pose Encoding for Transformer-based Image Generation', BMVC 2022
- '4D Temporally Coherent Multi-Person Semantic Reconstruction and Segmentation', IJCV 2022
- 'Multi-person Implicit Reconstruction from a Single Image', CVPR 2021
Research Experience
Currently Lecturer in Computer Vision and AI at CVSSP, University of Surrey
Worked at Samsung Research Institute, Bangalore, India (2010–2013) in Computer Vision for 3 years
Led or participated in key research projects including:
- CoSTAR National Lab for R&D in Creative Technology (2023–2033, AHRC funded)
- AI4ME: AI for Personalised Media Experiences (2021–2026, Prosperity Partnership with BBC R&D)
- 4D Vision for Perceptive Machines (2018–2023, RAEng funded)
- ALIVE: Live Action Light Fields for Immersive VR Experiences (2016–2017, Innovate UK)
- IMPART: Intelligent Management Platform for Advanced Real-Time media processes (2013–2015, EU FP7)
Co-authors
12 total
Adrian Hilton
Professor of Computer Vision & Graphics, University of Surrey
Hansung Kim
Associate Professor, University of Southampton
Jean-Yves Guillemaut
Senior Lecturer in 3D Computer Vision, University of Surrey, UK
Philip J.B. Jackson
Professor, CVSSP, Univ. of Surrey, UK
Marco Volino
Lecturer in Computer Vision and Graphics at University of Surrey
Andrew Gilbert
University of Surrey
Faegheh Sardari
Senior Scientist at Microsoft
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up