Scholar
Kun Ding
Google Scholar ID: kbwv2tkAAAAJ
CASIA
CV
Multimodal
Citations & Impact
All-time
Citations: 508
H-index: 11
i10-index: 13
Publications: 20
Co-authors: 9
Publications
19 items
WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering (2026). Citations: 0
Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding (2026). Citations: 0
SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation (2026). Citations: 0
CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering (2026). Citations: 0
InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing (2026). Citations: 0
Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions (2026). Citations: 0
DSFC-Net: A Dual-Encoder Spatial and Frequency Co-Awareness Network for Rural Road Extraction (2026). Citations: 0
PDE-Agent: A toolchain-augmented multi-agent framework for PDE solving (2025). Citations: 0
Co-authors
9 total
Shiming Xiang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Bin Fan
University of Science and Technology Beijing, previously at NLPR, CASIA
Gaofeng MENG (孟高峰)
Institute of Automation, Chinese Academy of Sciences
Cheng Da
Alibaba Group
Qi Yang
PhD student, Institute of Automation, Chinese Academy of Sciences
Tao Zhang
PhD Candidate, Institute of Automation, Chinese Academy of Sciences