Scholar
Kaixiang Ji
Google Scholar ID: PNTIf4gAAAAJ
Ant Group
Computer Vision
Multimodal
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
139
H-index
6
i10-index
5
Publications
17
Co-authors
10
list available
Contact
No contact links provided.
Publications
9 items
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
2025
Cited
0
ARGenSeg: Image Segmentation with Autoregressive Image Generation Model
2025
Cited
0
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer
2025
Cited
0
GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks
2025
Cited
0
HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
2025
Cited
0
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
2025
Cited
0
Ming-Omni: A Unified Multimodal Model for Perception and Generation
2025
Cited
0
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction
2025
Cited
0
Load more
Resume (English only)
Co-authors
10 total
Jian Wang
Senior Staff Algorithm Engineer, Ant Group
Weixiang Hong
National University of Singapore
Ziyuan Huang
Ant Group
Co-author 4
Jingdong Chen
Senior Staff Algorithm Engineer, Ant Group
Co-author 6
Xin Guo
Staff Research Scientist, SAIS
Jiajia Liu
Ant Group
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up