Kanzhi Cheng

Google Scholar ID: S2IPVnwAAAAJ
PhD student, Nanjing University
Resume (English only)
Academic Achievements
  • - Publications:
  • - ACL 2024 (Main): SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
  • - NeurIPS 2025 (Poster): GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
  • - ACL 2025 (Main): OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
  • - NAACL 2025 (Main): Vision-Language Models Can Self-Improve Reasoning via Reflection
  • - ACL 2025 (Findings): CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
  • - ACMMM 2023: Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
  • - NLPCC 2022: ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora
  • - WCUA@ICML 2025: ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
  • - ACL 2025 (Main): Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
  • - ACL 2025 (Main): Interactive Evolution: A Neural-Symbolic Self-Training Framework for Large Language Models
  • - ICLR 2025 (Spotlight): OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
  • - Preprint: A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond
Research Experience
  • - Research Internships: Shanghai AI Lab, Tsinghua AIR, Microsoft Research
  • - Collaborated with: Dr. Zhiyong Wu, Dr. Hao Zhou, Dr. Qianhui Wu
Education
  • - PhD: Nanjing University, 2021.9 - present, Advisors: Dr. Jiajun Chen & Dr. Jianbing Zhang
Background
  • - Research Interests: Multimodal intelligence, particularly GUI agents and large vision-language models
  • - Professional Field: Natural Language Processing (NLP)
  • - Introduction: PhD student in the NLP Group at Nanjing University, advised by Dr. Jiajun Chen and Dr. Jianbing Zhang. Previously a research intern at Shanghai AI Lab, Tsinghua AIR, and Microsoft Research.