- Open Source Projects: MinerU2.5, PDF-Extract-Kit, MinerU
- Project Achievements: MinerU has over 40k stars on GitHub, MinerU 2.5 version released, multiple papers accepted by top conferences
Research Experience
- Research Scientist at Shanghai AI Laboratory (current position)
- Data Algorithm Research at SenseTime Group Inc. (2020-2022)
Education
- Ph.D.: University of Chinese Academy of Sciences (2020)
- Joint Ph.D. Training: University of Central Florida (2018-2019), supervised by Professors Yongdong Zhang and Guo-Jun Qi
Background
- Research Interests: Intelligent Document Understanding, Multimodal Large Language Models, Data-Centric AI
- Position: Research Scientist at Shanghai AI Laboratory
- Advisor: Dr. Conghui He
- Project Lead: MinerU project, an open-source toolkit for high-quality document parsing with over 40k stars on GitHub
Miscellany
- Recruitment: Recruiting students with strong interest in research, including algorithm interns, young researchers, and doctoral students for joint training