Scholar

Yunsheng Li

Google Scholar ID: hJrIyCwAAAAJ

Microsoft

computer vision

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

3,907

H-index

i10-index

Publications

Co-authors

list available

Contact

CVOpen ↗GitHubOpen ↗

Publications

8 items

RubricRL: Simple Generalizable Rewards for Text-to-Image Generation

2025

Cited

Improving Code Localization with Repository Memory

2025

Cited

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

2025

Cited

Show and Segment: Universal Medical Image Segmentation via In-Context Learning

2025

Cited

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

2025

Cited

Benchmarking Large and Small MLLMs

2025

Cited

Olympus: A Universal Task Router for Computer Vision Tasks

arXiv.org · 2024

Cited

SCHEME: Scalable Channel Mixer for Vision Transformers

2023

Cited

Resume (English only)

Academic Achievements

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge
SCHEME: Scalable Channel Mixer for Vision Transformers
Fully Authentic Visual Question Answering Dataset from Online Communities
Dense Network Expansion for Class Incremental Learning
Should All Proposals Be Treated Equally in Object Detection?
MicroNet: Towards Image Recognition with Extremely Low FLOPs
Dynamic Transfer for Multi-Source Domain Adaptation
Revisiting Dynamic Convolution via Matrix Decomposition
Explainable Object-Induced Action Decision for Autonomous Vehicles
Bidirectional Learning for Domain Adaptation of Semantic Segmentation
Efficient Multi-Domain Learning by Covariance Normalization
Deep Scene Image Classification with the MFAFVNet
Deep Hashing with Hash-Consistent Large Margin Proxy Embeddings
Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes

Education

2015-2021: Ph.D. student at the University of California, San Diego, focusing on overcoming resource-constrained computer vision topics such as efficient neural network architecture design and domain adaptation.

Background

Yunsheng Li is a Senior Researcher at Microsoft Azure GenAI Group. He is working on the development of multi-modality large language models. His research interests include computer vision (segmentation, domain adaptation), deep learning (network architecture design), and multi-modality large language models. His representative works include phi-3-vision, MicroNet, and BDL.

Co-authors

19 total

Nuno Vasconcelos

Professor of Electrical and Computer Engineering, University of California San Diego

Dongdong Chen

Principal Research Manager, GenAI, Microsoft

Mengchen Liu