Scholar

Humphrey Shi

Google Scholar ID: WBvt5A8AAAAJ

Georgia Tech | UIUC || ...

𝐇𝐢𝐠𝐡 𝐏𝐞𝐫𝐟𝐨𝐫𝐦𝐚𝐧𝐜𝐞 𝐀𝐈Computer VisionMultimodalCreative AIAI Systems

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

20,175

H-index

i10-index

104

Publications

Co-authors

list available

Contact

Emailshihonghui3@gmail.com CVOpen ↗TwitterOpen ↗GitHubOpen ↗LinkedInOpen ↗

Publications

26 items

WorldBagel: Uncovering the Power of Unified Multimodal Models for Vision-Language-Action-World Modeling

2026

Cited

SOLAR: AI-Powered Speed-of-Light Performance Analysis

2026

Cited

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

2026

Cited

Le-DETR: Revisiting Real-Time Detection Transformer with Efficient Encoder Design

2026

Cited

DuoGen: Towards General Purpose Interleaved Multimodal Generation

2026

Cited

VibeTensor: System Software for Deep Learning, Fully Generated by AI Agents

2026

Cited

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

2025

Cited

PAI-Bench: A Comprehensive Benchmark For Physical AI

2025

Cited

Resume (English only)

Academic Achievements

Published papers across CVPR, NeurIPS, ICLR, ICCV, ECCV, TPAMI (including highly cited works); won a dozen international AI competitions (ImageNet, NVIDIA AI City Challenge, NIST TRECVID, DAC-System Design Contest, etc.); open-source impact: NATTEN, StreamingT2V, Text2Video-Zero, Versatile Diffusion, OneFormer—widely adopted across academia and industry; frequently GitHub trending; models downloaded millions of times; Hugging Face: #1 most-liked US university lab (Nov 2023); received National Science Foundation CAREER Award; selected among 100 outstanding early-career engineers in the US by the National Academy of Engineering Frontiers of Engineering; industry recognition: Apple App Store “Trend of the Year: Generative AI” (2023); TIPA Best Consumer AI App (2023); IBM Research Accomplishment Award (2018).

Research Experience

Served as Chief Scientist at Picsart (2021-2025), where he built a global AI team from the ground up across research, engineering, and product, and delivered AI tools used by 150M+ monthly users; also worked as Research Staff Member at IBM T. J. Watson Research Center, and professor at Oregon & UIUC.

Education

No detailed educational background information provided.

Background

A professor at Georgia Tech, and an engineer-researcher working across high-performance AI, multimodal AI, and computer vision. His mission is to build the next generation high-performance, multimodal, and creative AI systems that empower intelligence, creativity, and humanity.

Miscellany

Leadership & mentoring: program chair, CVPR 2027; teaches Computer Vision (UG/Grad); advised students now at NVIDIA, Google, OpenAI, Meta, Apple, Amazon, Tesla, etc.; for future collaborators & investors: building high-performance multimodal & agentic AI—from kernels and algorithms to deployed systems and products; email/DM welcome.

Co-authors

94 total