Scholar

Liang-Chieh Chen

Google Scholar ID: ACjYGPUAAAAJ

Senior Principal Scientist, Amazon FAR, LA

Computer VisionMachine Learning

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

115,073

H-index

i10-index

Publications

Co-authors

129

list available

Contact

GitHubOpen ↗

Publications

15 items

Taming Outlier Tokens in Diffusion Transformers

2026

Cited

Large Language Models are Universal Reasoners for Visual Generation

2026

Cited

Frequency-Aware Flow Matching for High-Quality Image Generation

2026

Cited

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

2026

Cited

Autoregressive Image Generation with Masked Bit Modeling

2026

Cited

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

2025

Cited

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

2025

Cited

Deeply Supervised Flow-Based Generative Models

2025

Cited

Resume (English only)

Academic Achievements

Led or contributed to influential DeepLab derivatives: Auto-DeepLab, Panoptic-DeepLab (winner of ICCV 2019 Mapillary Vistas Panoptic Segmentation track and top performer on Cityscapes), Axial-DeepLab, ViP-DeepLab, MaX-DeepLab, kMaX-DeepLab.
Co-developed MobileNetv2 and MobileNetv3, now standard architectures for efficient mobile vision models.
Multiple papers accepted at top venues including ICCV 2025 and NeurIPS 2025, covering diffusion transformers, physics-based video generation, panoptic captioning, flow-based generative models, etc.
Served as Area Chair for ICCV (2019, 2025), CVPR (2020, 2023–2026), ECCV (2020, 2024), NeurIPS (2022, 2024–2025), ICML (2025).
Action Editor for Transactions on Machine Learning Research (TMLR).
Received Outstanding Reviewer awards at CVPR 2021 and 2018, and Top Reviewer at NeurIPS 2019.

Background

Currently a Research Scientist at Apple AI/ML, building cutting-edge visual generative models.
Widely recognized for co-developing the DeepLab series (DeepLabv1 to DeepLabv3+) with George Papandreou.
Introduced atrous (dilated) convolution in December 2014, now a foundational technique for dense prediction tasks.
Known for collaborative work on MobileNetv2 and MobileNetv3, which have become standards for efficient mobile neural networks.
His team focuses on fundamental research in computer vision, visual generation, and representation learning.

Co-authors

129 total