ExecuTorch -- A Unified PyTorch Solution to Run AI Models On-Device

📅 2026-05-05
📈 Citations: 0
Influential: 0
📄 PDF

career value

223K/year
🤖 AI Summary
This work proposes ExecuTorch, the first end-to-end deployment framework natively integrated with the PyTorch ecosystem, addressing the fragmentation commonly encountered in edge AI deployment. By introducing an extensible backend abstraction, quantization-aware optimizations, and a unified model serialization format, ExecuTorch preserves the original model semantics while seamlessly targeting heterogeneous hardware—from microcontrollers to specialized accelerators—without sacrificing low latency or offline execution capabilities. The framework bridges the gap between research and production workflows, enabling consistent development and efficient deployment across a broad spectrum of devices, ranging from wearables to compute clusters, thereby significantly enhancing both deployment efficiency and cross-platform consistency.
📝 Abstract
Local execution of AI on edge devices is important for low latency and offline operation. However, deploying models on diverse hardware remains fragmented, often requiring model conversion or complete reimplementation outside the PyTorch ecosystem where the model was originally authored. We introduce ExecuTorch, a unified PyTorch-native deployment framework for edge AI. ExecuTorch enables seamless deployment of machine learning models across heterogeneous compute environments. It scales from embedded microcontrollers to complex system-on-chips (SoCs) with dedicated accelerators, powering devices ranging from wearables and smartphones to large compute clusters. ExecuTorch preserves PyTorch semantics while allowing customization, support for optimizations like quantization, and pluggable execution "backends". These features together enable fast experimentation, allowing researchers to validate deployment behavior entirely within PyTorch, bridging the gap between research and production.
Problem

Research questions and friction points this paper is trying to address.

edge AI
model deployment
hardware heterogeneity
PyTorch ecosystem
on-device execution
Innovation

Methods, ideas, or system contributions that make the work stand out.

ExecuTorch
on-device AI
PyTorch-native deployment
heterogeneous hardware
pluggable backends
M
Mergen Nachin
Meta
D
Digant Desai
Meta
S
Sicheng Stephen Jia
Meta
C
Chen Lai
Meta
M
Mengwei Liu
Meta
J
Jacob Szwejbka
Meta
R
Raziel Alvarez
Work done while at Meta
R
RJ Ascani
Meta
D
Dave Bort
Meta
M
Manuel Candales
Meta
A
Andrew Caples
Meta
Yanan Cao
Yanan Cao
Institute of Information Engineering, Chinese Academy of Sciences
Z
Zhengxu Chen
Meta
Soumith Chintala
Soumith Chintala
Meta AI
Artificial IntelligenceDeep LearningMachine LearningComputer Vision
G
Gregory Comer
Meta
Tanvir Islam
Tanvir Islam
Okta, Inc.
Machine LearningArtificial IntelligenceCybersecuritySensing
S
Songhao Jia
Meta
T
Tarun Karuturi
Meta
J
Jack Khuu
Meta
A
Abhinay Kukkadapu
Meta
T
Tugsbayasgalan Manlaibaatar
Meta
A
Andrew Or
Meta
K
Kimish Patel
Meta
S
Siddartha Pothapragada
Meta
L
Lucy Qiu
Meta