Responsive DNN Adaptation for Video Analytics against Environment Shift via Hierarchical Mobile-Cloud Collaborations

๐Ÿ“… 2025-04-30
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the sluggish adaptation of expert DNN models and degraded cloud-centric performance under dynamic environmental changes in mobile video analytics, this paper proposes MOCHA, a mobile-cloud hierarchical collaborative framework. Methodologically, MOCHA introduces a novel co-design of lightweight on-device model reuse and cloud-side semantic indexing retrieval; establishes a foundation-model-driven taxonomy for expert models; and devises an active prefetching caching strategy. It integrates on-device fast fine-tuning, cloud-side semantic analysis, structured model indexing, and demand-driven prefetching. Evaluated across three DNN tasks, MOCHA improves adaptation-phase accuracy by up to 6.8%, reduces response latency to 1/35.5 of the baseline, and cuts retraining time by 3.0ร—โ€”significantly enhancing real-time adaptability in dynamic environments.

Technology Category

Application Category

๐Ÿ“ Abstract
Mobile video analysis systems often encounter various deploying environments, where environment shifts present greater demands for responsiveness in adaptations of deployed"expert DNN models". Existing model adaptation frameworks primarily operate in a cloud-centric way, exhibiting degraded performance during adaptation and delayed reactions to environment shifts. Instead, this paper proposes MOCHA, a novel framework optimizing the responsiveness of continuous model adaptation through hierarchical collaborations between mobile and cloud resources. Specifically, MOCHA (1) reduces adaptation response delays by performing on-device model reuse and fast fine-tuning before requesting cloud model retrieval and end-to-end retraining; (2) accelerates history expert model retrieval by organizing them into a structured taxonomy utilizing domain semantics analyzed by a cloud foundation model as indices; (3) enables efficient local model reuse by maintaining onboard expert model caches for frequent scenes, which proactively prefetch model weights from the cloud model database. Extensive evaluations with real-world videos on three DNN tasks show MOCHA improves the model accuracy during adaptation by up to 6.8% while saving the response delay and retraining time by up to 35.5x and 3.0x respectively.
Problem

Research questions and friction points this paper is trying to address.

Optimizes DNN adaptation responsiveness via mobile-cloud collaboration
Reduces delays with on-device reuse and fast fine-tuning
Accelerates model retrieval using domain semantics taxonomy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical mobile-cloud collaboration for DNN adaptation
On-device model reuse and fast fine-tuning
Structured taxonomy for expert model retrieval
๐Ÿ”Ž Similar Papers
No similar papers found.
M
Maozhe Zhao
Shanghai Jiao Tong University, Shanghai, China
Shengzhong Liu
Shengzhong Liu
Shanghai Jiao Tong University
F
Fan Wu
Shanghai Jiao Tong University, Shanghai, China
Guihai Chen
Guihai Chen
Professor of Computer Science
Computer Science and Technology