TY - JOUR
T1 - An Integrated Cloud-Edge-Device Adaptive Deep Learning Service for Cross-Platform Web
AU - Huang, Yakun
AU - Qiao, Xiuquan
AU - Tang, Jian
AU - Ren, Pei
AU - Liu, Ling
AU - Pu, Calton
AU - Chen, Junliang
N1 - Funding Information:
This work was supported in part by the National Key R&D Program of China under Grant 2018YFE0205503, in part by the Funds for International Cooperation and Exchange of NSFC under Grant 61720106007, and in part by the 111 Project under Grant B18008.
Publisher Copyright:
© 2002-2012 IEEE.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - Deep learning shows great promise in providing more intelligence to the cross-platform web. However, insufficient infrastructure, heavy models, and intensive computation limit the use of deep learning with low-performing web browsers. We propose DeepAdapter, an integrated cloud-edge-device framework that ties the edge, the remote cloud, with the device by cross-platform web technology for adaptive deep learning services towards lower latency, lower mobile energy, and higher system throughput. DeepAdapter consists of context-aware pruning, service updating, and online scheduling. First, the offline pruning module provides a context-aware pruning algorithm that incorporates the latency, the network condition, and the device's computing capability to fit various contexts. Second, the service updating module optimizes branch model cache on the edge for massive mobile users and updates the new model pruning requirements. Third, the online scheduling module matches optimal branch models for mobile users. Also, a two-stage DRL-based online scheduling method named DeepScheduler can handle high concurrent requests between edge centers and remote cloud by designing the reward prediction model. Extensive experiments show that DeepAdapter can decrease average latency by 1.33x, reduce average mobile energy by 1.4x, and improve system throughput by 2.1x with considerable accuracy.
AB - Deep learning shows great promise in providing more intelligence to the cross-platform web. However, insufficient infrastructure, heavy models, and intensive computation limit the use of deep learning with low-performing web browsers. We propose DeepAdapter, an integrated cloud-edge-device framework that ties the edge, the remote cloud, with the device by cross-platform web technology for adaptive deep learning services towards lower latency, lower mobile energy, and higher system throughput. DeepAdapter consists of context-aware pruning, service updating, and online scheduling. First, the offline pruning module provides a context-aware pruning algorithm that incorporates the latency, the network condition, and the device's computing capability to fit various contexts. Second, the service updating module optimizes branch model cache on the edge for massive mobile users and updates the new model pruning requirements. Third, the online scheduling module matches optimal branch models for mobile users. Also, a two-stage DRL-based online scheduling method named DeepScheduler can handle high concurrent requests between edge centers and remote cloud by designing the reward prediction model. Extensive experiments show that DeepAdapter can decrease average latency by 1.33x, reduce average mobile energy by 1.4x, and improve system throughput by 2.1x with considerable accuracy.
KW - DNN compression
KW - Mobile computing
KW - adaptive service
KW - cross-platform web
KW - edge computing
UR - http://www.scopus.com/inward/record.url?scp=85118528034&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118528034&partnerID=8YFLogxK
U2 - 10.1109/TMC.2021.3122279
DO - 10.1109/TMC.2021.3122279
M3 - Article
AN - SCOPUS:85118528034
SN - 1536-1233
VL - 22
SP - 1950
EP - 1967
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
IS - 4
ER -