dc.description.abstract | We consider a resource-constrained Edge Device (ED) embedded with a small-size ML model (S-ML) for a generic classification
application, and an Edge Server (ES) that hosts a large-size ML model (L-ML). Since the inference accuracy of S-ML is lower than that
of L-ML, offloading all the data samples to the ES yields high inference accuracy, but it defeats the purpose of embedding S-ML
on the ED and forgoes the reduced latency, bandwidth savings, and energy efficiency of local inference. To get the
best of both worlds, i.e., the benefits of doing inference on the ED and the benefits of doing inference on the ES, we explore the idea
of Hierarchical Inference (HI), wherein the S-ML inference is accepted only when it is correct; otherwise, the data sample is offloaded for
L-ML inference. However, the ideal implementation of HI is infeasible because the ED does not know whether the S-ML inference is correct.
We thus propose an online meta-learning framework to predict the correctness of the S-ML inference. The resulting online learning
problem turns out to be a Prediction with Expert Advice (PEA) problem with continuous expert space. We consider the full feedback
scenario, where the ED receives feedback on the correctness of the S-ML once it accepts the inference, and the no-local feedback
scenario, where the ED does not receive the ground truth for the classification. We propose the HIL-F and HIL-N algorithms for these two scenarios, respectively, and prove regret bounds that are sublinear in the number of data samples. We evaluate and benchmark the performance of the proposed
algorithms for image classification applications using four datasets, namely, Imagenette and Imagewoof [18], MNIST [24], and CIFAR-10 [23]. | es |
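The HI decision rule described in the abstract can be sketched as a simple confidence test on the on-device model's output. This is a minimal illustration, not the paper's method: the fixed `threshold` below is a hypothetical stand-in for the decision boundary that the HIL-F and HIL-N algorithms learn online.

```python
def hi_decision(s_ml_probs, threshold=0.8):
    """Decide whether to accept the local S-ML inference or offload.

    s_ml_probs: softmax output of the small on-device model (S-ML).
    threshold: hypothetical confidence cutoff, standing in for the
    decision boundary learned online in the paper.
    Returns (offload, predicted_class).
    """
    confidence = max(s_ml_probs)
    predicted = s_ml_probs.index(confidence)
    # Low confidence suggests the S-ML inference may be incorrect,
    # so the sample is offloaded to the L-ML hosted on the ES.
    offload = confidence < threshold
    return offload, predicted

# Confident sample: accept the local inference.
print(hi_decision([0.05, 0.9, 0.05]))   # (False, 1)
# Ambiguous sample: offload to the edge server.
print(hi_decision([0.4, 0.35, 0.25]))   # (True, 0)
```

In the full feedback scenario the ED can observe whether an accepted prediction was correct and adjust the threshold accordingly; in the no-local feedback scenario it must do so without the ground truth.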