Early Exiting with Compressible Activations for Efficient Neural Network Inference
dc.contributor.author | Beysens, Jona | |
dc.contributor.author | Sénéclauze, Martin | |
dc.contributor.author | Dallemagne, Philippe | |
dc.contributor.author | Bigdeli, Siavash | |
dc.date.accessioned | 2023-03-31T07:41:36Z | |
dc.date.available | 2023-03-31T07:41:36Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Cloud-based Machine Learning (ML) incurs high latency and high sensitivity to connectivity failures. This can be improved by maximizing the ML processing on the local device. However, since these low-power devices typically have limited resources, the early-exit mechanism has been used to hierarchically split a neural network in parts over multiple devices, trading off computation with communication costs. But, the intermediate activations can be significantly large at the split point. In this paper, we present a novel entropy-based technique that learns to intelligently compress activations during training and efficiently encodes them during inference. We show that in an early-exit configuration, entropy regularization with Huffman coding can save up to 54% in communication cost, while keeping the classification accuracy of MobileNetV2 on CIFAR-10 above 85%. | en_US |
dc.description.sponsorship | Internal research funding by DATAPROG (RES3) | en_US |
dc.identifier.citation | IEEE, Smart Systems Integration, Bruges, Belgium | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.12839/1173 | |
dc.language.iso | en | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Hierarchical machine learning | en_US |
dc.subject | Activation compression | en_US |
dc.subject | Entropy regularization | en_US |
dc.subject | Early exiting | en_US |
dc.title | Early Exiting with Compressible Activations for Efficient Neural Network Inference | en_US |
dc.type | Conference | en_US |
dc.type.csemdivisions | Div-M | en_US |
dc.type.csemresearchareas | Data & AI | en_US |
dc.type.csemresearchareas | IoT & Vision | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 20230213_SSI2023_manuscript_submission_preprint_footer.pdf
- Size:
- 480.02 KB
- Format:
- Adobe Portable Document Format
- Description:
- Main article
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 2.82 KB
- Format:
- Item-specific license agreed upon to submission
- Description: