Sudharsan, Bharath ORCID: 0000-0001-5906-113X, Patel, Pankesh ORCID: 0000-0001-5973-4197, Breslin, John G. ORCID: 0000-0001-5790-050X and Ali, Muhammad Intizar ORCID: 0000-0002-0674-2131 (2021) Enabling machine learning on the edge using SRAM conserving efficient neural networks execution approach. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases ECML PKDD 2021, 13-17 Sept 2021, Bilbao, Spain. ISBN 978-3-030-86516-0
Abstract
Edge analytics refers to the application of data analytics and Machine Learning (ML) algorithms on IoT devices. The concept of edge analytics is gaining popularity due to its ability to perform AI-based analytics at the device level, enabling autonomous decision-making, without depending on the cloud. However, the majority of Internet of Things (IoT) devices are embedded systems with a low-cost microcontroller unit (MCU) or a small CPU as its brain, which often are incapable of handling complex ML algorithms. In this paper, we propose an approach for the efficient execution of already deeply compressed, large neural networks (NNs) on tiny IoT devices. After optimizing NNs using state-of-the-art deep model compression methods, when the resultant models are executed by MCUs or small CPUs using the model execution sequence produced by our approach, higher levels of conserved SRAM can be achieved. During the evaluation for nine popular models, when comparing the default NN execution sequence with the sequence produced by our approach, we found that 1.61-38.06% less SRAM was used to produce inference results, the inference time was reduced by 0.28-4.9 ms, and energy consumption was reduced by 4-84 mJ. Despite achieving such high conserved levels of SRAM, our method 100% preserved the accuracy, F1 score, etc. (model performance).
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | Edge AI; Resource-Constrained Devices; Intelligent Microcontrollers; SRAM Conservation; Offline Inference. |
Subjects: | Computer Science > Artificial intelligence Computer Science > Machine learning Engineering > Electronics Engineering > Electronic engineering |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering |
Published in: | Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD 2021. Lecture Notes in Computer Science 12979. Springer. ISBN 978-3-030-86516-0 |
Publisher: | Springer |
Official URL: | https://doi.org/10.1007/978-3-030-86517-7_2 |
Copyright Information: | © 2021 Springer |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
ID Code: | 26953 |
Deposited On: | 01 Apr 2022 12:19 by Muhammad Intizar Ali . Last Modified 18 Jan 2023 12:30 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB |
Metrics
Altmetric Badge
Dimensions Badge
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record