Early-Exit DNN Inference on HMPSoCs

Open Access
Authors
Publication date 2025
Host editors
  • A. Ding
  • O. Rana
  • F. Awaysheh
  • V. Hilt
  • Y. Simmhan
Book title 2025 IEEE International Conference on Edge Computing & Communications
Book subtitle IEEE EDGE 2025 : Helsinki, Finland, 7-12 July 2025 : proceedings
ISBN
  • 9798331555603
ISBN (electronic)
  • 9798331555597
Event Proc. of the IEEE Int. Conference on Edge Computing and Communications (EDGE 2025)
Pages (from-to) 75-82
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Using Heterogeneous Multi-Processor System-on-Chips (HMPSoCs) for Deep Neural Network (DNN) inference has become commonplace in edge devices. However, reducing the DNN inference latency on resource-constrained edge devices remains a first-class constraint. Early-Exit (EE) DNNs offer variable, input-dependent inference latency, aiming to reduce average inference latency by lowering the latency of simple inputs at the cost of increased latency for complex inputs. However, the overheads introduced by exit branches reduce the potential performance gains with EE DNNs.Furthermore, existing CPU- or GPU-only implementations of EE DNN inference under-utilize the HMPSoC. To address this limitation, we propose a cooperative and parallel CPU-GPU execution approach1 for EE DNN inference that effectively distributes computations across all HMPSoC processors, minimizing latency variations. Our approach allows EE DNNs to achieve reduced average latency on HMPSoCs relative to their static counterparts, significantly reducing average- and worst-case inference latencies and enhancing the speed-up of EE DNN compared to the best single-processor inference. On average, the worst-case inference latency decreased by 24.8% across three commonly used EE DNNs, providing latency comparable to a static model without compromising accuracy on an RK3399PRO HMPSoC.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/EDGE67623.2025.00017
Other links https://github.com/Saeed-Khalilian/EEDNN_on_HMPSoCs.git https://www.proceedings.com/82143.html
Downloads
Early-Exit_DNN_Inference_on_HMPSoCs (Final published version)
Permalink to this page
Back