Distributed DNN inference at the edge

Open Access
Authors
Supervisors
Award date 05-02-2025
ISBN
  • 9789493391895
Series ASCI dissertation series, 463
Number of pages 142
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
As deep neural networks (DNNs) grow increasingly complex, their computational demands often surpass the capacity of edge devices, which typically possess limited resources. This thesis explores strategies for efficient and robust deployment of large DNNs in resource-constrained edge environments, where "edge" refers to devices located along the data path between data sources and the cloud. Deploying DNNs at the edge offers advantages such as enhanced privacy, efficiency, and reliability, but the limited resources of edge devices make such deployment challenging.
The thesis is divided into two parts. The first part addresses the challenge of optimal partitioning and deployment of DNNs over multiple resource-constrained edge devices. The AutoDiCE framework automates model partitioning, code generation, and communication optimization across devices, while a Design Space Exploration (DSE) technique identifies optimal distribution strategies to minimize energy and memory usage while maximizing system inference throughput.
The second part focuses on enhancing system robustness against potential device failures or connectivity issues. RobustDiCE maintains distributed inference accuracy by prioritizing critical neurons and partially replicating them across devices, so that the system remains functional even in failure scenarios. Additionally, EASTER, a similar partitioning method for large language models, balances resource utilization and robustness.
Overall, this thesis presents methods for efficient and fault-tolerant DNN deployment at the edge that optimize resource utilization and keep operation reliable. The proposed methods advance the adoption of distributed edge AI in resource-constrained environments.
Document type PhD thesis
Language English