Adaptive AI Task Partitioning and Offloading in Heterogeneous Edge-Cloud Networks
Published in , 2026
This master’s thesis presents REAP, a runtime energy-aware framework for dynamically partitioning and offloading deep neural network inference across heterogeneous edge, fog, and cloud devices. The framework profiles computation and network conditions, evaluates possible model split points, and periodically adapts the execution strategy based on latency, communication overhead, and energy consumption. Evaluated on a physical three-node testbed using VGG-16, AlexNet, and MobileNetV2, REAP reduced total energy consumption by up to 35.82% and end-to-end latency by up to 22.92% compared with static partitioning.
Recommended citation: A. A. Deng and E. Butkus, “Adaptive AI task partitioning and offloading in heterogeneous edge-cloud networks: REAP—Runtime Energy-Aware Adaptive Partitioning and Offloading Framework,” M.Sc. thesis, Dept. Comput. Syst. Sci., Stockholm Univ., Stockholm, Sweden, 2026.
Download Paper
