Zitat
T. Fei, Efficient Software Implementation of Deep Learning Architectures on Embedded Systems. 2025 [Online]. Available: https://stup.ferit.hr/2025/03/10/gp-efficient-software-implementation-of-deep-learning-architectures-on-embedded-systems/?utm_source=rss&utm_medium=rss&utm_campaign=gp-efficient-software-implementation-of-deep-learning-architectures-on-embedded-systems
Abstract
This scientific talk discusses methods for optimising deep learning architectures on embedded systems. It highlights key challenges, such as limited processing power, memory constraints, and real-time performance requirements. Model compression techniques, including quantisation, pruning, knowledge distillation, and weight sharing, are explored to reduce memory usage and computational complexity. Hardware-software co-design is emphasised, leveraging specialised accelerators like NPUs, GPUs, and FPGAs to improve efficiency. Additionally, software optimisation techniques, demonstrated through a radar-based hand gesture recognition project, showcase how deep learning can be effectively deployed on edge devices while balancing accuracy, performance, and resource constraints.