DNN Acceleration
DNN acceleration focuses on optimizing deep neural network performance and energy efficiency, primarily addressing the computational demands of increasingly complex models. Current research emphasizes techniques like model pruning (including novel unaligned block-wise methods), hardware-software co-design for specialized architectures (e.g., RISC-V based SoCs and optical computing), and algorithmic innovations such as event-driven processing. These advancements are crucial for deploying DNNs on resource-constrained devices (mobile, IoT) and high-performance computing platforms, impacting diverse fields from mobile AI to large-scale data centers.
Papers
July 30, 2024
July 29, 2024
November 15, 2023
May 15, 2023
April 20, 2022
December 4, 2021