Paper ID: 2405.07748

Neural Network Compression for Reinforcement Learning Tasks

Dmitry A. Ivanov, Denis A. Larionov, Oleg V. Maslennikov, Vladimir V. Voevodin

In real applications of Reinforcement Learning (RL), such as robotics, low latency and energy efficient inference is very desired. The use of sparsity and pruning for optimizing Neural Network inference, and particularly to improve energy and latency efficiency, is a standard technique. In this work, we perform a systematic investigation of applying these optimization techniques for different RL algorithms in different RL environments, yielding up to a 400-fold reduction in the size of neural networks.

Submitted: May 13, 2024

Topics

Reinforcement Learning
Reinforcement Learning Algorithm
Sparsity Increase
Efficient Inference
Neural Network Compression
Neural Network Inference
Reinforcement Learning Task

Links

arXiv PDF