ReLU Layer
ReLU (Rectified Linear Unit) layers apply the element-wise function f(x) = max(0, x) and are a fundamental component of many neural networks, introducing the non-linearity that allows them to learn complex patterns. Current research focuses on ReLU's theoretical properties, including its impact on network injectivity, expressivity, and approximation capability, often in the context of specific architectures such as convolutional neural networks and transformers. This work aims to improve training efficiency, enhance model interpretability, and address challenges such as the "dying ReLU" problem and computational cost across applications ranging from image classification to reinforcement learning. The findings deepen our understanding of neural network behavior and inform the design of more efficient and effective deep learning models.
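To make the mechanics concrete, the sketch below shows a minimal ReLU layer with a forward and backward pass in NumPy. It is an illustrative example only, not code from the papers listed below; the class name ReLULayer and its structure are assumptions. The backward pass also shows why "dying ReLU" can occur: units whose inputs are never positive receive zero gradient and stop updating.

```python
import numpy as np

class ReLULayer:
    """Element-wise ReLU: f(x) = max(0, x).

    Minimal illustrative sketch; names and structure are assumptions,
    not taken from any of the papers listed below.
    """

    def forward(self, x: np.ndarray) -> np.ndarray:
        # Cache the mask of active units for use in the backward pass.
        self.mask = x > 0
        return np.where(self.mask, x, 0.0)

    def backward(self, grad_out: np.ndarray) -> np.ndarray:
        # Gradients flow only through units whose input was positive.
        # Units that are never active get zero gradient, which is the
        # mechanism behind the "dying ReLU" problem mentioned above.
        return grad_out * self.mask


# Usage example
layer = ReLULayer()
x = np.array([[-1.5, 0.0, 2.0],
              [ 3.0, -0.5, 1.0]])
y = layer.forward(x)                   # [[0. 0. 2.], [3. 0. 1.]]
dx = layer.backward(np.ones_like(x))   # [[0. 0. 1.], [1. 0. 1.]]
```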
Papers
Network Degeneracy as an Indicator of Training Performance: Comparing Finite and Infinite Width Angle Predictions
Cameron Jakub, Mihai Nica
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson, Chris Lu, Gunshi Gupta, Nicolas Beltran-Velez, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal