Interpretable Layer

Interpretable layers in neural networks aim to make deep learning models more transparent and understandable, addressing the "black box" problem. Current research focuses on developing novel layer architectures, such as simplicial maps and piecewise-linear functions, and on integrating them into existing models like CNNs and transformers, often alongside techniques such as attention mechanisms and class activation mapping that highlight the input features driving a prediction. This work matters because it builds trust in model outputs and allows complex models to be debugged and refined more effectively, particularly in high-stakes applications such as legal decision-making and medical diagnosis, where understanding the model's reasoning is crucial.
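
As one concrete illustration of the feature-highlighting techniques mentioned above, the sketch below computes a class activation map (CAM) for a CNN whose classifier is global average pooling followed by a single linear layer. It is a minimal sketch, not the method of any particular paper listed here; the choice of torchvision's ResNet-18 and the attribute names (`layer4`, `fc`) are assumptions made for the example.

```python
import torch
import torch.nn.functional as F
from torchvision import models

# Hypothetical setup: a ResNet-18 whose classifier is GAP + one linear layer,
# which is the structure the original CAM formulation assumes.
model = models.resnet18(weights="IMAGENET1K_V1").eval()

features = {}

def capture(module, inputs, output):
    # Store the last convolutional feature maps, shape (1, C, H, W).
    features["maps"] = output.detach()

model.layer4.register_forward_hook(capture)

image = torch.randn(1, 3, 224, 224)  # placeholder input tensor
with torch.no_grad():
    logits = model(image)
class_idx = logits.argmax(dim=1).item()

# CAM_c(x, y) = sum_k w_{c,k} * f_k(x, y): weight each channel's feature map
# by the classifier weight connecting that channel to the predicted class.
weights = model.fc.weight[class_idx]        # (C,)
maps = features["maps"][0]                  # (C, H, W)
cam = torch.einsum("c,chw->hw", weights, maps)
cam = F.relu(cam)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # normalize to [0, 1]

# Upsample to input resolution so the map can be overlaid on the image.
cam = F.interpolate(cam[None, None], size=image.shape[-2:],
                    mode="bilinear", align_corners=False)[0, 0]
```

The resulting heat map can be overlaid on the input image to show which spatial regions contributed most to the predicted class, which is the kind of layer-level explanation this line of work aims to build directly into the model.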

Papers