Kronecker Product Approximation

Kronecker product approximation is a technique used to efficiently approximate large matrices, particularly those arising in machine learning optimization problems, by representing them as the Kronecker product of smaller matrices. Current research focuses on applying this approximation within optimization algorithms like Shampoo and natural gradient methods, particularly in decentralized and Riemannian settings, improving computational efficiency and scalability. This approach is proving valuable in diverse applications, including accelerating training of deep neural networks and enhancing Bayesian model selection for improved generalization and data efficiency. The resulting speedups and improved performance are significant for handling the increasingly large datasets and complex models prevalent in modern machine learning.

Papers

June 25, 2024

A New Perspective on Shampoo's Preconditioner
Depen Morwani, Itai Shapira, Nikhil Vyas, Eran Malach, Sham Kakade, Lucas Janson
Hessian Matrix New Perspective Adaptive Preconditioner Hair Strand Kronecker Product Approximation

March 16, 2023

Decentralized Riemannian natural gradient methods with Kronecker-product approximations
Jiang Hu, Kangkang Deng, Na Li, Quanzheng Li
Decentralized FL Fisher Information Riemannian Gradient Decentralized Optimization Kronecker Product Approximation

February 22, 2022

Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations
Alexander Immer, Tycho F. A. van der Ouderaa, Gunnar Rätsch, Vincent Fortuin, Mark van der Wilk
Deep Learning Deep Neural Network Data Augmentation Laplace Approximation Invariance Learning Kronecker Product Approximation

Kronecker Product Approximation

Papers

A New Perspective on Shampoo's Preconditioner

Decentralized Riemannian natural gradient methods with Kronecker-product approximations

Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations