Multiplicative Size Scaling
Multiplicative size scaling in machine learning investigates how model performance changes as model parameters, training data, and other resources are scaled up. Current research focuses on characterizing and improving this scaling behavior across model architectures, including transformers, diffusion models, and graph neural networks, often employing techniques such as parameter-efficient fine-tuning and improved data-sampling strategies to enhance efficiency and generalization. These investigations are crucial for developing more powerful and resource-efficient AI systems, with impact on fields ranging from natural language processing and computer vision to scientific computing and robotics. A key theme is moving beyond simple scaling to understanding and optimizing the interplay between model size, data quality, and training methodology.
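To make the idea of "how performance changes with scale" concrete, the sketch below fits a power-law scaling curve of the kind commonly reported in empirical scaling-law studies, where loss decreases roughly as loss(N) ≈ a · N^(−α) + c with parameter count N. The constants, data points, and the assumption that the irreducible term c is known are illustrative only, not taken from any of the papers listed here.

```python
# Minimal sketch: recovering a power-law scaling exponent from
# hypothetical (parameter count, validation loss) measurements.
# Assumed form: loss(N) = a * N**(-alpha) + c (values are made up).
import numpy as np

rng = np.random.default_rng(0)

n_params = np.array([1e6, 3e6, 1e7, 3e7, 1e8, 3e8, 1e9])
true_a, true_alpha, true_c = 50.0, 0.35, 1.7

# Excess loss above the irreducible floor, with small multiplicative noise.
excess = true_a * n_params ** (-true_alpha)
excess *= 1.0 + 0.02 * rng.standard_normal(excess.shape)
loss = excess + true_c

# With the floor c subtracted, the relation is linear in log-log space,
# so a least-squares line recovers the exponent alpha and coefficient a.
slope, intercept = np.polyfit(np.log(n_params), np.log(loss - true_c), 1)

print(f"fitted exponent alpha ~ {-slope:.3f}")      # close to 0.35
print(f"fitted coefficient a ~ {np.exp(intercept):.1f}")
```

In practice the irreducible term and the exponents along the parameter and data axes are fit jointly from many training runs; this single-axis fit is only meant to show the shape of the relationship the summary above refers to.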
Papers
Scaling Large-Language-Model-based Multi-Agent Collaboration
Chen Qian, Zihao Xie, Yifei Wang, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun
Scaling up masked audio encoder learning for general audio classification
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang, Bin Wang