Training Data

Training data is crucial for machine learning model development, with current research focusing on improving data quality, efficiency, and mitigating biases. Active areas include generating synthetic data to address scarcity or privacy concerns, developing algorithms to optimize data selection and usage (e.g., self-paced learning, active learning), and mitigating issues like data contamination and imbalance through techniques such as data augmentation, selective parameter merging, and novel loss functions. The quality and characteristics of training data significantly impact model performance, generalization, and robustness, influencing various applications from natural language processing and image recognition to scientific computing and medical diagnosis.

1037papers

Papers - Page 35

January 14, 2024

January 11, 2024

Knowledge Translation: A New Pathway for Model Compression
Wujie Sun, Defang Chen, Jiawei Chen, Yan Feng, Chun Chen, Can Wang
Model Compression New Pathway Knowledge Based Training Data Deep Learning

January 10, 2024

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Evan Hubinger, Carson Denison, Jesse Mu, Mike Lambert, Meg Tong, Monte MacDiarmid, Tamera Lanham, Daniel M. Ziegler, Tim Maxwell+30
Training Data Large Language Model Deceptive Diffusion Adversarial Training Backdoor Attack Backdoor Behavior Malicious Agent

January 9, 2024

Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Qinyi Luo, Penghan Wang, Wei Zhang, Fan Lai, Jiachen Mao, Xiaohan Wei, Jun Song, Wei-Yu Tsai, Shuai Yang, Yuxi Hu, Xuehai Qian
Training Data Deep Learning Fine Grained Jina Embeddings Recommender System

January 5, 2024

Training and Serving System of Foundation Models: A Comprehensive Survey
Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, Zibin Zheng
Foundation Model Training Data Efficient Training Comprehensive Survey State of the Art Artificial General Intelligence

January 4, 2024

January 3, 2024

The Power of Training: How Different Neural Network Setups Influence the Energy Demand
Daniel Geißler, Bo Zhou, Mengxi Liu, Sungho Suh, Paul Lukowicz
Training Data High Performance Computing System NN Hyperparameters Real Power Electricity Demand Different Neural Network

January 1, 2024

December 31, 2023

Training towards significance with the decorrelated event classifier transformer neural network
Jaebak Kim
Training Data Importance Aware Neural Network Neural Architecture Natural Language Processing Transformer Based Event Transformer

December 29, 2023

Generating Enhanced Negatives for Training Language-Based Object Detectors
Shiyu Zhao, Long Zhao, Vijay Kumar B. G, Yumin Suh, Dimitris N. Metaxas, Manmohan Chandraker, Samuel Schulter
Language Based Training Data Text to Image Diffusion Model Object Detector Negative Information Negation Detection

Training Data

Papers - Page 35

MapNeXt: Revisiting Training and Scaling Practices for Online Vectorized HD Map Construction

Enhanced Few-Shot Class-Incremental Learning via Ensemble Models

Microphone Conversion: Mitigating Device Variability in Sound Event Classification

The Unreasonable Effectiveness of Easy Training Data for Hard Tasks

Enhancing Consistency and Mitigating Bias: A Data Replay Approach for Incremental Learning

Knowledge Translation: A New Pathway for Model Compression

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems

Training and Serving System of Foundation Models: A Comprehensive Survey

Comprehensive Exploration of Synthetic Data Generation: A Survey

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

Understanding LLMs: A Comprehensive Overview from Training to Inference

The Power of Training: How Different Neural Network Setups Influence the Energy Demand

Self-supervised learning for skin cancer diagnosis with limited training data

Digger: Detecting Copyright Content Mis-usage in Large Language Model Training

Training towards significance with the decorrelated event classifier transformer neural network

Generating Enhanced Negatives for Training Language-Based Object Detectors

The Duck's Brain: Training and Inference of Neural Networks in Modern Database Engines

SparseProp: Efficient Event-Based Simulation and Training of Sparse Recurrent Spiking Neural Networks

Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation