Training Data Leakage

Training data leakage is the tendency of machine learning models, particularly large language models (LLMs) and deep neural networks, to reveal sensitive information from their training datasets. Current research focuses on how various attack vectors, including gradient inversion and the exploitation of specific character patterns, can extract this data, even from models trained with seemingly secure methods such as differential privacy. Because such leakage threatens both privacy and intellectual property, it affects the development and deployment of machine learning systems across diverse applications and is driving efforts to build more robust training and defense mechanisms.
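
To make the gradient-inversion attack vector concrete, the sketch below shows the basic gradient-matching idea in the style of "deep leakage from gradients" attacks: an attacker who observes per-sample gradients (e.g., in federated learning) optimizes a dummy input and soft label so that their gradients match the observed ones, thereby reconstructing the private sample. This is a minimal toy illustration, not any specific paper's implementation; the linear model, dimensions, and all variable names are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical victim model and one private training sample (unknown to the attacker).
model = nn.Linear(20, 5)
params = tuple(model.parameters())
x_true = torch.randn(1, 20)   # private input
y_true = torch.tensor([3])    # private label

# The victim computes gradients on the private sample; in federated learning,
# these gradients are what gets shared and what the attacker observes.
true_loss = F.cross_entropy(model(x_true), y_true)
true_grads = torch.autograd.grad(true_loss, params)

# Attacker: initialize a random dummy input and a soft (optimizable) label.
x_dummy = torch.randn(1, 20, requires_grad=True)
y_dummy = torch.randn(1, 5, requires_grad=True)
opt = torch.optim.LBFGS([x_dummy, y_dummy])

def closure():
    opt.zero_grad()
    pred = model(x_dummy)
    # Soft-label cross-entropy so the unknown label can be optimized jointly.
    dummy_loss = -(F.softmax(y_dummy, dim=-1) * F.log_softmax(pred, dim=-1)).sum(dim=-1).mean()
    # create_graph=True lets us backpropagate through the gradient computation.
    dummy_grads = torch.autograd.grad(dummy_loss, params, create_graph=True)
    # Objective: squared distance between attacker gradients and observed gradients.
    grad_diff = sum(((dg - tg) ** 2).sum() for dg, tg in zip(dummy_grads, true_grads))
    grad_diff.backward()
    return grad_diff

for _ in range(50):
    opt.step(closure)

# Small error indicates the private input was (approximately) reconstructed.
print("reconstruction MSE:", F.mse_loss(x_dummy.detach(), x_true).item())
```

On a model this small the reconstruction typically converges closely; defenses such as gradient noising or clipping (as in differentially private training) aim to degrade exactly this gradient-matching objective.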

Papers