the latest in aiBeta

Generalisation Ability

Generalization ability in machine learning focuses on a model's capacity to perform well on unseen data, a crucial aspect for real-world applications. Current research investigates how model architecture, training techniques (like incorporating noise or prompt engineering), and optimization strategies (such as targeting "flat minima") influence generalization. This research is vital because improved generalization leads to more robust and reliable AI systems across diverse domains, from natural language processing to computer vision, ultimately impacting the effectiveness and trustworthiness of AI applications.

6papers

Papers

January 1, 2025

Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro
Large Language Model Pre Trained Language Model Label Information Person Name Generalisation Ability Cognitive Empathy

July 19, 2024

Voices in a Crowd: Searching for Clusters of Unique Perspectives
Latent Embeddings Human VOICE Crowded Environment Individual Annotator Minority Group Different Cluster Language Model Synthesized View Generalisation Ability

February 13, 2024

A PAC-Bayesian Link Between Generalisation and Flat Minima
Generalisation Ability Generalization Performance Generalization Bound Gradient Based Explanation Flat Minimum Strong Generalization PAC Bayesian Log Sobolev

June 30, 2023

Navigating Noise: A Study of How Noise Influences Generalisation and Calibration of Neural Networks
Neural Network Massart Noise Learned Representation Strong Generalization Label Noise Label Invariant Augmentation Study Feature Generalisation Ability Calibration Performance

May 31, 2023

Large Language Models Are Not Strong Abstract Reasoners
Abstract Reasoning Language Model Generalisation Ability LLM Generation Estimated Team Strength

November 30, 2022

Learning Label Modular Prompts for Text Classification in the Wild
Wild Challenge Generalisation Ability Language Model Text Classification Text Classification Task Prompt Based Pseudo

May 29, 2022

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning
Strong Generalization Generalisation Ability Prompt Learning Limited Memorization Multi Agent Decoupling Coefficient