Comprehensive Trustworthiness
Comprehensive trustworthiness in artificial intelligence (AI) concerns developing and evaluating AI systems that are reliable, fair, robust, safe, and private. Current research emphasizes benchmarking and improving trustworthiness across model architectures, including large language models (LLMs), multimodal LLMs, and smaller on-device models, often using techniques such as reinforcement learning from human feedback and data-centric approaches to address biases and vulnerabilities. This work is crucial for building public trust in AI and for ensuring responsible deployment in high-stakes applications, driving progress toward more dependable and ethical AI systems.