AI System

AI systems are evolving rapidly, prompting intense research into their safety, reliability, and societal impact. Current research focuses on mitigating risks through improved model explainability and interpretability, developing robust auditing and verification methods, and establishing clear liability frameworks. This work spans various model architectures, including large language models and embodied agents, and addresses challenges in fairness, bias, and user trust. Its findings bear on both scientific understanding and the responsible deployment of AI across diverse applications.

Papers

May 25, 2024

An Empirical Exploration of Trust Dynamics in LLM Supply Chains
Agathe Balayn, Mireia Yurrita, Fanny Rancourt, Fabio Casati, Ujwal Gadiraju
AI System Value Chain Trust Dynamic

May 17, 2024

Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence
Adrien Basdevant, Camille François, Victor Storchan, Kevin Bankston, Ayah Bdeir, Brian Behlendorf, Merouane Debbah, Sayash Kapoor, Yann LeCun, Mark Surman, Helen King-Turvey, Nathan Lambert, Stefano Maffulli, Nik Marda, Govind Shivkumar, Justine Tunney
Artificial Intelligence New Framework Foundation Model AI System Open Source Proceeding Modality Mutual Influence Software Stack

May 16, 2024

Societal Adaptation to Advanced AI
Jamie Bernardi, Gabriel Mukobi, Hilary Greaves, Lennart Heim, Markus Anderljung
Artificial Intelligence AI System Advanced AI Human AI Decision Making Societal Adaptation

May 10, 2024

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems
David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum
Artificial Intelligence New Framework AI System Artificial Intelligence System Safety Guarantee AI Safety Safe AI

May 9, 2024

People cannot distinguish GPT-4 from a human in a Turing test
Cameron R. Jones, Benjamin K. Bergen
Artificial Intelligence GPT 4 AI System Human Generated Person Name Level Test Cognitive Intelligence Intelligent System

April 30, 2024

Reimagining AI in Social Work: Practitioner Perspectives on Incorporating Technology in their Practice
Katie Wassal, Carolyn Ashurst, Jiri Hron, Miri Zilka
Artificial Intelligence AI System Practice Mode Technology Information Data AI Application AI Tool Participatory Design Child Welfare

April 25, 2024

Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant
Olli Järviniemi, Evan Hubinger
Language Model AI System AI Assistant Simulation Environment Real World Scenario Deception Detection Public Perception

April 23, 2024

AI Procurement Checklists: Revisiting Implementation in the Age of AI Governance
Tom Zick, Mason Kortz, David Eaves, Finale Doshi-Velez
Artificial Intelligence AI System Practical Implementation Speech Based Age Artificial Intelligence Governance AI Tool Public Procurement

April 18, 2024

April 16, 2024

The Dearth of the Author in AI-Supported Writing
Max Kreminski
AI System Expressive Speech Author Name Creative Task Writing Tool

April 10, 2024

Frontier AI Ethics: Anticipating and Evaluating the Societal Impacts of Language Model Agents
Seth Lazar
AI System AI Ethic Generative AI System Generative Agent Societal Impact

April 9, 2024

Automatic Authorities: Power and AI
Seth Lazar
Artificial Intelligence Real Power AI System Autonomous Framework

April 5, 2024

Balancing Progress and Responsibility: A Synthesis of Sustainability Trade-Offs of AI-Based Systems
Apoorva Nalini Pradeep Kumar, Justus Bogner, Markus Funke, Patricia Lago
AI System Critical Synthesis Much Progress Trade Offs Artificial Intelligence Adoption Higher Order Responsibility

March 25, 2024

"It is there, and you need it, so why do you not use it?" Achieving better adoption of AI systems by domain experts, in the case study of natural science research
Auste Simkute, Ewa Luger, Michael Evans, Rhianne Jones
Artificial Intelligence Case Study AI System Human AI Collaboration Scientific Research Domain Expert Adoption Strategy Artificial Intelligence Adoption AI Practitioner

March 23, 2024

The Limits of Perception: Analyzing Inconsistencies in Saliency Maps in XAI
Anna Stubbin, Thompson Chyrikov, Jim Zhao, Christina Chajo
Artificial Intelligence AI System Saliency Map Continuum Limit xAI Community Perception Aware Hard to Easy Inconsistency

March 15, 2024

Safety Cases: How to Justify the Safety of Advanced AI Systems
Joshua Clymer, Nick Gabrieli, David Krueger, Thomas Larsen
AI System Human SAFETY Target Argument Advanced AI Rationale Alignment Safety Case

March 13, 2024

AI System

Papers

Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

Introducing v0.5 of the AI Safety Benchmark from MLCommons

(Beyond) Reasonable Doubt: Challenges that Public Defenders Face in Scrutinizing AI in Court

Scaling Instructable Agents Across Many Simulated Worlds

Optimizing Risk-averse Human-AI Hybrid Teams