Human Safety
Human safety in the context of rapidly advancing AI systems, particularly large language models (LLMs) and autonomous vehicles, is a critical research area focused on mitigating risks from harmful outputs, unreliable predictions, and unforeseen interactions. Current research emphasizes robust safety mechanisms, including novel algorithms such as Precision Knowledge Editing for LLMs and Physics-Enhanced Residual Policy Learning for autonomous vehicle control, as well as multi-objective learning frameworks that balance safety against performance. These efforts are crucial for the responsible deployment of AI technologies across sectors, improving the reliability and trustworthiness of these systems in real-world applications.
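As a rough illustration of the residual-policy idea mentioned above, the sketch below layers a small learned correction on top of a fixed physics-based controller. This is a minimal sketch of the generic residual-policy-learning pattern, not the method from the cited paper; the names (physics_controller, ResidualPolicy), the state layout, and all constants are illustrative assumptions.

import numpy as np

def physics_controller(state, target_speed=20.0, kp=0.5):
    # Hypothetical proportional speed controller standing in for a
    # physics-based prior; not the controller from the cited work.
    speed = state[0]
    return np.clip(kp * (target_speed - speed), -1.0, 1.0)

class ResidualPolicy:
    # Learned residual added on top of the physics prior.
    # A tiny linear model stands in for a trained network here.
    def __init__(self, state_dim, residual_scale=0.2, seed=0):
        rng = np.random.default_rng(seed)
        self.weights = rng.normal(0.0, 0.01, size=state_dim)
        self.residual_scale = residual_scale  # bounds how far learning can deviate

    def residual(self, state):
        return np.tanh(self.weights @ state) * self.residual_scale

    def act(self, state):
        # Final action = physics prior + bounded learned correction,
        # so the system degrades gracefully if the learned part is wrong.
        return physics_controller(state) + self.residual(state)

state = np.array([18.0, 0.3, -0.1])  # assumed layout: [speed, heading error, lateral offset]
policy = ResidualPolicy(state_dim=3)
print(policy.act(state))

Bounding the residual with a small scale is the key design choice in this pattern: the physics prior remains dominant, so safety does not hinge entirely on the learned component.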
Papers
Achieving the Safety and Security of the End-to-End AV Pipeline
Noah T. Curran, Minkyoung Cho, Ryan Feng, Liangkai Liu, Brian Jay Tang, Pedram MohajerAnsari, Alkim Domeke, Mert D. Pesé, Kang G. Shin
Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry
Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt
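The safety-versus-performance trade-off studied in the paper above is commonly handled with linear scalarization: a single training loss that weights a performance term against a safety term. The sketch below shows that generic pattern only; it is not the framework from the paper, and the safety_penalty proxy, the weight lam, the model, and the toy data are all hypothetical stand-ins.

import torch
import torch.nn as nn

# Hypothetical predictor and optimizer; any differentiable model would do.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 1))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

def safety_penalty(outputs):
    # Placeholder proxy: treats large-magnitude outputs as "unsafe".
    # A real system would substitute a domain-specific safety measure.
    return outputs.abs().mean()

lam = 0.5  # trade-off weight between the two objectives

for step in range(100):
    x = torch.randn(32, 16)   # stand-in batch of features
    y = torch.randn(32, 1)    # stand-in regression targets
    outputs = model(x)
    performance_loss = nn.functional.mse_loss(outputs, y)
    # Linear scalarization: one common way to combine competing objectives
    # into a single loss that gradient descent can optimize.
    loss = performance_loss + lam * safety_penalty(outputs)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

Sweeping lam from 0 upward traces out a safety-performance frontier, which is one simple way to study how much performance a given level of safety costs.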