AI System
AI systems are rapidly evolving, prompting intense research into their safety, reliability, and societal impact. Current research focuses on mitigating risks through improved model explainability and interpretability, developing robust auditing and verification methods, and establishing clear liability frameworks. This work spans various model architectures, including large language models and embodied agents, and addresses crucial challenges in fairness, bias, and user trust, with implications for both scientific understanding and the responsible deployment of AI in diverse applications.
Papers
Distributed Learning and Inference Systems: A Networking Perspective
Hesham G. Moussa, Arashmid Akhavain, S. Maryam Hosseini, Bill McCormick
Open Problems in Machine Unlearning for AI Safety
Fazl Barez, Tingchen Fu, Ameya Prabhu, Stephen Casper, Amartya Sanyal, Adel Bibi, Aidan O'Gara, Robert Kirk, Ben Bucknall, Tim Fist, Luke Ong, Philip Torr, Kwok-Yan Lam, Robert Trager, David Krueger, Sören Mindermann, José Hernandez-Orallo, Mor Geva, Yarin Gal
The State of Post-Hoc Local XAI Techniques for Image Processing: Challenges and Motivations
Rech Leong Tian Poh, Sye Loong Keoh, Liying Li
Toward Information Theoretic Active Inverse Reinforcement Learning
Ondrej Bajgar, Sid William Gould, Rohan Narayan Langford Mitta, Jonathon Liu, Oliver Newcombe, Jack Golden
Efficient Human-in-the-Loop Active Learning: A Novel Framework for Data Labeling in AI Systems
Yiran Huang, Jian-Feng Yang, Haoda Fu
The Digital Ecosystem of Beliefs: does evolution favour AI over humans?
David M. Bossens, Shanshan Feng, Yew-Soon Ong
Is AI Robust Enough for Scientific Research?
Jun-Jie Zhang, Jiahao Song, Xiu-Cheng Wang, Fu-Peng Li, Zehan Liu, Jian-Nan Chen, Haoning Dang, Shiyao Wang, Yiyan Zhang, Jianhui Xu, Chunxiang Shi, Fei Wang, Long-Gang Pang, Nan Cheng, Weiwei Zhang, Duo Zhang, Deyu Meng
Usage Governance Advisor: from Intent to AI Governance
Elizabeth M. Daly, Sean Rooney, Seshu Tirupathi, Luis Garces-Erice, Inge Vejsbjerg, Frank Bagehorn, Dhaval Salwala, Christopher Giblin, Mira L. Wolf-Bauwens, Ioana Giurgiu, Michael Hind, Peter Urbanetz
AI Benchmarks and Datasets for LLM Evaluation
Todor Ivanov, Valeri Penchev