Level Test
Level testing encompasses a broad range of techniques for evaluating the performance and reliability of systems, from software and AI models to physical structures and robots. Current research focuses on developing automated testing frameworks, leveraging AI models such as LLMs and CNNs for test generation, analysis, and evaluation, and employing techniques such as Structure from Motion for high-precision 3D modeling in physical testing. These advances aim to improve the efficiency, accuracy, and robustness of testing processes across diverse scientific and engineering domains, ultimately leading to more reliable and trustworthy systems.
Papers
Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test
Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen
The Door and Drawer Reset Mechanisms: Automated Mechanisms for Testing and Data Collection
Kyle DuFrene, Luke Strohbehn, Keegan Nave, Ravi Balasubramanian, Cindy Grimm
Development and Testing of a Novel Large Language Model-Based Clinical Decision Support Systems for Medication Safety in 12 Clinical Specialties
Jasmine Chiat Ling Ong, Liyuan Jin, Kabilan Elangovan, Gilbert Yong San Lim, Daniel Yan Zheng Lim, Gerald Gui Ren Sng, Yuhe Ke, Joshua Yi Min Tung, Ryan Jian Zhong, Christopher Ming Yao Koh, Keane Zhi Hao Lee, Xiang Chen, Jack Kian Chng, Aung Than, Ken Junyang Goh, Daniel Shu Wei Ting
Development and Testing of Retrieval Augmented Generation in Large Language Models -- A Case Study Report
YuHe Ke, Liyuan Jin, Kabilan Elangovan, Hairil Rizal Abdullah, Nan Liu, Alex Tiong Heng Sia, Chai Rick Soh, Joshua Yi Min Tung, Jasmine Chiat Ling Ong, Daniel Shu Wei Ting
Testing learning-enabled cyber-physical systems with Large-Language Models: A Formal Approach
Xi Zheng, Aloysius K. Mok, Ruzica Piskac, Yong Jae Lee, Bhaskar Krishnamachari, Dakai Zhu, Oleg Sokolsky, Insup Lee
Context Consistency between Training and Testing in Simultaneous Machine Translation
Meizhi Zhong, Lemao Liu, Kehai Chen, Mingming Yang, Min Zhang
Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests
Max J. van Duijn, Bram M. A. van Dijk, Tom Kouwenhoven, Werner de Valk, Marco R. Spruit, Peter van der Putten
Does GPT-4 pass the Turing test?
Cameron R. Jones, Benjamin K. Bergen
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
Eric Zelikman, Wanjing Anya Ma, Jasmine E. Tran, Diyi Yang, Jason D. Yeatman, Nick Haber
Test & Evaluation Best Practices for Machine Learning-Enabled Systems
Jaganmohan Chandrasekaran, Tyler Cody, Nicola McCarthy, Erin Lanus, Laura Freeman