Global Evaluation

Global evaluation in various scientific domains focuses on developing robust and reliable methods for assessing the performance of models and systems, often addressing challenges in data diversity, evolving data distributions, and the need for human-centered metrics. Current research emphasizes the development of comprehensive benchmarks and evaluation frameworks, often incorporating techniques like Item Response Theory and multi-faceted metrics beyond simple accuracy, and utilizing diverse model architectures including Large Language Models (LLMs), Convolutional Neural Networks (CNNs), and Graph Neural Networks (GNNs). These advancements are crucial for ensuring the trustworthiness and effectiveness of AI systems across diverse applications, from medical diagnosis to autonomous driving, and for fostering reproducible and comparable research within the scientific community.

723papers

Papers - Page 45

April 26, 2022

Evaluation of Self-taught Learning-based Representations for Facial Emotion Recognition
Bruna Delazeri, Leonardo L. Veras, Alceu de S. Britto, Jean Paul Barddal, Alessandro L. Koerich
Global Evaluation Diverse Representation Supervised Autoencoder Representation Learning Facial Emotion Recognition Self Directed Learning Unsupervised Representation Unsupervised Feature Learning

April 22, 2022

Evaluation of Multi-Scale Multiple Instance Learning to Improve Thyroid Cancer Classification
Maximilian E. Tschuchnig, Philipp Grubmüller, Lea M. Stangassinger, Christina Kreutzer, Sébastien Couillard-Després, Gertie J. Oostingh+2
Global Evaluation Thyroid Cancer Multi Scale Multiple Instance Learning Deep Learning Auto Differentiation

April 20, 2022

Evaluation of Robust Point Set Registration Applied to Automotive Doppler Radar
Karim Haggag
Global Evaluation Ego Motion Estimation Map Registration Iterative Closest Point Automotive Radar Point Set Robotics Domain

April 19, 2022

April 18, 2022

April 12, 2022

April 11, 2022

April 10, 2022

Towards Evaluation of Autonomously Generated Musical Compositions: A Comprehensive Survey
Daniel Kvak
Global Evaluation Comprehensive Survey Subjective Knowledge Compositional Ability Musical Form Sustained Creativity Autonomous System Aesthetic Feature Adaptation Concern

April 8, 2022

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model
Hongyi Yuan, Zheng Yuan, Ruyi Gan, Jiaxing Zhang, Yutao Xie, Sheng Yu
Global Evaluation Language Model Domain Specific Language Model Text Generation Pretrained Language Model

April 6, 2022

SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
Georgia Maniati, Alexandra Vioni, Nikolaos Ellinas, Karolos Nikitaras, Konstantinos Klapsas, June Sig Sung, Gunu Jho, Aimilios Chalamandaris+1
Global Evaluation Text to Speech Acoustic Model

April 1, 2022

Evaluation of Fake News Detection with Knowledge-Enhanced Language Models
Chenxi Whitehouse, Tillman Weyde, Pranava Madhyastha, Nikos Komninos
Global Evaluation Fake News Detection Fake News Language Model

March 31, 2022

March 30, 2022

Evaluation of semantic relations impact in query expansion-based retrieval systems
Lorenzo Massai
Global Evaluation Semantic Relation Natural Language Processing Query Information Synthetic Query Generation

March 29, 2022

Investigating Data Variance in Evaluations of Automatic Machine Translation Metrics
Jiannan Xiang, Huayang Li, Yahui Liu, Lemao Liu, Guoping Huang, Defu Lian, Shuming Shi
Metric Evaluation Translation Metric Global Evaluation Data Variance Translation Task Metric Library

March 28, 2022

RoBoa: Construction and Evaluation of a Steerable Vine Robot for Search and Rescue Applications
Pascal Auf der Maur, Betim Djambazi, Yves Haberthür, Patricia Hörmann, Alexander Kübler, Michael Lustenberger, Samuel Sigrist, Oda Vigen+5
Global Evaluation Humanitarian Response Vine Robot Robot Person Construction Industry Rescue Robot

Global Evaluation

Papers - Page 45

Evaluation of Self-taught Learning-based Representations for Facial Emotion Recognition

Evaluation of Multi-Scale Multiple Instance Learning to Improve Thyroid Cancer Classification

Evaluation of Robust Point Set Registration Applied to Automotive Doppler Radar

GAM(e) changer or not? An evaluation of interpretable machine learning models based on additive model constraints

UID2021: An Underwater Image Dataset for Evaluation of No-reference Quality Assessment Metrics

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

NFT Appraisal Prediction: Utilizing Search Trends, Public Market Data, Linear Regression and Recurrent Neural Networks

Learning Performance Graphs from Demonstrations via Task-Based Evaluations

EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data

A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference

Evaluation of Automatic Text Summarization using Synthetic Facts

Towards Evaluation of Autonomously Generated Musical Compositions: A Comprehensive Survey

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model

SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis

Evaluation of Fake News Detection with Knowledge-Enhanced Language Models

On the Evaluation of NLP-based Models for Software Engineering

Effective data screening technique for crowdsourced speech intelligibility experiments: Evaluation with IRM-based speech enhancement

Evaluation of semantic relations impact in query expansion-based retrieval systems

Investigating Data Variance in Evaluations of Automatic Machine Translation Metrics

RoBoa: Construction and Evaluation of a Steerable Vine Robot for Search and Rescue Applications