Performance Score

Performance scores are central to evaluating machine learning models and other systems, and current research is refining how they are defined and computed. Work in this area develops scoring methods that go beyond simple accuracy metrics, incorporating aspects such as attention weights, retrieval-augmented generation, and multi-modal feedback. These methods aim to improve model interpretability, address biases, and provide more reliable assessments of system capabilities across applications ranging from automated essay grading to generative AI evaluation. The overall goal is evaluation frameworks that are more robust and trustworthy and that better reflect real-world performance.

Papers

December 21, 2022

Predicting the Score of Atomic Candidate OWL Class Axioms
Ali Ballout, Andrea G B Tettamanzi, Célia da Costa Pereira
Performance Score Heuristic Rule Ontology Learning OWL Reasoner

December 15, 2022

Deep Learning-Based Automatic Assessment of AgNOR-scores in Histopathology Images
Jonathan Ganz, Karoline Lipnik, Jonas Ammeling, Barbara Richter, Chloé Puget, Eda Parlak, Laura Diehl, Robert Klopfleisch, Taryn A. Donovan, Matti Kiupel, Christof A. Bertram, Katharina Breininger, Marc Aubreville
Histopathology Image Performance Score Entire Transcription Process Based Assessment

December 6, 2022

Proposal of a Score Based Approach to Sampling Using Monte Carlo Estimation of Score and Oracle Access to Target Density
Curtis McDonald, Andrew Barron
Open Sampling Posterior Sampling Monte Carlo Performance Score Score Based Test Oracle Generative Algorithm Backward Stochastic Differential Equation Monte Carlo Sampling Target Density

October 25, 2022

Useful Confidence Measures: Beyond the Max Score
Gal Yona, Amir Feder, Itay Laish
Machine Learning Performance Score Entropy Based Confidence Measure

October 9, 2022

FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation
Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
Score Based Generative Performance Score Score Based Diffusion Model Diverse Equation Fokker Planck Equation Conditional Score

October 6, 2022

SCORE: A Second-Order Conic Initialization for Range-Aided SLAM
Alan Papalia, Joseph Morales, Kevin J. Doherty, David M. Rosen, John J. Leonard
Simultaneous Localization Performance Score Pin Slam Convex Relaxation Conic Optimization Second Order Cone

September 22, 2022

Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions
Sitan Chen, Sinho Chewi, Jerry Li, Yuanzhi Li, Adil Salim, Anru R. Zhang
Diffusion Model Theoretical Understanding Probabilistic Model Langevin Dynamic Open Sampling Score Based Generative Performance Score Modeling Assumption

July 13, 2022

Semi-supervised Ranking for Object Image Blur Assessment
Qiang Li, Zhaoliang Yao, Jingjing Wang, Ye Tian, Pengju Yang, Di Xie, Shiliang Pu
Semi Supervised Learning Semi Supervised Face Image Performance Score Pairwise Similarity Pairwise Ranking

June 2, 2022

The match file format: Encoding Alignments between Scores and Performances
Francesco Foscarin, Emmanouil Karystinaios, Silvan David Peter, Carlos Cancino-Chacón, Maarten Grachten, Gerhard Widmer
Performance Score Music Transcription Better Alignment Musical Score File Classification

May 10, 2022

Turtle Score -- Similarity Based Developer Analyzer
Sanjjushri Varshini, Ponshriharini V, Santhosh Kannan, Snekha Suresh, Harshavardhan Ramesh, Rohith Mahadevan, Raja CSP Raman
Performance Score Learner Model Code Similarity Recruitment Domain

April 21, 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Tomi Kinnunen, Nicholas Evans
Speaker Verification Hybrid Fusion Performance Score Spoofing Aware Speaker Verification Fusion Based Deep Learning Automatic Speaker Verification

March 15, 2022

Better Uncertainty Calibration via Proper Scores for Classification and Beyond
Sebastian G. Gruber, Florian Buettner
Classification Code Performance Score Model Calibration Uncertainty Calibration Calibration Error Model Trustworthiness

March 1, 2022

Improving Performance of Automated Essay Scoring by using back-translation essays and adjusted scores
You-Jin Jong, Yong-Jin Kim, Ok-Chol Ri
Long Short Term Memory System Performance Performance Score Back Translation Essay Scoring

January 31, 2022

Score vs. Winrate in Score-Based Games: which Reward for Reinforcement Learning?
Luca Pasqualini, Gianluca Amato, Marco Fantozzi, Rosa Gini, Alessandro Marchetti, Carlo Metta, Francesco Morandin, Maurizio Parton
Reinforcement Learning Reward Report Performance Score Zero Sum Game Score Based Optimal Strategy Deterministic Game

January 6, 2022

ConTrip: Consensus Sentiment review Analysis and Platform ratings in a single score
José Bonet
User Sentiment Performance Score Human Rating Consensus Value Consensus Score

December 14, 2021

SCORE: Approximating Curvature Information under Self-Concordant Regularization
Adeyemi D. Adeoye, Alberto Bemporad
Performance Score Non Convex Optimization Approximate Curvature Gauss Newton Quasi Newton Method Regularization Function Self Concordant

November 12, 2021

Fully Automatic Page Turning on Real Scores
Florian Henkel, Stephanie Schwaiger, Gerhard Widmer
Performance Score Multi Modal Deep Learning Position Estimation Automatic Page