Direct Assessment

Direct assessment encompasses a broad range of techniques for evaluating diverse systems and phenomena, from the psychological traits of language models to the precision of 3D models and the performance of autonomous vehicles. Current research focuses on developing robust and reliable assessment methods, often employing machine learning models like VQ-VAEs, various neural networks (including vision transformers and graph neural networks), and large language models (LLMs) for automated analysis and evaluation. These advancements are crucial for improving the trustworthiness and reliability of AI systems, enhancing diagnostic capabilities in healthcare, and optimizing performance in various engineering and scientific domains.

Papers

November 11, 2023

ALBA: Adaptive Language-based Assessments for Mental Health
Vasudha Varadarajan, Sverker Sikström, Oscar N. E. Kjell, H. Andrew Schwartz
Adaptive Importance Mental Health Direct Assessment Item Response Theory Psychometric Property Language Assessment

November 8, 2023

GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration
Xuhao Shan, Xinyang Li, Ruiquan Ge, Shibin Wu, Ahmed Elazab, Jichao Zhu, Lingyan Zhang, Gangyong Jia, Qingying Xiao, Xiang Wan, Changmiao Wang
Self Attention Domain Knowledge Direct Assessment 3D Brain Intracerebral Hemorrhage Intracranial Hemorrhage

November 3, 2023

Plot Retrieval as an Assessment of Abstract Semantic Association
Shicheng Xu, Liang Pang, Jiangnan Li, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou
Information Retrieval Direct Assessment Retrieval Model Semantic Association Lexicon Based Retrieval

October 30, 2023

Facial asymmetry: A Computer Vision based behaviometric index for assessment during a face-to-face interview
Shuvam Keshari, Tanusree Dutta, Raju Mullick, Ashish Rathor, Priyadarshi Patnaik
Computer Vision Direct Assessment Psychometric Property Behavioral Measure Face to Face Behavioral Alignment Behavioural Analysis

October 17, 2023

Exploration of the Assessment for AVP Algorithm Training in Underground Parking Garages Simulation Scenario
Wenjin Li
Environment Exploration Direct Assessment Simulation Environment Valet Parking

October 4, 2023

Assessment of Prediction Intervals Using Uncertainty Characteristics Curves
Jiri Navratil, Benjamin Elder, Matthew Arnold, Soumya Ghosh, Prasanna Sattigeri
Direct Assessment Model Uncertainty Prediction Interval Uncertainty Expression

October 1, 2023

Segmentation-based Assessment of Tumor-Vessel Involvement for Surgical Resectability Prediction of Pancreatic Ductal Adenocarcinoma
Christiaan Viviers, Mark Ramaekers, Amaan Valiuddin, Terese Hellström, Nick Tasios, John van der Ven, Igor Jacobs, Lotte Ewals, Joost Nederend, Peter de With, Misha Luyer, Fons van der Sommen
Direct Assessment Pancreatic Cancer Tumor Associated Vasculature

September 29, 2023

Assessment and treatment of visuospatial neglect using active learning with Gaussian processes regression
Ivan De Boi, Elissa Embrechts, Quirine Schatteman, Rudi Penne, Steven Truijen, Wim Saeys
Active Learning Direct Assessment Gaussian Process Regression Cognitive Impairment Accurate Treatment Visual Stimulus Based Assessment

September 27, 2023

Assessment of Local Climate Zone Products via Simplified Classification Rule with 3D Building Maps
Hunsoo Song, Gaia Cervini, Jinha Jung
Direct Assessment Classification Rule Building Footprint Set Valued Building Mapping

September 25, 2023

Assessment of a new GeoAI foundation model for flood inundation mapping
Wenwen Li, Hyunho Lee, Sizhe Wang, Chia-Yu Hsu, Samantha T. Arundel
Direct Assessment Vision Foundation Model Geospatial Artificial Intelligence GeoAI System Flood Inundation Mapping Geospatial Foundation Model GeoAI Research

September 20, 2023

Assessment of Pre-Trained Models Across Languages and Grammars
Alberto Muñoz-Ortiz, David Vilares, Carlos Gómez-Rodríguez
Pre Trained Model Direct Assessment Unknown Language Multilingual Large Language Model Sequence Labeling Dependency Parsing Dependency Structure Grammar Description

September 14, 2023

An Assessment of ChatGPT on Log Data
Priyanka Mudgal, Rita Wouhaybi
Large Language Model ChatGPT Generated Conversation Text Generation Large Scale Direct Assessment Log Message Log Pattern

August 31, 2023

Vision-Based Cranberry Crop Ripening Assessment
Faith Johnson, Jack Lowry, Kristin Dana, Peter Oudemans
Computer Vision Direct Assessment Crop Breeding Scene Albedo

August 28, 2023

RESTORE: Graph Embedding Assessment Through Reconstruction
Hong Yung Yip, Chidaksh Ravuru, Neelabha Banerjee, Shashwat Jha, Amit Sheth, Aman Chadha, Amitava Das
Graph Drawing Full State Reconstruction Direct Assessment Graph Embeddings Graph Reconstruction Word2Vec Model

August 25, 2023

A Bayesian Active Learning Approach to Comparative Judgement
Andy Gray, Alma Rahat, Tom Crick, Stephen Lindsay
Active Learning Direct Assessment Ranking Model Pairwise Comparison Comparative Evaluation Bayesian Active Learning

August 24, 2023

GPTEval: A Survey on Assessments of ChatGPT and GPT-4
Rui Mao, Guanyi Chen, Xulang Zhang, Frank Guerin, Erik Cambria
Large Language Model Timely Survey ChatGPT Generated Conversation GPT 4 Direct Assessment Reasoning Ability Evaluation Method

August 21, 2023

Using language models in the implicit automated assessment of mathematical short answer items
Christopher Ormerod
Language Model Direct Assessment Level Mathematics Rubric Based Math Question Student Response Attribute Value Identification

August 15, 2023

Large Language Models in Introductory Programming Education: ChatGPT's Performance and Implications for Assessments
Natalie Kiesler, Daniel Schiffner
ChatGPT Generated Conversation Direct Assessment Future Implication Task Specification Programming Education Introductory Programming Novice Programmer

July 3, 2023

Assessment of the Utilization of Quadruped Robots in Pharmaceutical Research and Development Laboratories
Brian Parkinson, Ádám Wolf, Péter Galambos, Károly Széll
Non Humanoid Robot Direct Assessment Mobile Manipulator Quadruped Robot Resource Utilization Automation System Robot Assistance Quality Inspection

June 15, 2023

Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses
Jaromir Savelka, Arav Agarwal, Marshall An, Chris Bogart, Majd Sakr
GPT 4 Direct Assessment Much Progress Programming Education
