Paper ID: 2402.15873

SemEval-2024 Task 8: Weighted Layer Averaging RoBERTa for Black-Box Machine-Generated Text Detection

Ayan Datta, Aryan Chandramania, Radhika Mamidi

This paper describes the authors' submission to SemEval-2024 Task 8: Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection, covering Subtask A (monolingual) and Subtask B. Detecting machine-generated text is becoming increasingly important with the advent of large language models (LLMs). In this paper, we lay out how weighted averages of RoBERTa layers let us capture information about a text that is relevant to machine-generated text detection.
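As a rough illustration of what a weighted average over encoder layers can look like, the sketch below combines per-layer hidden states with one scalar weight per layer. It is not taken from the paper: the function name `weighted_layer_average`, the softmax normalization of the weights, and the toy shapes (13 hidden states, as RoBERTa-base exposes when the embedding output is included) are all assumptions made for the example, and random arrays stand in for real model outputs.

```python
import numpy as np

def weighted_layer_average(hidden_states, layer_logits):
    """Combine per-layer hidden states into a single representation.

    hidden_states: shape (num_layers, seq_len, hidden_dim)
    layer_logits:  shape (num_layers,); trainable parameters in a real model
    """
    hidden_states = np.asarray(hidden_states, dtype=float)
    layer_logits = np.asarray(layer_logits, dtype=float)
    # Assumption: normalize with a softmax so weights are positive and sum to 1.
    shifted = layer_logits - layer_logits.max()  # subtract max for numerical stability
    weights = np.exp(shifted) / np.exp(shifted).sum()
    # Contract the layer axis: sum_l w_l * H_l -> (seq_len, hidden_dim)
    return np.tensordot(weights, hidden_states, axes=1)

# Toy stand-in for encoder outputs: 13 layers, 4 tokens, 8 dimensions.
rng = np.random.default_rng(0)
states = rng.normal(size=(13, 4, 8))
logits = np.zeros(13)  # equal logits give a plain mean over layers
pooled = weighted_layer_average(states, logits)
```

With all logits equal, the result reduces to the unweighted mean of the layers. In a trained model the logits would be learned jointly with the classifier, so layers that carry more useful signal for detection receive larger weights.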

Submitted: Feb 24, 2024

Topics

Large Language Model
Multi Layer
Black Box
Multilingual Model
SemEval 2024 Task
Machine Generated
Machine Generated Text
