Text Reuse

Text reuse, encompassing the identification and analysis of duplicated or adapted text across various sources, is a growing area of research with applications in plagiarism detection, model training, and understanding knowledge dissemination. Current research focuses on developing robust methods for detecting text reuse, even across semantically altered versions, and on analyzing the implications of reuse biases in machine learning models, particularly in off-policy reinforcement learning and transfer learning for image processing. These efforts are crucial for improving the reliability of scientific findings, enhancing the fairness and efficiency of machine learning algorithms, and providing valuable insights into the dynamics of knowledge creation and dissemination across disciplines.

Papers