Text Distribution
Text distribution analysis focuses on understanding and manipulating the statistical properties of text data, aiming to improve various natural language processing tasks. Current research emphasizes developing methods to balance skewed distributions in training data for large language models (LLMs), detect AI-generated text by analyzing its divergence from human-written text, and describe differences between text distributions using natural language summaries. These advancements have significant implications for improving LLM performance, mitigating biases and ethical concerns in AI-generated content, and facilitating more nuanced analyses of textual data across diverse applications.
Papers
October 9, 2024
September 6, 2024
June 17, 2024
May 22, 2024
March 14, 2024
February 22, 2024
January 29, 2024
January 28, 2024
November 8, 2023
April 10, 2023
March 17, 2023
November 6, 2022
October 10, 2022
May 31, 2022
May 13, 2022
May 10, 2022