Reddit Dataset

Reddit datasets, encompassing diverse user-generated content, serve as valuable resources for studying various aspects of online communication and human behavior. Current research focuses on leveraging these datasets to develop and evaluate machine learning models for tasks such as sentiment analysis, toxicity detection, depression identification, and credibility assessment, often employing transformer-based architectures like BERT and RoBERTa. These studies contribute to a deeper understanding of online social dynamics, mental health, and information credibility, with implications for improving online content moderation, mental health interventions, and combating misinformation.

Papers