Twitter Dataset
Twitter datasets are collections of tweets used in natural language processing (NLP) research, chiefly to understand and mitigate misinformation, analyze public sentiment, and detect online manipulation. Current research applies transformer-based models such as BERT and RoBERTa, along with other deep learning architectures, to tasks such as sentiment classification, hate speech detection, and the identification of manipulated or misleading information. These datasets and the models trained on them can help improve the trustworthiness of online information, inform public health initiatives, and deepen understanding of social dynamics and political polarization.
Papers
An Exploratory Study of Tweets about the SARS-CoV-2 Omicron Variant: Insights from Sentiment Analysis, Language Interpretation, Source Tracking, Type Classification, and Embedded URL Detection
Nirmalya Thakur, Chia Y. Han
A Large-Scale Dataset of Twitter Chatter about Online Learning during the Current COVID-19 Omicron Wave
Nirmalya Thakur