Korean Nationwide Daily Conversation Corpus
Korean Nationwide Daily Conversation Corpora are large datasets of everyday Korean conversations used to train and evaluate natural language processing (NLP) models. Current research focuses on leveraging these corpora to improve conversational AI, particularly in areas like personality-based dialogue generation, emotion and causality recognition, and named entity recognition, often employing deep learning architectures such as Bi-LSTMs and Graph Neural Networks. These efforts aim to enhance the accuracy and naturalness of Korean language models for applications such as chatbots and knowledge base construction, while also addressing biases in data representation by analyzing demographic participation.
Papers
April 1, 2024
March 16, 2024
February 27, 2024
May 10, 2023
September 1, 2022
June 29, 2022