Uzbek Language
Research on the Uzbek language, an agglutinative Turkic language, is rapidly advancing, focusing on developing robust natural language processing (NLP) tools. Current efforts concentrate on creating and improving resources like corpora, and building models for tasks such as morphological analysis (using finite state machines and rule-based approaches), part-of-speech tagging, lemmatization, and text classification (employing RNNs, CNNs, and transformer-based models like BERT). These advancements are crucial for bridging the resource gap in NLP for low-resource languages like Uzbek, enabling applications in education, information retrieval, and machine translation.
Papers
September 23, 2024
May 23, 2024
December 25, 2023
March 1, 2023
February 28, 2023
January 30, 2023
October 28, 2022
October 27, 2022
September 15, 2022
July 29, 2022
May 20, 2022
May 19, 2022