Medical Corpus
Medical corpora are collections of textual medical data used to train and evaluate natural language processing (NLP) models for various healthcare applications. Current research focuses on developing larger, multilingual corpora, improving data cleaning techniques (e.g., using ensemble methods), and employing retrieval-augmented generation (RAG) and transformer-based models like BERT and LLMs (including fine-tuning on medical data) to enhance accuracy and address challenges like hallucinations and outdated information. These advancements are crucial for improving medical information retrieval, question answering, clinical text simplification, and other tasks, ultimately leading to more efficient and effective healthcare practices.
Papers
November 17, 2024
October 26, 2024
June 2, 2024
February 21, 2024
February 20, 2024
November 28, 2023
November 27, 2023
November 6, 2023
September 5, 2023
June 5, 2023
June 2, 2023
February 11, 2023
December 16, 2022
October 5, 2022
September 13, 2022
July 27, 2022
July 17, 2022
May 6, 2022