BioCreative VII
BioCreative VII comprised several shared tasks aimed at advancing biomedical natural language processing (NLP), with a focus on efficient information extraction from large text sources such as PubMed and social media. Participating systems relied heavily on transformer-based models such as BERT and its variants, often combined with ensemble methods and data augmentation, for tasks including multi-label classification of articles (e.g., assigning topics to COVID-19 research papers) and named entity recognition (e.g., identifying medication names in tweets). These approaches improve the speed and accuracy of literature curation and knowledge extraction, supporting faster scientific discovery and more effective public health monitoring.
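As a rough illustration of the named entity recognition framing mentioned above, the sketch below shows how a BERT-style model can be applied as a token classifier for medication mentions in a tweet using the Hugging Face transformers library. It is a minimal sketch, not the pipeline used in the papers listed here; the model name, BIO label scheme, and example tweet are illustrative assumptions, and the classification head would need fine-tuning on annotated tweets before producing meaningful labels.

```python
# Minimal sketch of BERT-style token classification for medication NER.
# Assumptions: model name, label set, and example tweet are illustrative only.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

MODEL_NAME = "bert-base-cased"                   # assumption: any BERT variant
LABELS = ["O", "B-MEDICATION", "I-MEDICATION"]   # assumption: simple BIO scheme

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForTokenClassification.from_pretrained(
    MODEL_NAME, num_labels=len(LABELS)
)

tweet = "Took some ibuprofen for this headache, still not working"
inputs = tokenizer(tweet, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits              # shape: (1, seq_len, num_labels)

predicted_ids = logits.argmax(dim=-1)[0]         # best label per subword token
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for token, label_id in zip(tokens, predicted_ids):
    # Without fine-tuning, the head is randomly initialized, so labels are noise;
    # this only demonstrates the input/output shape of the token-classification setup.
    print(f"{token}\t{LABELS[label_id]}")
```

In practice, the same framing extends to the multi-label article classification task by swapping in a sequence-classification head with sigmoid outputs over the topic labels.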
Papers
Automatic Extraction of Medication Names in Tweets as Named Entity Recognition
Carol Anderson, Bo Liu, Anas Abidin, Hoo-Chang Shin, Virginia Adams
Chemical Identification and Indexing in PubMed Articles via BERT and Text-to-Text Approaches
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin