NLP Tasks
Natural Language Processing (NLP) research currently focuses on extending Large Language Models (LLMs) to a broader range of tasks: improved long-context processing, reliable benchmark construction using synthetic data, and tighter integration of generation and retrieval. Active areas include efficient frameworks for handling long input sequences within memory constraints, evaluation of LLMs on diverse and challenging benchmarks (including specialized domains such as finance and law), and mitigation of data contamination and hallucination. These advances are crucial for improving the reliability and applicability of LLMs in real-world settings, from legal tech to healthcare.
Papers
Beyond Memorization: The Challenge of Random Memory Access in Language Models
Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin
SemEval-2024 Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki
SiLLM: Large Language Models for Simultaneous Machine Translation
Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance
Branislav Pecher, Ivan Srba, Maria Bielikova