Generative Comprehension

Generative comprehension focuses on evaluating and improving the ability of large language models (LLMs), including multimodal models that incorporate audio and visual data, to understand complex inputs and generate appropriate responses to them. Current research emphasizes benchmarking these models on diverse datasets and tasks, such as question answering, captioning, and instruction following, and often compares generative approaches with extractive ones. This work is crucial for advancing LLM capabilities across domains, leading to more robust and reliable AI systems for applications ranging from legal analysis to human-computer interaction.
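
To illustrate how the generative and extractive approaches mentioned above are typically compared, the sketch below scores a model's answer with standard SQuAD-style exact match and token-level F1 metrics. This is a minimal, self-contained example; the answers and reference are illustrative placeholders, not drawn from any specific benchmark cited in the papers listed below.

```python
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace (SQuAD-style)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))

def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1: gives partial credit to paraphrased generative answers."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    common = Counter(pred_tokens) & Counter(ref_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

# Illustrative comparison: an extractive model copies a span from the passage,
# while a generative model may paraphrase; F1 gives the paraphrase partial credit.
reference = "the Eiffel Tower"
extractive_answer = "the Eiffel Tower"                  # span copied verbatim
generative_answer = "It is the Eiffel Tower in Paris"   # free-form answer

for name, answer in [("extractive", extractive_answer), ("generative", generative_answer)]:
    print(name, "EM:", exact_match(answer, reference),
          "F1:", round(token_f1(answer, reference), 2))
```

The exact-match score rewards only verbatim spans, which favors extractive systems, while token F1 captures partial overlap in free-form generative answers; benchmarks in this area often report both for that reason.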

Papers