Paper ID: 2304.04576

Learnings from Data Integration for Augmented Language Models

Alon Halevy, Jane Dwivedi-Yu

One of the limitations of large language models is that they do not have access to up-to-date, proprietary or personal data. As a result, there are multiple efforts to extend language models with techniques for accessing external data. In that sense, LLMs share the vision of data integration systems whose goal is to provide seamless access to a large collection of heterogeneous data sources. While the details and the techniques of LLMs differ greatly from those of data integration, this paper shows that some of the lessons learned from research on data integration can elucidate the research path we are conducting today on language models.

Submitted: Apr 10, 2023

Topics

Large Language Model
Language Model
Data Integration
Augmented Language Model
Heterogeneous Data Source

Links

arXiv PDF