Entity Resolution

Entity resolution (ER) aims to identify records representing the same real-world entity across different datasets, a crucial task for data cleaning and integration. Current research emphasizes improving accuracy and efficiency through advanced techniques like knowledge-augmented language models, graph neural networks, and novel blocking strategies that leverage pre-trained embeddings or contrastive learning. These advancements are driving improvements in various applications, including e-commerce, healthcare, and social media analysis, by enabling more accurate and comprehensive data analysis. Furthermore, research is actively addressing challenges in evaluation methodologies and developing more robust and explainable ER systems.

Papers