Multi-Agent and Retrieval-Augmented Techniques for Data Quality: A Systematic Literature Review

Session

Information Systems

Description

In the era of ever-growing amounts of data and enterprise analytics, managing data quality by ensuring accuracy, completeness, and consistency has become a critical prerequisite for reliable insights and decision-making. Traditional techniques such as rule-based validation, anomaly detection, and data profiling remain essential foundations for identifying and addressing common data issues in analytics warehouses. However, the increasing volume, variety, and velocity of modern data environments demand more adaptive and intelligent solutions. Recent advances in artificial intelligence, including Large Language Models, Retrieval-Augmented Generation, and multi-agent systems, introduce new opportunities for automating anomaly detection, contextual validation, and data repair. These approaches combine external knowledge retrieval, and coordinated agent-based workflows to improve the detection of data anomalies and enhance data completeness and consistency. This systematic literature review focuses on synthesizing and comparing different practices and methods recommended by researchers and practitioners, with the purpose of preparing a comprehensive view of the recent modern AI-driven approaches suggested for data quality in analytics warehouses.

Keywords:

Data Quality, Analytics Warehouses, Large Language Models, RetrievalAugmented Generation (RAG), Multi-Agent Systems

Proceedings Editor

Edmond Hajrizi

ISBN

978-9951-982-41-2

Location

UBT Lipjan, Kosovo

Start Date

25-10-2025 9:00 AM

End Date

26-10-2025 6:00 PM

DOI

10.33107/ubt-ic.2025.220

This document is currently not available here.

Share

COinS
 
Oct 25th, 9:00 AM Oct 26th, 6:00 PM

Multi-Agent and Retrieval-Augmented Techniques for Data Quality: A Systematic Literature Review

UBT Lipjan, Kosovo

In the era of ever-growing amounts of data and enterprise analytics, managing data quality by ensuring accuracy, completeness, and consistency has become a critical prerequisite for reliable insights and decision-making. Traditional techniques such as rule-based validation, anomaly detection, and data profiling remain essential foundations for identifying and addressing common data issues in analytics warehouses. However, the increasing volume, variety, and velocity of modern data environments demand more adaptive and intelligent solutions. Recent advances in artificial intelligence, including Large Language Models, Retrieval-Augmented Generation, and multi-agent systems, introduce new opportunities for automating anomaly detection, contextual validation, and data repair. These approaches combine external knowledge retrieval, and coordinated agent-based workflows to improve the detection of data anomalies and enhance data completeness and consistency. This systematic literature review focuses on synthesizing and comparing different practices and methods recommended by researchers and practitioners, with the purpose of preparing a comprehensive view of the recent modern AI-driven approaches suggested for data quality in analytics warehouses.