Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint . 2026
License: CC BY NC ND
Data sources: ZENODO
addClaim

RAG Shield: A Multi-Layer Defense System Against Poisoning Attacks in Retrieval-Augmented Generation

Authors: Petti, Fabio;

RAG Shield: A Multi-Layer Defense System Against Poisoning Attacks in Retrieval-Augmented Generation

Abstract

This whitepaper presents RAG Shield, a security-focused framework fordefending Retrieval-Augmented Generation (RAG) pipelines againstpoisoning and adversarial manipulation at the retrieval layer. The work introduces a multi-layer defense architecture combiningcryptographic document provenance validation, semantic anomaly detection,and secure, authority-weighted retrieval control. A realistic threatmodel is defined, focusing on poisoning of retrieval corpora rather thanprompt or model-level attacks. The system is evaluated against multipleattack scenarios under controlled conditions. RAG Shield is designed as a framework-agnostic security control layerthat operates independently of the underlying language model and vectordatabase, enabling deployment in enterprise and regulated environmentswithout modification of existing RAG architectures. This document is released as a technical preprint to establish prior artand support open discussion in the areas of AI security, adversarialmachine learning, and secure enterprise RAG deployment. Project website and system overview:https://sentinelrag.com Contact:info@sentinelrag.com

Keywords

poisoning attacks, retrieval-augmented generation, AI security, RAG security, adversarial machine learning

Powered by OpenAIRE graph
Found an issue? Give us feedback