
This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How robust are RAG systems to adversarial or out-of-distribution queries in multimodal settings (e.g., image+text benchmarks like VCR) when evaluated using metrics like BLEU or CHRF. 6 claims were extracted from source literature; 6 were independently verified against retrieved documents. An automated multi-reviewer quality assessment produced a score of 9.3/10. This report is a machine-generated literature synthesis and does not constitute original research.Research goal: How robust are RAG systems to adversarial or out-of-distribution queries in multimodal settings (e.g., image+text benchmarks like VCR) when evaluated using metrics like BLEU or CHRF?Autonomous literature synthesis. Automated review score: 9.3/10. Full text and citation available at Assignee Research.
