
The increasing use of retrieval-augmented and large language model–based systems in criminal justice research has raised concerns regarding transparency, auditability, and control over evidentiary boundaries in behavioral case analysis. Many existing applications emphasize prediction or generative synthesis, limiting their suitability for analytically sensitive and high-risk domains. This article presents Stage I of the PROTEX project as a methodological proof of concept demonstrating how a retrieval-augmented architecture can be designed as a controlled, deterministic fact-extraction framework for criminal justice case analysis. The system operates on a structured corpus of fifty historically documented violent offender cases and enforces a strict case-boundary mechanism to prevent cross-case data contamination. System performance was evaluated using structured test queries assessing extraction accuracy, refusal behavior in the absence of relevant data, and comparative analytical tasks. Results demonstrate reliable single-case extraction and consistent enforcement of evidentiary constraints, indicating that conservatively designed retrieval-augmented systems can support transparent, auditable, and accountable criminal justice analysis.
retrieval-augmented systems, behavioral case analysis, evidence-based policing, AI auditability, controlled AI systems, criminal justice
retrieval-augmented systems, behavioral case analysis, evidence-based policing, AI auditability, controlled AI systems, criminal justice
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
