Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Report
Data sources: ZENODO
addClaim

PhantomRed: An Autonomous AI-Powered Penetration Testing Platform with a Consent-First Ethical Framework

Authors: Machiraju, Aditya;

PhantomRed: An Autonomous AI-Powered Penetration Testing Platform with a Consent-First Ethical Framework

Abstract

Penetration testing remains a cornerstone of modern cybersecurity practice, yet its adoption is hindered by high cost, scarce expertise, and time-intensive manual workflows. We present PhantomRed, an autonomous penetration testing platform that combines a ReAct-based AI agent loop, a multi-tool reconnaissance and vulnerability scanning pipeline, and an AI-driven analysis layer to deliver end-to-end security assessments with minimal human effort. PhantomRed integrates industry-standard open-source tools—Nmap, Nuclei, FFUF, and SQLMap—with a locally hosted Llama 3 8B language model to reason over findings and dispatch targeted follow-up probes. A central design principle is a consent-first ethical framework: every scan requires explicit target pre-authorization via a scope.json manifest, a hard confirmation gate, and a blocklist preventing scans of critical infrastructure. Evaluation on the publicly authorized target scanme.nmap.org demonstrates that PhantomRed surfaces six distinct findings—including CVE-2023-48795 (CVSS 5.9)—in approximately four minutes, compared to an estimated 30–45 minutes for an experienced manual tester. PhantomRed is publicly accessible at phantomred.com under a free tier requiring no payment information.

Powered by OpenAIRE graph
Found an issue? Give us feedback