Halligan

Introduction This repository provides the official artifact for the paper:"Are CAPTCHAs Still Bot-Hard? Generalized Visual CAPTCHA Solving with Agentic Vision Language Models" Our work explores the effectiveness of Vision-Language Model (VLM) agents in solving modern visual CAPTCHAs by leveraging reasoning, abstraction, and code synthesis capabilities. Contents benchmark.zip: An interactive offline benchmark suite designed to evaluate VLM agents on their ability to solve visual CAPTCHA challenges. halligan.zip: The implementation of Halligan, our proposed VLM agent introduced in the paper

Keywords

Web Security, VLM, CAPTCHA

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average