Fresh-Phish: A Framework for Auto-Detection of Phishing Websites

Publication: Article, Conference object; 01 Aug 2017; Publisher: IEEE; Journal: 2017 IEEE International Conference on Information Reuse and Integration (IRI)

Authors: Hossein Shirazi; Kyle Haefner; Indrakshi Ray

doi: 10.1109/iri.2017.40

Abstract

Denizens of the Internet are coming under a barrage of phishing attacks of increasing frequency and sophistication. Emails accompanied by authentic-looking websites are ensnaring users who unwittingly hand over their credentials, compromising both their privacy and security. Methods such as blacklisting these phishing websites have become untenable and cannot keep pace with the explosion of fake sites. Detection of nefarious websites must become automated and be able to adapt to this ever-evolving form of social engineering. We develop a framework, called "Fresh-Phish", for creating current machine learning data for phishing websites. Using 30 different website features that we query using Python, we build a large labeled dataset and analyze several machine learning classifiers against this dataset to determine which is the most accurate. We analyze not just the accuracy of each technique, but also how long it takes to train the model.
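
The workflow described in the abstract (extract features per website, label them, then compare classifiers on accuracy and training time) can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the abstract does not list the 30 features, so the five lexical URL features below (and their names) are hypothetical stand-ins, and `train_threshold_rule` is a toy rule, not one of the classifiers the authors evaluated.

```python
import ipaddress
import time
from urllib.parse import urlparse


def url_features(url):
    """Return a few illustrative lexical features of a URL (hypothetical names)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    try:
        ipaddress.ip_address(host)  # raises ValueError if host is not an IP literal
        host_is_ip = 1
    except ValueError:
        host_is_ip = 0
    return {
        "url_length": len(url),
        "host_is_ip": host_is_ip,
        "has_at_symbol": int("@" in url),
        "host_dots": host.count("."),
        "uses_https": int(parsed.scheme == "https"),
    }


def evaluate(train_fn, train_set, test_set):
    """Train a classifier and return (accuracy, training_seconds)."""
    start = time.perf_counter()
    predict = train_fn(train_set)
    elapsed = time.perf_counter() - start
    correct = sum(predict(x) == y for x, y in test_set)
    return correct / len(test_set), elapsed


def train_threshold_rule(train_set):
    """Toy stand-in for a real learner: flag URLs with an IP host or an '@'."""
    def predict(features):
        return int(features["host_is_ip"] or features["has_at_symbol"])
    return predict


# Tiny hand-labeled sample (1 = phishing, 0 = legitimate); illustrative only.
labeled = [
    ("https://www.example.com/login", 0),
    ("http://192.0.2.10/secure/update", 1),
    ("http://example.com@evil.example.net/", 1),
    ("https://docs.example.org/", 0),
]
data = [(url_features(u), y) for u, y in labeled]
accuracy, seconds = evaluate(train_threshold_rule, data[:2], data[2:])
print(f"accuracy={accuracy:.2f}, train_time={seconds:.6f}s")
```

Passing each candidate learner to `evaluate` as a `train_fn` would give a like-for-like comparison of accuracy and training time, which is the trade-off the abstract says the study examines.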

Related Organizations

Colorado State University, United States
Malek Ashtar University of Technology, Iran (Islamic Republic of)

Impact by BIP!

- selected citations: 19. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.