Poster: Different Victims, Same Layout: Email Visual Similarity Detection for Enhanced Email Protection

Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Cryptography and Security, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Cryptography and Security (cs.CR), Machine Learning (cs.LG)

Sachin Shukla; Omid Mirzaei


arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1145/365864...

Article . 2024 . Peer-reviewed

License: https://www.acm.org/publications/policies/copyright_policy#Background

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2024

License: CC BY NC SA

Data sources: Datacite


Publication: Article, Preprint, 02 Dec 2024
Embargo end date: 01 Jan 2024
Publisher: ACM
Journal: Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security

Authors: Sachin Shukla; Omid Mirzaei

doi: 10.1145/3658644.3691381, 10.48550/arxiv.2408.16945

arXiv: 2408.16945


Abstract

In the pursuit of an effective spam detection system, the focus has often been on identifying known spam patterns, either through rule-based detection systems or through machine learning (ML) solutions that rely on keywords. However, both kinds of system are susceptible to evasion techniques and zero-day attacks that can be mounted at low cost. As a result, an email that bypassed the defense system once can do so again in the following days, even after rules are updated or ML models are retrained. Repeated failures to detect emails whose layout resembles previously undetected spam are concerning for customers and can erode their trust in a company. Our observations show that threat actors reuse email kits extensively and can bypass detection with little effort, for example by changing the content of emails. In this work, we propose an email visual similarity detection approach, named Pisco, to improve the detection capabilities of an email threat defense system. We apply our proof of concept to real-world samples received from different sources. Our results show that email kits are reused extensively and that visually similar emails are sent to our customers at various time intervals. This method could therefore be very helpful when detection engines that rely on textual features and keywords are bypassed, which our observations show happens frequently.
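To make the idea of layout-level similarity concrete, the following is a minimal sketch of one generic way to compare two rendered emails visually. It is not Pisco's method, which the abstract does not describe. It assumes each email has already been rendered to a grayscale pixel grid, and it uses a simple average hash with a Hamming-distance threshold. The names `average_hash`, `hamming`, `is_visually_similar`, and the `max_distance` value are all hypothetical choices made for illustration.

```python
from typing import List

# Assumption: a rendered email is a grid of grayscale values in 0..255.
Grid = List[List[int]]


def downsample(img: Grid, size: int = 8) -> List[List[float]]:
    """Shrink img to size x size by averaging rectangular pixel blocks."""
    h, w = len(img), len(img[0])
    out = []
    for i in range(size):
        r0 = i * h // size
        r1 = max((i + 1) * h // size, r0 + 1)  # at least one row per block
        row = []
        for j in range(size):
            c0 = j * w // size
            c1 = max((j + 1) * w // size, c0 + 1)  # at least one column
            block = [img[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def average_hash(img: Grid, size: int = 8) -> int:
    """Return a size*size-bit hash: 1 where a block is brighter than the mean."""
    flat = [v for row in downsample(img, size) for v in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for v in flat:
        bits = (bits << 1) | (1 if v > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def is_visually_similar(a: Grid, b: Grid, max_distance: int = 5) -> bool:
    """Hypothetical decision rule: hashes within max_distance bits match."""
    return hamming(average_hash(a), average_hash(b)) <= max_distance
```

Because the hash depends only on coarse brightness structure, two emails built from the same kit but carrying different text would tend to hash close together, whereas a keyword-based detector would see them as unrelated. A production system would likely need a more robust representation than this sketch.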

To be published in the proceedings of the ACM Conference on Computer and Communications Security (ACM CCS 2024)


Impact by BIP!

- selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

