Detecting visually similar web pages
Application to phishing detection
Article Ecrit par: Chen, Teh-Chung ; Dick, Scott ; Miller, James ;
Résumé: We propose a novel approach for detecting visual similarity between two Web pages. The proposed approach applies Gestalt theory and considers aWeb page as a single indivisible entity. The concept of supersignals, as a realization of Gestalt principles, supports our contention thatWeb pages must be treated as indivisible entities. We objectify, and directly compare, these indivisible supersignals using algorithmic complexity theory. We illustrate our approach by applying it to the problem of detecting phishing scams. Via a large-scale, real-world case study, we demonstrate that 1) our approach effectively detects similar Web pages; and 2) it accuractely distinguishes legitimate and phishing pages.
Langue:
Anglais