Phishing is a security threat in which fraudulent web
pages masquerade as trustworthy ones to steal users’
sensitive information, e.g., passwords, personal identification
numbers, and credit card numbers. Criminals usually create
phishing web pages by copying legitimate ones exactly or by
slightly modifying their content in order to obtain users’ valuable
information. In the past, content-based lexical features have been
extracted to detect phishing web sites [3]. A hybrid approach has
also been proposed that detects phishing web pages through identity
discovery and keyword retrieval.