Abstract
Phishing website detection is an important challenge in information security because of the large number of online transactions carried out over websites. Website phishing is the theft of personal information over the Internet, such as system backup data, user login credentials, bank account details, or other security information, by means of fake ("phishy") websites that look like legitimate ones. In this paper, we use the associative classification data mining approach, also known as rule-based classification, to detect phishy websites and to identify which detection algorithm achieves the higher accuracy. The algorithms compared are Naïve Bayes and PART. Each website in the collected dataset is classified as either legitimate or phishy. The implementation is carried out on a dataset of 1,353 websites containing both phishy and legitimate sites. The results show which of the two algorithms has the higher detection accuracy, that is, which one more correctly identifies phishing and legitimate websites.
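As a rough illustration of the kind of classification and accuracy measurement described above, the following is a minimal sketch of a categorical Naïve Bayes classifier in Python. It is not the authors' implementation. The feature encoding (-1, 0, 1), the toy rows, and all function names are assumptions made for this example only. PART is not reproduced here; it is available in tools such as Weka, and a comparison would compute the same accuracy figure for both classifiers.

```python
from collections import Counter, defaultdict
import math

# Hypothetical toy data: each row is a website described by three
# categorical features encoded as -1, 0, or 1 (an assumed encoding).
TRAIN_ROWS = [(1, 1, 1), (1, 0, 1), (1, 1, 0),
              (-1, -1, 0), (-1, 0, -1), (-1, -1, -1)]
TRAIN_LABELS = ["legitimate"] * 3 + ["phishy"] * 3


def train_naive_bayes(rows, labels, alpha=1.0):
    """Fit a categorical Naive Bayes model with Laplace smoothing (alpha)."""
    class_counts = Counter(labels)
    # value_counts[(feature_index, label)][value] = number of occurrences
    value_counts = defaultdict(Counter)
    # feature_values[feature_index] = set of values seen for that feature
    feature_values = defaultdict(set)
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            feature_values[i].add(value)
    return class_counts, value_counts, feature_values, alpha


def predict(model, row):
    """Return the label with the highest log-posterior for one row."""
    class_counts, value_counts, feature_values, alpha = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        score = math.log(count / total)  # log prior
        for i, value in enumerate(row):
            numerator = value_counts[(i, label)][value] + alpha
            denominator = count + alpha * len(feature_values[i])
            score += math.log(numerator / denominator)  # log likelihood
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(model, rows, labels):
    """Fraction of rows whose predicted label matches the true label."""
    correct = sum(predict(model, r) == y for r, y in zip(rows, labels))
    return correct / len(labels)


model = train_naive_bayes(TRAIN_ROWS, TRAIN_LABELS)
print(predict(model, (1, 1, 1)))
print(accuracy(model, TRAIN_ROWS, TRAIN_LABELS))
```

In practice, accuracy would be measured on held-out data (for example with cross-validation) rather than on the training rows used in this toy example.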
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Gautam, S., Rani, K., Joshi, B. (2018). Detecting Phishing Websites Using Rule-Based Classification Algorithm: A Comparison. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 9. Springer, Singapore. https://doi.org/10.1007/978-981-10-3932-4_3
DOI: https://doi.org/10.1007/978-981-10-3932-4_3
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3931-7
Online ISBN: 978-981-10-3932-4
eBook Packages: Engineering, Engineering (R0)