An Integrated Information Gain with A Black Hole Algorithm for Feature Selection: A Case Study of E-mail Spam Filtering

Authors

  • Amaal Mahmood Department of fundamentals of Religion\ Tarmiya, college Alemam_Alaadm University, Baghdad, Iraq https://orcid.org/0000-0002-2640-1844
  • Adnan Hadi Mahdi Al-Helali Cyber Security Department, Faculty of Science and Information Technology, Irbid National University, Irbid, Jordan

DOI:

https://doi.org/10.24996/ijs.2023.64.9.38

Keywords:

E-mail Spam Filtering, Black Hole Algorithm, Feature Selection, Naïve Bayesian Classifier

Abstract

     The current issues in spam email detection systems are directly related to spam email classification's low accuracy and feature selection's high dimensionality. However, in machine learning (ML), feature selection (FS) as a global optimization strategy reduces data redundancy and produces a collection of precise and acceptable outcomes. A black hole algorithm-based FS algorithm is suggested in this paper for reducing the dimensionality of features and improving the accuracy of spam email classification. Each star's features are represented in binary form, with the features being transformed to binary using a sigmoid function. The proposed Binary Black Hole Algorithm (BBH) searches the feature space for the best feature subsets, and feature selection is based on a fitness function that is proportional to the accuracy achieved using a Naive Bayesian Classifier (NBC). When measuring the performance of the BBH with the SpamBase dataset, the performance of the classifier and the dimension of the selected feature vector used as a classifier input are considered. The experiments revealed that the BBH can produce good FS results even with a small set of selected features. This shows that when utilizing the NBC-based BBH, good spam email categorization accuracy is possible.

Downloads

Published

2023-09-30

Issue

Section

Computer Science

How to Cite

An Integrated Information Gain with A Black Hole Algorithm for Feature Selection: A Case Study of E-mail Spam Filtering. (2023). Iraqi Journal of Science, 64(9), 4779-4790. https://doi.org/10.24996/ijs.2023.64.9.38

Similar Articles

1-10 of 581

You may also start an advanced similarity search for this article.