Comparing the Random Forest vs. Extreme Gradient Boosting using Cuckoo Search Optimizer for Detecting Arabic Cyberbullying
DOI:
https://doi.org/10.24996/ijs.2023.64.9.40
Keywords:
Cyberbullying, XGBoost, Random Forest, Machine Learning, Cuckoo Search
Abstract
Cyberbullying is one of the major problems of the electronic age, but it is not a new phenomenon: bullying existed in its traditional form before the emergence of social networks. Cyberbullying has many consequences, including emotional and physiological effects such as depression and anxiety. Given the prevalence of this phenomenon, its importance to society, and its negative impact on all age groups, especially adolescents, this work aims to build models that detect cyberbullying in Arabic-language comments on social media (Twitter) using the Extreme Gradient Boosting (XGBoost) and Random Forest methods. After a series of pre-processing steps, the classification accuracy for these comments was 0.861 with XGBoost and 0.849 with Random Forest. These results were then improved by using an optimization algorithm, cuckoo search, to adjust the parameters of the two methods. The improvement was clearest for Random Forest, which obtained results similar to those of Extreme Gradient Boosting, with a value of 0.867.
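The parameter-adjustment step can be illustrated with a minimal sketch of cuckoo search in Python. This is not the authors' implementation: the abstract does not state which parameters were tuned, their ranges, or the search settings, so everything below is an assumption made for illustration. In particular, `toy_validation_error` is a hypothetical stand-in for "train a classifier with these parameters and return its validation error", and the two tuned values (a number of trees and a maximum depth), their bounds, and the settings `n_nests`, `n_iter`, `pa`, and `alpha` are arbitrary choices.

```python
import math
import random


def levy_step(rng, beta=1.5):
    """Draw one heavy-tailed step length using Mantegna's algorithm."""
    sigma = (
        math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
        / (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))
    ) ** (1 / beta)
    u = rng.gauss(0, sigma)
    v = rng.gauss(0, 1) or 1e-12  # guard against an exact zero
    return u / abs(v) ** (1 / beta)


def clip(point, bounds):
    """Keep every coordinate inside its (low, high) range."""
    return [min(max(x, lo), hi) for x, (lo, hi) in zip(point, bounds)]


def cuckoo_search(objective, bounds, n_nests=15, n_iter=200, pa=0.25,
                  alpha=0.01, seed=0):
    """Minimise `objective` over the box `bounds`; returns (best_point, best_score)."""
    rng = random.Random(seed)

    def random_nest():
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    nests = [random_nest() for _ in range(n_nests)]
    scores = [objective(n) for n in nests]

    for _ in range(n_iter):
        # Levy flights: each cuckoo proposes a candidate near its own nest
        # and replaces a randomly chosen nest if the candidate is better.
        for i in range(n_nests):
            cand = clip(
                [x + alpha * (hi - lo) * levy_step(rng)
                 for x, (lo, hi) in zip(nests[i], bounds)],
                bounds,
            )
            cand_score = objective(cand)
            j = rng.randrange(n_nests)
            if cand_score < scores[j]:
                nests[j], scores[j] = cand, cand_score

        # Abandon a fraction `pa` of the worst nests and rebuild them at random.
        worst_first = sorted(range(n_nests), key=lambda k: scores[k], reverse=True)
        for k in worst_first[: int(pa * n_nests)]:
            nests[k] = random_nest()
            scores[k] = objective(nests[k])

    best = min(range(n_nests), key=lambda k: scores[k])
    return nests[best], scores[best]


def toy_validation_error(params):
    """Hypothetical stand-in for a real train-and-validate run (not real data)."""
    n_estimators, max_depth = params
    return ((n_estimators - 300) / 1000) ** 2 + ((max_depth - 12) / 50) ** 2


# Assumed search ranges: number of trees in [10, 1000], maximum depth in [2, 50].
search_bounds = [(10, 1000), (2, 50)]
best_params, best_error = cuckoo_search(toy_validation_error, search_bounds)
print([round(p) for p in best_params], best_error)
```

In a real experiment the objective would train a Random Forest or XGBoost model with the rounded parameter values and return one minus its validation accuracy, which makes each evaluation far more expensive than this toy function.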