DSpace Repository

Intelligent Detection of False Information in Arabic Tweets Utilizing Hybrid Harris Hawks Based Feature Selection and Machine Learning Models

Show simple item record

dc.contributor.advisor Saheb, Mahmoud
dc.contributor.author Taher, Thaer
dc.contributor.author Turabieh, Hamza
dc.contributor.author Chanter, Hamouda
dc.date.accessioned 2021-06-13T08:55:05Z
dc.date.accessioned 2022-05-22T08:54:46Z
dc.date.available 2021-06-13T08:55:05Z
dc.date.available 2022-05-22T08:54:46Z
dc.date.issued 2021-03-27
dc.identifier.citation Thaher, T.; Saheb, M.; Turabieh, H.; Chantar, H. Intelligent Detection of False Information in Arabic Tweets Utilizing Hybrid Harris Hawks Based Feature Selection and Machine Learning Models. Symmetry 2021, 13, 556. https://doi.org/10.3390/sym13040556 en_US
dc.identifier.issn 2073-8994
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/8320
dc.description.abstract Fake or false information on social media platforms is a significant challenge that leads to deliberately misleading users due to the inclusion of rumors, propaganda, or deceptive information about a person, organization, or service. Twitter is one of the most widely used social media platforms, especially in the Arab region, where the number of users is steadily increasing, accompanied by an increase in the rate of fake news. This drew the attention of researchers to provide a safe online environment free of misleading information. This paper aims to propose a smart classification model for the early detection of fake news in Arabic tweets utilizing Natural Language Processing (NLP) techniques, Machine Learning (ML) models, and Harris Hawks Optimizer (HHO) as a wrapper-based feature selection approach. Arabic Twitter corpus composed of 1862 previously annotated tweets was utilized by this research to assess the efficiency of the proposed model. The Bag of Words (BoW) model is utilized using different term-weighting schemes for feature extraction. Eight well-known learning algorithms are investigated with varying combinations of features, including user-profile, content-based, and words-features. Reported results showed that the Logistic Regression (LR) with Term Frequency-Inverse Document Frequency (TF-IDF) model scores the best rank. Moreover, feature selection based on the binary HHO algorithm plays a vital role in reducing dimensionality, thereby enhancing the learning model’s performance for fake news detection. Interestingly, the proposed BHHO-LR model can yield a better enhancement of 5% compared with previous works on the same dataset. en_US
dc.description.sponsorship Taif University Researchers Supporting Project number (TURSP-2020/125), Taif University, Taif, Saudi Arabia. en_US
dc.language.iso en en_US
dc.publisher MDPI en_US
dc.relation.ispartofseries Symmetry in Artificial Visual Perception and Its Application);13, 4
dc.subject false information en_US
dc.subject natural language processing en_US
dc.subject machine learning en_US
dc.subject feacher selection en_US
dc.subject meta-heuristics en_US
dc.subject twitter en_US
dc.title Intelligent Detection of False Information in Arabic Tweets Utilizing Hybrid Harris Hawks Based Feature Selection and Machine Learning Models en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account