Fake news detection in Urdu language using machine learning.

PeerJ Comput Sci

Information and Communication Engineering, Yeungnam University, Gyeongsan si, South Korea.

Published: May 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

With the rise of social media, the dissemination of forged content and news has been on the rise. Consequently, fake news detection has emerged as an important research problem. Several approaches have been presented to discriminate fake news from real news, however, such approaches lack robustness for multi-domain datasets, especially within the context of Urdu news. In addition, some studies use machine-translated datasets using English to Urdu Google translator and manual verification is not carried out. This limits the wide use of such approaches for real-world applications. This study investigates these issues and proposes fake news classier for Urdu news. The dataset has been collected covering nine different domains and constitutes 4097 news. Experiments are performed using the term frequency-inverse document frequency (TF-IDF) and a bag of words (BoW) with the combination of n-grams. The major contribution of this study is the use of feature stacking, where feature vectors of preprocessed text and verbs extracted from the preprocessed text are combined. Support vector machine, k-nearest neighbor, and ensemble models like random forest (RF) and extra tree (ET) were used for bagging while stacking was applied with ET and RF as base learners with logistic regression as the meta learner. To check the robustness of models, fivefold and independent set testing were employed. Experimental results indicate that stacking achieves 93.39%, 88.96%, 96.33%, 86.2%, and 93.17% scores for accuracy, specificity, sensitivity, MCC, ROC, and F1 score, respectively.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280395PMC
http://dx.doi.org/10.7717/peerj-cs.1353DOI Listing

Publication Analysis

Top Keywords

fake news
16
news detection
8
news
8
urdu news
8
preprocessed text
8
fake
4
urdu
4
detection urdu
4
urdu language
4
language machine
4

Similar Publications

Trump Administration's 'Weaponization' of Covid-19.

J Healthc Sci Humanit

January 2024

Professor of Political Science, Florida Memorial University, View Article and Find Full Text PDF

Future of Health Promotion in the Asia-Pacific Region in an Era of Global Disruption: An Interview With Dr. Sandro Demaio.

Health Promot J Austr

October 2025

Health and Social Change Australia, Queensland, Australia, Brisbane, Queensland, Australia.

This interview with Dr. Sandro Demaio, Director of the WHO Asia-Pacific Centre for Environment and Health, explores the future of health promotion in the region amid global disruption. Reflecting on his experiences at the intersection of climate, environment and health, Dr.

View Article and Find Full Text PDF

The widespread dissemination of fake news presents a critical challenge to the integrity of digital information and erodes public trust. This urgent problem necessitates the development of sophisticated and reliable automated detection mechanisms. This study addresses this gap by proposing a robust fake news detection framework centred on a transformer-based architecture.

View Article and Find Full Text PDF

Deepfakes pose critical threats to digital media integrity and societal trust. This paper presents a hybrid deepfake detection framework combining Convolutional Neural Networks (CNNs) and Generative Adversarial Networks (GANs) to address challenges in scalability, generalizability, and adversarial robustness. The framework integrates adversarial training, a temporal decay analysis model, and multimodal detection across audio, video, and text domains.

View Article and Find Full Text PDF

Artificial General Intelligence and Its Threat to Public Health.

J Eval Clin Pract

September 2025

Academic Unit of Population and Lifespan Sciences, School of Medicine, Nottingham City Hospital Campus, University of Nottingham, Clinical Sciences Building, Nottingham, UK.

Background: Artificial intelligence (AI) is increasingly applied across healthcare and public health, with evidence of benefits including enhanced diagnostics, predictive modelling, operational efficiency, medical education, and disease surveillance.However, potential harms - such as algorithmic bias, unsafe recommendations, misinformation, privacy risks, and sycophantic reinforcement - pose challenges to safe implementation.Far less attention has been directed to the public health threats posed by artificial general intelligence (AGI), a hypothetical form of AI with human-level or greater cognitive capacities.

View Article and Find Full Text PDF