Abstract
The rise of fake news has become a significant global concern, undermining public trust and information integrity. This study explores the application of advanced machine learning algorithms for detecting fake news, leveraging a balanced dataset of real and fake news articles. Through rigorous preprocessing, including text cleaning and Term Frequency-Inverse Document Frequency (TF-IDF) vectorization, the study enhances data quality and model performance. Five machine learning models—Random Forest, Support Vector Machine (SVM), Neural Networks, Logistic Regression, and Naïve Bayes—are systematically evaluated using metrics such as accuracy, precision, recall, and F1-score. Results indicate that the Random Forest Classifier outperforms other models with an accuracy of 99.95% and balanced performance across metrics, demonstrating its robustness in distinguishing fake from real news. SVM and Neural Networks also achieve high accuracy, showcasing their capability in handling complex data. Logistic Regression and Naïve Bayes, while computationally efficient, exhibit relatively lower performance. The findings underscore the importance of ensemble methods and sophisticated preprocessing techniques in detecting fake news effectively. This research provides a methodological framework for scalable fake news detection, offering valuable insights for developing automated systems to combat misinformation and promote informed decision-making in the digital age.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have