Predicting Heart Disease Risk Using Machine Learning Models and Feature Selection Techniques

Authors

  • Sohaib Latif Department of Computer Science and Software Engineering, Grand Asian University, Sialkot, Punjab, Pakistan. Corresponding Author, Email: sohaib.latif@gaus.edu.pk
  • Raheel Khalid Department of Computer Science, The University of Chenab, Gujrat, Punjab, Pakistan 50700. raheelkhalid37@gmail.com
  • Muhammad Raza Khan Department of Computer Science, The University of Chenab, Gujrat, Punjab, Pakistan 50700. raheelkhalid37@gmail.com, muhammadrazakhan28@gmail.com

DOI:

https://doi.org/10.63163/jpehss.v3i2.229

Abstract

Heart disease is one of the leading causes of death worldwide, making early detection essential for improving patient outcomes. With advancements in machine learning (ML), predictive models now offer a powerful way to assist doctors in diagnosing heart disease more accurately and efficiently. This study explores various ML algorithms, including Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), Decision Tree (DT), Naïve Bayes (NB), and K-Nearest Neighbors (KNN), to identify the most effective approach for heart disease prediction. Using the Cleveland Heart Disease Dataset, which contains 1,025 patient records with 14 medical attributes, we preprocessed the data, selected key features, and optimized model parameters. After evaluating the models with 10-fold cross-validation, the Random Forest model achieved the highest accuracy (98%), followed by Decision Tree (97%). These results highlight the potential of ML-based tools in clinical decision-making, helping doctors detect heart disease at an earlier stage and make informed treatment plans.

Downloads

Published

2025-04-01

How to Cite

Sohaib Latif, Raheel Khalid, & Muhammad Raza Khan. (2025). Predicting Heart Disease Risk Using Machine Learning Models and Feature Selection Techniques. Physical Education, Health and Social Sciences, 3(2), 1–20. https://doi.org/10.63163/jpehss.v3i2.229