PREDIKSI PENYAKIT JANTUNG DENGAN MENGGUNAKAN METODE ALGORITMA C4.5 BERBASIS BACKWARD ELIMINATION

Asriani Anhi, ASRIANI ASRI ANHI

Abstract


 

Penyakit jantung merupakan penyebab utama kematian di dunia selama 10 tahun terakhir, pada tahun 2003 World Health Organization (WHO) melaporkan bahwa 29,2% dari total kematian global akibat Cardio Vascular Disease (CVD), yang menjadi penyebab utama atas kematian di negara berkembang karena perubahan gaya hidup, budaya kerja dan kebiasaan makanan. Algoritma C4.5 merupakan pengklasifikasian yang paling sederhana, mudah diimplementasikan. Namun Algoritma C4.5 masih memiliki kelemahan dalam menangani data dalam dimensi tinggi. Penelitian ini bertujuan untuk menerapkan algoritma backward elimination dengan seleksi atribut split validation sehingga dapat meningkatkan prediksi serangan penyakit jantung. Dari penelitian ini menguji 4 dataset yang ada di UCI Machine Learning yang awalnya menggunakan algoritma C4.5 dalam pengklasifikasian atribut dengan hasil pencapaian yaitu hungarian dataset 82.95%, Cleveland dataset 56.29 %, Switzerland dataset 50.00% dan VA dataset 35.00%. Setelah diuji dengan algoritma backward elimination maka prediksi meningkat pada setiap dataset, hasil pencapaian yaitu hungarian dataset 93.10 %, Cleveland dataset 67.21 %, Switzerland dataset 75.00% dan VA dataset 40.00% dalam penyeleksian atribut pada serangan penyakit jantung.


Keywords


: Data mining , Method Algorithm C4.5, Attribute Selection, Backward Elimination, Heart Disease.

References


DAFTAR PUSTAKA

J. Soni, “Predictive Data Mining for Medical Diagnosis : An Overview of Heart Disease Prediction,†vol. 17, no. 8, pp. 43–48, 2011.

M. Shouman, T. Turner, and R. Stocker, “Applying k-Nearest Neighbour in Diagnosing Heart Disease Patients,†Int. J. Inf. Educ. Technol., pp. 220–223, 2012.

M. Shouman, T. Turner, and R. Stocker, “Using data mining techniques in heart disease diagnosis and treatment,†2012 Japan-Egypt Conf. Electron. Commun. Comput., pp. 173–177, Mar. 2012.

M. Anbarasi and E. Anupriya, “Enhanced Prediction of Heart Disease with Feature Subset Selection using Genetic Algorithm,†vol. 2, no. 10, pp. 5370–5376, 2010.

Purwanto, C. Eswaran, R. Logeswaran, and A. R. A. Rahman, “Prediction Models for Early Risk Detection of Cardiovascular Event,†J. Med. Syst., vol. 36, no. 2, pp. 521–531, May 2010.

N. Padhy, P. Mishra, and R. Panigrahi, “The Survey of Data Mining Applications And Feature Scope,†vol. 2, no. 3, pp. 43–58, 2012.

A. L. Houston, H. Chen, S. M. Hubbard, B. R. Schatz, T. D. Ng, R. R. Sewell, and K. M. Tolle, “Medical Data Mining on the Internet : Research on a Cancer Information System,†pp. 437–466, 2000.

S. Vijiyarani and S. Sudha, “Disease Prediction in Data Mining Technique – A Survey,†vol. II, no. I, pp. 17–21, 2013.

M. Chau, R. Cheng, and B. Kao, “Uncertain Data Mining : A New Research Direction,†vol. 1, 2005.

K. Srinivas, G. R. Rao, and a. Govardhan, “Analysis of coronary heart disease and prediction of heart attack in coal mining regions using data mining techniques,†2010 5th Int. Conf. Comput. Sci. Educ., pp. 1344–1349, Aug. 2010.

S. Palaniappan, “Intelligent Heart Disease Prediction System Using Data Mining Techniques,†Int.

J. Comput. Sci. Netw. Secur., vol. 8, no. 8, pp. 343–350, 2008.

L. Yu and H. Liu, “Efficient Feature Selection via Analysis of Relevance and Redundancy,†vol. 5, pp. 1205–1224, 2004.

I. Guyon, “An Introduction to Variable and Feature Selection 1 Introduction,†vol. 3, pp. 1157– 1182, 2003.

H.-W. Cho, “A data mining-based subset selection for enhanced discrimination using iterative elimination of redundancy,†Expert Syst. Appl., vol. 36, no. 2, pp. 1355–1361, Mar. 2009.

Edy, “Penerapan algoritma c4.5 dengan seleksi atribut berbasis algoritma genetika dalam diagnosa penyakit jantung tesis,†2012.

P. C. Austin and J. V Tu, “Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality.,†J. Clin. Epidemiol., vol. 57, no. 11, pp. 1138–46, Nov. 2004.

X. Hong, C. Harris, M. Brown, and S. Chen, “Backward Elimination Methods for Associative Memory Network Pruning Problem formulation ¥ bEdc e S f Ggc e S f R S S IH Sf Sf c e E @ c e S f t R S ¥ bE ! c e S f u c e S f h S S XH ¥ aG c e EW @ c e E c e where c e u Sf Sf r Sf Sfs e h @ S h S,†no. Reed 1993, 2002.

C. Goutte and J. Larsen, “Optimal Cross-Validation Split Ratio : Experimental Investigation,†no. 2, pp. 1–6.

Abdul Rohman Penerapan Algoritma C4.5 Berbasis Adaboost Untuk Prediksi Penyakit Jantung, Dian Nuswantoro University Semarang Java Center tesis 2013.

Ian H. Witten and et al, Data Mining, Third Edition ed.: Morgan Kaufmann Publishers is an imprint of Elsevier, 2011.




DOI: https://doi.org/10.33857/patj.v8i2.913

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Asriani Anhi, ASRIANI ASRI ANHI

License URL: http://creativecommons.org/licenses/by/3.0/

View My Stats



Journal PoliciesSubmissionsPeople
 Information


Editorial Office
 Department of Electrical Engineering :
Jl. Tun Abdul Razak, Paccinongang, Kec. Somba Opu, Kabupaten Gowa, Sulawesi Selatan 90235 | Email : pa.techjournal@gmail.com

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License

slot88

slot88

slot777

slot gacor

slot dana

slot gacor

slot qris

slot qris

slot thailand

slot thailand

slot88 terpercaya

slot88