Abstract
The ever-increasing size of datasets in the Big Data era requires effective methods for extracting meaningful information. Data Mining provides a means to analyze large datasets and uncover valuable patterns that can inform future decisions. In this study, we analyze a healthcare dataset of heart diseases to predict the likelihood of a patient having a heart disease based on specific parameters. To accomplish this, we implement decision tree classification algorithms such as ADTree, J48, and RandomForest. Additionally, a feature selection algorithm is applied to remove the least significant three attributes from the dataset, resulting in improved classification performance. Comparing the previous and current results reveals the effectiveness of this approach in enhancing the classification accuracy.
Key-Words / Index Term
Data Mining, classification algorithms, Feature selection
References
[1]. F. Chu and C. Zaniolo, “Fast and light boosting for adaptive mining of data streams,” Adv. Knowl. Discov. Data Min.,vol. 3056, pp. 282–292, 2004.
[2]. Sh. Hajirahimova, et. al., Azerbaijan; Aliyeva, Aybeniz S., "About Big Data Measurement Methodologies and Indicators". International Journal of Modern Education and Computer Science. Vol.9, Issue 10, pp.1–9, 2017. doi:10.5815/ijmecs.2017.10.01
[3]. S. Sumathi and S. N. Sivanandam, “Data mining tasks, techniques, and applications,” Stud. Comput. |Intell., Vol. 29, pp. 195–216, 2007.
[4]. S. a. Mingoti and J. O. Lima, “Comparing SOM neural network with Fuzzy c-means, K-means and traditional hierarchical clustering algorithms,” Eur. J. Oper. Res., Vol. 174, pp. 1742–1759, 2006.
[5]. GeletawSahle, “Ethiopic maternal care data mining:discovering the factors that affect postnatal care visit in Ethiopia” Sahle Health Inf Sci Syst (2016) 4:4 DOI 10.1186/s13755-016-0017-2
[6]. DursunDelen*, Christie Fuller, Charles McCann, Deepa Ray “Analysis of healthcare coverage: A data mining approach” Available online at www.sciencedirect.com Expert Systems with Applications, Vol.36, pp.995–1003, 2009.
[7]. Shelly Gupta, DharminderKumar,Anand Sharma, “Data Mining Classification Techniques Applied for Breast Cancer Diagnosis And Prognosis “ Vol. 2 No. 2 Apr-May 2011
[8]. SellappanPalaniappan, Rafiah Awang “Intelligent Heart Disease Prediction System Using Data Mining Techniques”AICCA ’08 Proceeding of the 2008 IEEE/ACS International conference on computer Systems and Application pages 108-115, 2008.
[9] Suresh, Annamalai, Rajagopal Kumar, and R. Varatharajan. "Health care data analysis using evolutionary algorithm." The Journal of Supercomputing, Vol.76, Issue 6, pp.4262-4271, 2020.
[10] Skoff, Tami H., et al. "Impact of the US maternal tetanus, diphtheria, and acellular pertussis vaccination program on preventing pertussis in infants< 2 months of age: a case-control evaluation." Clinical Infectious Diseases, Vol.65, Issue.12, pp.1977-1983, 2017.
[11] Lo’ai, A. Tawalbeh, et al. "Mobile cloud computing model and big data analysis for healthcare applications." IEEE Access, Vol.4, pp. 6171-6180, 2016.
Citation
Vikas Mongia, "Optimizing and Enhancing Performance Classification Algorithm on Heart Disease through Feature Selection," International Journal of Scientific Research in Network Security and Communication, Vol.9, Issue.6, pp.1-4, 2021