Heart disease prediction using multivariate data

Jayaweera, I. M. L. N.

UoK Repository Home
→
Science
→
Symposia & Conferences
→
International Conference on Applied and Pure Sciences (ICAPS)
→
ICAPS 2022
→
View Item

dc.contributor.author	Jayaweera, I. M. L. N.
dc.date.accessioned	2022-11-03T08:56:14Z
dc.date.available	2022-11-03T08:56:14Z
dc.date.issued	2022
dc.identifier.citation	Jayaweera I. M. L. N. (2021), Heart disease prediction using multivariate data, Proceedings of the International Conference on Applied and Pure Sciences (ICAPS 2021-Kelaniya) Volume 2, Faculty of Science, University of Kelaniya Sri Lanka. Page 29.	en_US
dc.identifier.uri	http://repository.kln.ac.lk/handle/123456789/25511
dc.description.abstract	This study addresses the problem of determining the important attributes of deciding heart disease. Heart disease can be considered one of the most common diseases in the world. Nowadays, diagnosing a disease in an early stage is a more crucial task. If it goes to later stages, it is hard to save human life and is a major problem for health care workers. In order to identify the heart condition as early as possible, it is essential to understand the main factors that may affect one’s heart condition. With this motivation, we apply statistical methods to determine the significant attributes that cause heart disease. This study presents the application of various statistical methods to find the significant factors in heart disease for decision support during the diagnostic process. Heart Disease data from UCI Machine Learning Repository is considered for the analysis. The data set is divided into two parts, exploratory data and confirmatory data. We conducted a descriptive analysis for the classification of the heart disease dataset based on association, dimension reduction, and a confirmatory analysis based on principal components and hypothesis testing to discover the most important information in the heart disease dataset. In the explanatory analysis, we got a brief idea about the factors which need to be looked at when diagnosing heart disease and their relationship based on different classifications. Principal Component Analysis (PCA) and hypothesis testing were used in the confirmatory data analysis. As a result of a hypothesis test, it can be concluded that there is no significant difference between the mean of the variables on exploratory data and confirmatory data. This implies that we can use any part of the data for our analysis. According to the principal component analysis, the first four principal components explain 90.4% total variability of the data. PCA further reveals that the cholesterol level, maximum heart rate, and resting blood pressure have a major impact on heart disease.	en_US
dc.publisher	Faculty of Science, University of Kelaniya Sri Lanka	en_US
dc.subject	Dimension reduction, Heart disease, Multivariate, Principal component	en_US
dc.title	Heart disease prediction using multivariate data	en_US