Digital Repository

New Feature Selection Method for High Dimensional Gene Data

dc.contributor.author Fajila, M.N.F.
dc.contributor.author Nawarathna, R.D.
dc.date.accessioned 2016-12-20T09:44:07Z
dc.date.available 2016-12-20T09:44:07Z
dc.date.issued 2016
dc.identifier.citation Fajila, M.N.F. and Nawarathna, R.D. 2016. New Feature Selection Method for High Dimensional Gene Data. Symposium on Statistical & Computational Modelling with Applications (SymSCMA – 2016), Department of Statistics & Computer Science, University of Kelaniya, Sri Lanka. p 66-69. en_US
dc.identifier.uri http://repository.kln.ac.lk/handle/123456789/15558
dc.description.abstract Dimensionality reduction, here in the form of feature selection, is an essential technique in data science when handling high dimensional data such as cancer microarray samples. Cancer microarray experiments normally produce large amounts of data containing many features, called genes. However, genes can be either redundant or irrelevant, and can thus be removed without incurring much loss of information. The major problem in microarray data analysis is that a small number of samples is paired with a large number of genes. In this study, a new machine learning method, namely hybrid wrapper-filter feature selection, is proposed for gene selection. This approach combines the genes selected by both filter and wrapper feature selection methods. Further, it uses a least priority feature elimination procedure in which the genes with the lowest priority are eliminated. The proposed technique is validated and evaluated on two microarray data sets, namely the leukemia and colon cancer data sets. With genes selected by the proposed method, the leukemia microarray samples are classified perfectly (100% accuracy) and the colon cancer samples are classified with only two misclassifications, giving an accuracy of 90.5%. The results also show that the proposed technique is extremely efficient in terms of computational time. en_US
dc.language.iso en en_US
dc.publisher Department of Statistics & Computer Science, University of Kelaniya, Sri Lanka en_US
dc.subject Classification en_US
dc.subject Dimensionality reduction en_US
dc.subject Feature selection en_US
dc.subject Gene selection en_US
dc.subject Microarray experiment en_US
dc.title New Feature Selection Method for High Dimensional Gene Data en_US
dc.type Article en_US
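
The abstract describes the hybrid method only at a high level, so the following is a minimal sketch of the general idea rather than the authors' implementation. It assumes a signal-to-noise score as the filter criterion, greedy forward selection with a leave-one-out nearest-centroid classifier as the wrapper, and the filter score as the "priority" used for elimination. None of these choices are stated in the record, and all function names, parameters, and the toy data are hypothetical.

```python
from statistics import mean, pstdev


def snr_scores(X, y):
    """Filter step: signal-to-noise score per gene (an assumed criterion)."""
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, label in zip(X, y) if label == 0]
        b = [row[j] for row, label in zip(X, y) if label == 1]
        spread = pstdev(a) + pstdev(b) or 1e-12  # guard against zero spread
        scores.append(abs(mean(a) - mean(b)) / spread)
    return scores


def loo_accuracy(X, y, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier on `genes`."""
    if not genes:
        return 0.0
    correct = 0
    for i in range(len(X)):
        centroids = {}
        for label in set(y):
            rows = [X[k] for k in range(len(X)) if k != i and y[k] == label]
            centroids[label] = [mean(r[g] for r in rows) for g in genes]
        pred = min(
            centroids,
            key=lambda c: sum((X[i][g] - m) ** 2
                              for g, m in zip(genes, centroids[c])),
        )
        correct += pred == y[i]
    return correct / len(X)


def forward_wrapper(X, y, max_genes):
    """Wrapper step: greedily add the gene that most improves accuracy."""
    selected, best = [], 0.0
    candidates = list(range(len(X[0])))
    while candidates and len(selected) < max_genes:
        gene = max(candidates, key=lambda g: loo_accuracy(X, y, selected + [g]))
        acc = loo_accuracy(X, y, selected + [gene])
        if acc <= best:  # stop when no candidate improves accuracy
            break
        selected.append(gene)
        candidates.remove(gene)
        best = acc
    return selected


def hybrid_select(X, y, top_k=3, max_wrapper=3):
    """Combine filter and wrapper picks, then drop low-priority genes."""
    scores = snr_scores(X, y)
    filter_genes = sorted(range(len(scores)), key=lambda j: -scores[j])[:top_k]
    wrapper_genes = forward_wrapper(X, y, max_wrapper)
    # Union of both selections, ordered from lowest to highest priority.
    pool = sorted(set(filter_genes) | set(wrapper_genes), key=lambda j: scores[j])
    acc = loo_accuracy(X, y, pool)
    for gene in list(pool):
        if len(pool) == 1:
            break
        trial = [g for g in pool if g != gene]
        trial_acc = loo_accuracy(X, y, trial)
        if trial_acc >= acc:  # assumed rule: drop if accuracy does not fall
            pool, acc = trial, trial_acc
    return sorted(pool), acc


# Hypothetical toy data: 6 samples x 4 "genes", two classes.
X = [
    [1.0, 5.0, 2.0, 3.0],
    [1.2, 1.0, 2.5, 3.0],
    [0.9, 4.0, 1.8, 3.0],
    [5.0, 4.5, 3.9, 3.0],
    [5.3, 1.5, 2.4, 3.0],
    [4.8, 3.0, 4.2, 3.0],
]
y = [0, 0, 0, 1, 1, 1]

selected, acc = hybrid_select(X, y, top_k=2, max_wrapper=2)
print(selected, acc)
```

On this toy data the filter picks genes 0 and 2, the wrapper picks gene 0, and the elimination pass drops gene 2 because accuracy does not fall without it. Real microarray data sets have thousands of genes, where the choice of filter, wrapper, and stopping rule matters far more than in this illustration.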

