Abstract
Machine learning and data mining techniques have recently gained more popularity in the field of Medical diagnosis, especially for the analysis of the human genome. One of the most significant sources of human genome variation is Single Nucleotide Polymorphisms (SNPs), which have been associated with multiple human diseases. Several techniques have been developed for distinguishing between affected and healthy samples of SNP data. In this study, conditional mutual information maximisation (CMIM) has been employed in order to identify a subset of the most informative SNPs to be used in with various classifications algorithms for the detection of hypertension disease. Five classification algorithms have been evaluated, namely k-Nearest Neighbours (KNN), Artificial Neural Networks (ANN), Naive Bayes (NB), Linear Discriminant Analysis (LDA) and Support Vector Machines (SVM), along with their combination into an unweighted majority voting ensemble classification scheme. The experimental evaluation of the proposed approach via supervised classification experiments showed that the ensemble approach using the SVM, 5-NN, and NB classifiers achieves the highest classification accuracy (93.21%) and F1 score (91.72%), demonstrating the suitability of the proposed approach for the detection of hypertension disease from SNPs data.
Original language | English |
---|---|
Title of host publication | 2018 24th IEEE International Conference on Automation and Computing |
Editors | Xiandong Ma |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Electronic) | 9781862203426 |
DOIs | |
Publication status | Published - 6 Sept 2018 |
Event | 24th International Conference on Automation and Computing - Newcastle upon Tyne, United Kingdom Duration: 6 Sept 2018 → 7 Sept 2018 http://www.cacsuk.co.uk/index.php/conferences/icac/program |
Conference
Conference | 24th International Conference on Automation and Computing |
---|---|
Abbreviated title | ICAC2018 |
Country/Territory | United Kingdom |
City | Newcastle upon Tyne |
Period | 6/09/18 → 7/09/18 |
Internet address |