Abstract
In order to perform analysis on text-based datasets, the techniques and methods in Text Mining (TM) which is a subdomain of Data Mining are used. In this study, it is aimed to evaluate the classification accuracy of academic articles which are produced in academic domain. In accordance with this purpose, the abstracts of the academic articles are obtained and a dataset is created from an academic knowledge sharing network named Research Gate by using self-developed software tools. The academic articles in the dataset fall into two categories as “Materials Science & Engineering” and “Social Sciences & Humanities”. KNN (k-nearest neighbors) classification algorithm is performed by utilizing R language and R Studio tools on the dataset. The experimental results show that the classification accuracy (ACC) of KNN is obtained as 96.67%.
Translated title of the contribution | Classification of scientific articles using text mining with KNN algorithm and R language |
---|---|
Original language | Turkish |
Pages (from-to) | 89-94 |
Number of pages | 6 |
Journal | International Journal of Advances in Engineering and Pure Sciences |
Volume | 3 |
DOIs | |
Publication status | Published - 31 Dec 2016 |
Externally published | Yes |
Keywords
- text mining
- R
- R Studio
- KNN
- text classification