KNN algoritması ve R dili ile metin madenciliği kullanılarak bilimsel makale tasnifi

Translated title of the contribution: Classification of scientific articles using text mining with KNN algorithm and R language

Deniz Kılınç*, Emin Borandağ, Fatih Yücalar, Volkan Tunali, Macit Şimşek, Akın Özçift

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In order to perform analysis on text-based datasets, the techniques and methods in Text Mining (TM) which is a subdomain of Data Mining are used. In this study, it is aimed to evaluate the classification accuracy of academic articles which are produced in academic domain. In accordance with this purpose, the abstracts of the academic articles are obtained and a dataset is created from an academic knowledge sharing network named Research Gate by using self-developed software tools. The academic articles in the dataset fall into two categories as “Materials Science & Engineering” and “Social Sciences & Humanities”. KNN (k-nearest neighbors) classification algorithm is performed by utilizing R language and R Studio tools on the dataset. The experimental results show that the classification accuracy (ACC) of KNN is obtained as 96.67%.
Translated title of the contributionClassification of scientific articles using text mining with KNN algorithm and R language
Original languageTurkish
Pages (from-to)89-94
Number of pages6
JournalInternational Journal of Advances in Engineering and Pure Sciences
Volume3
DOIs
Publication statusPublished - 31 Dec 2016
Externally publishedYes

Keywords

  • text mining
  • R
  • R Studio
  • KNN
  • text classification

Fingerprint

Dive into the research topics of 'Classification of scientific articles using text mining with KNN algorithm and R language'. Together they form a unique fingerprint.

Cite this