CD Skripsi
Klasifikasi Berita Dengan Natural Language Processing Menggunakan Metode Tf-Idf Dan Naïve Bayes Di Badan Pusat Statistik Provinsi Riau
The existence of online news sites is used by the Central Bureau of Statistics (CBS) Riau to collect phenomena that occur in society as supporting data for the data they publish. The long time for collecting and processing news data as well as other work that must be done by officers means that news data collection activities tend to be postponed so that the CBS Riau does not have sufficient supporting data for the data or information they publish. This research aims to classify news into certain categories according to the needs of the CBS Riau. The method used in this research is Natural Language Processing. In classifying text, NLP utilizes Machine Learning which in this research uses the Multinomial Naïve Bayes (MNB) algorithm. The MNB algorithm will predict the class of a text based on the appearance probability of words within the text. This research uses Research and Development (R&D) as a research method which involves a number of systematic steps to classify news using Natural Language Processing into several categories required by Riau Central Bureau of Statistics. The research carried out succeeded in obtaining an accuracy validation value of 83%, accuracy test value of 90% and the average result of precision result of 0.85 and recall 0.93.
Keywords: News, Classification, NLP
Tidak tersedia versi lain