Comparison of Sentiment Analysis from Twitter Data Collection with Naïve Bayes, Decision Tree, and k-Nearest Neighbor Methods

Erwin Apriliyanto, Yayu Sri Rahayu

Abstract


Dalam konteks pesatnya perkembangan pengguna media sosial di Indonesia, khususnya Twitter, data yang dihasilkan memberikan informasi berharga untuk penelitian dan pengambilan keputusan. Penelitian ini bertujuan untuk mengklasifikasikan tweet berbahasa Indonesia ke dalam kategori positif, negatif, dan netral. Hasil pengujian menunjukkan bahwa metode Decision Tree memiliki rata-rata presisi kelas yang lebih baik dibandingkan dengan K-nearest neighbour (K-NN) dan Naïve Bayes. Algoritma K-NN memiliki rata-rata presisi kelas sebesar 54.60%, Decision Tree mencapai 72.85%, dan Naïve Bayes sebesar 47.66%. Selain itu, penggunaan Decision Tree menghasilkan presisi yang tinggi untuk kelas Negatif (90,00%) dan kelas Positif (81,82%) .


Keywords


Sentiment Analysis, Twitter, Naïve Bayes, Decision Tree, k-Nearest Neighbor

Full Text:

PDF

References


Akshay Gole, Sankalp Singh, Prathmesh Kanherkar, P.R.Abhishek, P. W. (2022). Comparative Analysis of Machine Learning Algorithms : Random Forest algorithm, Naive Bayes Classifier and KNN - A survey. International Journal Research Publication & Seminat, 13(03). https://jrps.shodhsagar.com/index.php/j/article/view/556

Apriliyanto, E., Kusrini, K., & Arief, R. (2020). Identification Of Diseases In Rice Plant Using Chatbot With Methode Artificial Intelligence Markup Language and Normalization. RESEARCH : Journal of Computer, Information System & Technology Management. https://doi.org/10.25273/research.v3i2.7060

Chitayae, N., & Sunyoto, A. (2020). Performance Comparison of Mushroom Types Classification Using K-Nearest Neighbor Method and Decision Tree Method. 2020 3rd International Conference on Information and Communications Technology (ICOIACT), 308–313. https://doi.org/10.1109/ICOIACT50329.2020.9332148

Itoo, F., Meenakshi, & Singh, S. (2021). Comparison and analysis of logistic regression, Naïve Bayes and KNN machine learning algorithms for credit card fraud detection. International Journal of Information Technology, 13(4), 1503–1511. https://doi.org/10.1007/s41870-020-00430-y

Jopri, M. H., Ab Ghani, M. R., Abdullah, A. R., Manap, M., Sutikno, T., & Too, J. (2021). K-nearest neighbor and naïve Bayes based diagnostic analytic of harmonic source identification. Bulletin of Electrical Engineering and Informatics, 9(6), 2650–2657. https://doi.org/10.11591/eei.v9i6.2685

Khoirunisa, R., Apriliyanto, E., Sandi, A. S., & Kusrini, K. (2020). Penggunaan Natural Language Processing Pada Chatbot Untuk Media Informasi Pertanian. Indonesian Journal of Applied Informatics, 4(2), 55. https://doi.org/10.20961/ijai.v4i2.38688

Kinanti Kumarahadi, Y., Apriliyanto, E., Yulianto, D., & Kusrini. (2020). Decision Support System For Determining The Provision Of Single Tuition Relief Using KNN and SAW Methods. 2020 8th International Conference on Cyber and IT Service Management (CITSM), 1–6. https://doi.org/10.1109/CITSM50537.2020.9268886

Lestari, F. P., Haekal, M., Edmi Edison, R., Ravi Fauzy, F., Nurul Khotimah, S., & Haryanto, F. (2020). Epileptic Seizure Detection in EEGs by Using Random Tree Forest, Naïve Bayes and KNN Classification. Journal of Physics: Conference Series, 1505(1), 012055. https://doi.org/10.1088/1742-6596/1505/1/012055

Nurdina, A., & Puspita, A. B. I. (2023). Naive Bayes and KNN for Airline Passenger Satisfaction Classification: Comparative Analysis. Journal of Information System Exploration and Research, 1(2). https://doi.org/10.52465/joiser.v1i2.167

Ramadhan, I., Sukarno, P., & Nugroho, M. A. (2020). Comparative Analysis of K-Nearest Neighbor and Decision Tree in Detecting Distributed Denial of Service. 2020 8th International Conference on Information and Communication Technology (ICoICT), 1–4. https://doi.org/10.1109/ICoICT49345.2020.9166380

Romadhon, M. R., & Kurniawan, F. (2021). A Comparison of Naive Bayes Methods, Logistic Regression and KNN for Predicting Healing of Covid-19 Patients in Indonesia. 2021 3rd East Indonesia Conference on Computer and Information Technology (EIConCIT), 41–44. https://doi.org/10.1109/EIConCIT50028.2021.9431845

Sheth, V., Tripathi, U., & Sharma, A. (2022). A Comparative Analysis of Machine Learning Algorithms for Classification Purpose. Procedia Computer Science, 215, 422–431. https://doi.org/10.1016/j.procs.2022.12.044

Sianturi, S. T., & Yuhana, U. L. (2022). Student Behaviour Analysis To Detect Learning Styles Using Decision Tree, Naïve Bayes, And K-Nearest Neighbor Method In Moodle Learning Management System. IPTEK The Journal for Technology and Science, 33(2), 94. https://doi.org/10.12962/j20882033.v33i2.13665

Tella, A., Balogun, A.-L., Adebisi, N., & Abdullah, S. (2021). Spatial assessment of PM10 hotspots using Random Forest, K-Nearest Neighbour and Naïve Bayes. Atmospheric Pollution Research, 12(10), 101202. https://doi.org/10.1016/j.apr.2021.101202

Wibowo, A. H., & Oesman, T. I. (2020). The comparative analysis on the accuracy of k-NN, Naive Bayes, and Decision Tree Algorithms in predicting crimes and criminal actions in Sleman Regency. Journal of Physics: Conference Series, 1450(1), 012076. https://doi.org/10.1088/1742-6596/1450/1/012076




DOI: http://dx.doi.org/10.30646/sinus.v22i2.833

Refbacks

  • There are currently no refbacks.


 


STMIK Sinar Nusantara

KH Samanhudi 84 - 86 Street, Laweyan Surakarta, Central Java, Indonesia
Postal Code: 57142, Phone & Fax: +62 271 716 500 

Email: ejurnal @ sinus.ac.id | https://p3m.sinus.ac.id/jurnal/e-jurnal_SINUS/

ISSN: 1693-1173 (print) | 2548-4028 (online)


Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

View My Stats