Finding Category Value Using Mean Shift Clustering to Optimize Naïve Bayes Classification

  • Berlian Rahmy Lidiawaty Telkom University
  • Arip Ramadan Telkom University
  • Tita Ayu Rospricilia Telkom University
  • Najma Attaqiya Alya Universitas Airlangga
  • Dwi Rantini Universitas Airlangga
  • Alhassan Sesay the United Methodist University
Keywords: Naïve Bayes, Mean Shift Clustering, Classification, Optimize, Education

Abstract

The Naïve Bayes classifier is a simple classification method that can make predictions quickly and accurately by considering the independent variables separately from the class. However, in the Naïve Bayes classifier, each independent variable must be divided into several categories, while some of the data remain continuous and uncategorized. Therefore, this study proposes a measurable and precise model to categorize these independent variables effectively. The main objective is to develop a categorization model for independent variables using the Mean Shift clustering algorithm to optimize the performance of the Naïve Bayes classifier. To implement the proposed model, experiments were conducted on two types of datasets. The first dataset contains 191 records with 4 attributes and 6 classes, while the second dataset consists of 2,000 records with 7 attributes and 2 classes. In both datasets, several attributes were initially uncategorized and were categorized using the Mean Shift clustering method. The Mean Shift approach successfully grouped the uncategorized attributes into meaningful categories. In the first dataset, the accuracy of the proposed categorical Naïve Bayes classifier reached 80.1%, representing an improvement of 5.74%. Furthermore, in the second dataset, the accuracy increased to 84.25%, marking a 3% enhancement. The results of this research are expected to contribute to the field of education, especially in the subfield of machine learning.
Published
2026-02-17
How to Cite
Lidiawaty, B. R., Ramadan, A., Rospricilia, T. A., Alya, N. A., Rantini, D., & Sesay, A. (2026). Finding Category Value Using Mean Shift Clustering to Optimize Naïve Bayes Classification. Statistics, Optimization & Information Computing. https://doi.org/10.19139/soic-2310-5070-3161
Section
Research Articles