Performance Comparasion of Adaboost and PSO Algorithms for Cervical Cancer Classification Using KNN Algorithm

Romdan Muhamad Ubaidilah, Sutedi Sutedi

Abstract


Abstract. Cervical cancer impacts the female reproductive organs and stands as the second most common cancer among women worldwide. The World Health Organization (WHO) reports that annually, approximately 500,000 women are diagnosed with cervical cancer, and about 300,000 die from it. Many of these deaths result from insufficient early detection and preventive measures. There are four primary screening techniques for detecting cervical cancer cells: Hinselmann, Schiller, Cytology, and Biopsy. In this study, patient health history data is analyzed using the KNN algorithm, which is further optimized with Adaboost and PSO techniques. These optimization strategies are evaluated to identify the most precise model for detecting patterns in cervical cancer patients and predicting their screening outcomes. This study employs the RapidMiner tool. Findings reveal that the KNN algorithm effectively performs multilabel classification, and when optimized with PSO, there is a slight improvement in accuracy.

Purpose: The aim of this research is to assess the performance of the K-Nearest Neighbor (KNN) algorithm in multilabel classification of cervical cancer and to optimize it using Adaboost and Particle Swarm Optimization (PSO) techniques. This research is significant as it offers a potentially more accurate diagnostic method for detecting cervical cancer using medical records.

Methods/Study design/approach: The Cervical Cancer Risk Classification dataset from Kaggle was used in this study. Data preprocessing was conducted before applying the KNN algorithm. The KNN algorithm's performance was evaluated using a 10-fold cross-validation method, and results were measured using the Confusion Matrix. Additionally, the KNN algorithm was optimized using Adaboost and PSO to assess improvements in accuracy.

Result/Findings: Experimental results indicated that the KNN algorithm achieved optimal accuracy with k=5, reaching 95.81%, 91.26%, 94.64%, and 93.01% for Hinselmann, Schiller, Cytology, and Biopsy targets, respectively. Adaboost did not significantly improve accuracy, while PSO slightly enhanced the Hinselmann target accuracy from 95.81% to 95.92%. The average training time for this experiment was around two minutes. These results demonstrate the effectiveness of the KNN algorithm in conducting multilabel classification for cervical cancer diagnosis.

Novelty/Originality/Value: This research demonstrates that optimizing the KNN algorithm with PSO can enhance accuracy, though not significantly. This suggests potential for further development to improve cervical cancer diagnostic accuracy. Testing the model with the latest data and optimizing parameters may lead to better models and useful tools for early cervical cancer diagnosis.


Keywords


Cervical Cancer, Classification, KNN, Adaboost, PSO

Full Text:

PDF

References


M. Aljurayfani, S. Alghernas, and A. Shargabi, “Medical Self-Diagnostic System Using Artificial Neural Networks,” in 2019 International Conference on Computer and Information Sciences (ICCIS), 2019, pp. 1–5. doi: 10.1109/ICCISci.2019.8716386.

J. A. Cruz and D. S. Wishart, “Applications of Machine Learning in Cancer Prediction and Prognosis,” Cancer Inform, vol. 2, p. 117693510600200030, Jan. 2006, doi: 10.1177/117693510600200030.

H. Lin, Y. Hu, S. Chen, J. Yao, and L. Zhang, “Fine-grained classification of cervical cells using morphological and appearance based convolutional neural networks,” IEEE Access, vol. 7, pp. 71541–71549, 2019, doi: 10.1109/ACCESS.2019.2919390.

M. M. Rahaman et al., “A survey for cervical cytopathology image analysis using deep learning,” 2020, Institute of Electrical and Electronics Engineers Inc. doi: 10.1109/ACCESS.2020.2983186.

H. Akbar, “KLASIFIKASI KANKER SERVIKS MENGGUNAKAN MODEL CONVOLUTIONAL NEURAL NETWORK (ALEXNET),” Jurnal Informatika dan Komputer) Akreditasi KEMENRISTEKDIKTI, vol. 4, no. 1, 2021, doi: 10.33387/jiko.

R. Kurniawan, D. E. K. Sasmito, and F. Suryani, “Klasifikasi Sel Serviks Menggunakan Analisis Fitur Nuclei pada Citra Pap Smear,” in Seminar Nasional Informatika Medis (SNIMed), 2013.

S. Rathod, J. Potdar, A. Gupta, N. Sethi, and A. Dande, “Empowering Women’s Health: Insights Into HPV Vaccination and the Prevention of Invasive Cervical Cancer,” Cureus, Nov. 2023, doi: 10.7759/cureus.49523.

J. C. Spencer, N. T. Brewer, T. Coyne-Beasley, J. G. Trogdon, M. Weinberger, and S. B. Wheeler, “Reducing poverty-related disparities in cervical cancer: The role of HPV vaccination,” Cancer Epidemiology Biomarkers and Prevention, vol. 30, no. 10, pp. 1895–1903, Oct. 2021, doi: 10.1158/1055-9965.EPI-21-0307.

A. Gates et al., “Screening for the prevention and early detection of cervical cancer: protocol for systematic reviews to inform Canadian recommendations,” Syst Rev, vol. 10, no. 1, Dec. 2021, doi: 10.1186/s13643-020-01538-9.

A. Kim, K. C. Chung, C. Keir, and D. L. Patrick, “Patient-reported outcomes associated with cancer screening: a systematic review,” BMC Cancer, vol. 22, no. 1, Dec. 2022, doi: 10.1186/s12885-022-09261-5.

R. T. Prasetio and S. Susanti, “Prediksi Harapan Hidup Pasien Kanker Paru Pasca Operasi Bedah Toraks Menggunakan Boosted k-Nearest Neighbor,” JURNAL RESPONSIF, vol. 1, no. 1, pp. 64–69, 2019, [Online]. Available: http://ejurnal.univbsi.id/index.php/jti

X. Feng, Y. Cai, and R. Xin, “Optimizing diabetes classification with a machine learning-based framework,” BMC Bioinformatics, vol. 24, no. 1, Dec. 2023, doi: 10.1186/s12859-023-05467-x.

N. D. Saputri, K. Khalid, and D. Rolliawati, “SISTEMASI: Jurnal Sistem Informasi Komparasi Penerapan Metode Bagging dan Adaboost pada Algoritma C4.5 untuk Prediksi Penyakit Stroke Comparison of Bagging and Adaboost Methods on C4.5 Algorithm for Stroke Prediction.” [Online]. Available: http://sistemasi.ftik.unisi.ac.id

L. Pebrianti, F. Aulia, H. Nisa, and K. Saputra, “Jurnal Sistem dan Teknologi Informasi Implementasi Metode Adaboost untuk Mengoptimasi Klasifikasi Penyakit Diabetes dengan Algoritma Naïve Bayes,” vol. 7, no. 2, 2022, [Online]. Available: http://jurnal.unmuhjember.ac.id/index.php/JUSTINDO

W. Yunus, “Algoritma K-Nearest Neighbor Berbasis Particle Swarm Optimization Untuk Prediksi Penyakit Ginjal Kronik,” Jurnal Teknik Elektro CosPhi, vol. 2, no. 2, pp. 2597–9329, 2018.

C. C. C. C. Eko Justino Wahyu, “The Application Of Particle Swarm Optimization Using Naive Bayes Method For Predicting Heart Disease,” Bandar Lampung, 2022. Accessed: Jul. 13, 2024. [Online]. Available: https://jurnal.darmajaya.ac.id/index.php/icitb/article/view/3395

H. Sabita and S. Trisnawati, “Perbandingan Algoritma Support Vector Machine dan AdaBoost Dalam Memprediksi Waktu Kelulusan Mahasiswa,” Teknika, vol. 17, no. 2, pp. 1–5, 2023, doi: https://doi.org/10.5281/zenodo.8220872.

A. S. W. G. Iqbal Muhammad Latief, “PREDIKSI TINGKAT PELANGGAN CHURN PADA PERUSAHAAN TELEKOMUNIKASI DENGAN ALGORITMA ADABOOST,” JURNAL INFORMATIKA, vol. 21, no. 1, 2021, doi: https://doi.org/10.30873/ji.v21i1.2867.

S. Tri Utami, S. Lestari, and H. Widi Nugroho, “PREDICTION OF ANEMIA USING THE PARTICLE SWARM OPTIMIZATION (PSO) AND NAÏVE BAYES ALGORITHM,” Jurnal CoreIT, vol. X, No.X, 2024, doi: 10.24014/coreit.v10i1.28428.

A. Ramadhan, E. Budianita, F. Syafria, and S. Ramadhani, “Feature Selection using Information Gain on the K-Nearest Neighbor (KNN) and Modified K-Nearest Neighbor (MKNN) Methods for Chronic Kidney Disease Classification,” Jurnal CoreIT: Jurnal Hasil Penelitian Ilmu Komputer dan Teknologi Informasi, vol. 9, no. 2, p. 17, Dec. 2023, doi: 10.24014/coreit.v9i2.26834.

P. Kumar et al., “Futuristic Trends in Network and Communication Technologies Communications in Computer and Information Science 1395,” 2020. [Online]. Available: http://www.springer.com/series/7899

S. M. Malakouti, M. B. Menhaj, and A. A. Suratgar, “The usage of 10-fold cross-validation and grid search to enhance ML methods performance in solar farm power generation prediction,” Clean Eng Technol, vol. 15, Aug. 2023, doi: 10.1016/j.clet.2023.100664.

S. M. Malakouti, “Babysitting hyperparameter optimization and 10-fold-cross-validation to enhance the performance of ML methods in predicting wind speed and energy generation,” Intelligent Systems with Applications, vol. 19, Sep. 2023, doi: 10.1016/j.iswa.2023.200248.




DOI: http://dx.doi.org/10.24014/coreit.v10i2.31711

Refbacks

  • There are currently no refbacks.




Creative Commons License  site stats  
Jurnal CoreIT by http://ejournal.uin-suska.ac.id/index.php/coreit/ is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.