การเปรียบเทียบประสิทธิภาพการจำแนกภาพถ่ายโรคเบาหวานขึ้นจอประสาทตา ด้วยการสกัดคุณสมบัติเชิงพื้นผิวและการเรียนรู้ของเครื่อง

Chatchawarn Srimontree; Tanarat  Chotiphan

PDF

Published: Apr 30, 2026

Keywords:

Feature Selection Model Performance Evaluation Machine Learning Diabetic Retinopathy

Chatchawarn Srimontree

Computer Science Program, Faculty of Computer Science,Ubon Ratchathani Rajabhat University

Tanarat Chotiphan

Computer Science Program, Faculty of Computer Science,Ubon Ratchathani Rajabhat University

Abstract

This research aims to classify diabetic retinopathy in fundus images using machine learning combined with texture feature extraction and to compare the performance of various machine learning models for diabetic retinopathy classification. The study employed a dataset of 8,834 images published on www.kaggle.com, collected by Chaudhari (2020). According to the study, texture features were extracted from fundus images and converted into numerical statistical features. Redundant features were reduced using Spearman's rank correlation coefficient with a correlation threshold at 0.90 to improve data quality. The number of irrelevant features were further reduced using the ANOVA F-test by selecting the top 100 features with the highest F-scores showing most strongly correlated with the classification outcomes. Subsequently, feature importance analysis using Random Forest were applied to select statistical features with importance values ≥ 0.01, which were considered significant features for model development. This entire process resulted in 34 features selected from the original 316 features. To compare model performance, five machine learning models were trained using these selected features: Random Forest, eXtreme Gradient Boosting, Support Vector Machine, Neural Network, and K-Nearest Neighbors. The results demonstrate that the Extreme Gradient Boosting model achieved the highest performance with 99.34% accuracy, 99.35% overall performance, 99.98% sensitivity, and 98.71% specificity. This approach exhibited superior classification performance with enhanced interpretability compared to modern deep learning methods. Despite utilizing a smaller dataset than typically required by deep learning approaches, it maintained competitive classification performance while being particularly suitable for deployment in resource-constrained clinical environments.

Downloads

Download data is not yet available.

Issue

Vol. 21 No. 1 (2026): Science and Technology for Emerging Innovations in Praxis

Section

Research Articles

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright Notice

The copyright of research articles published in the VRU Research and Development Journal Science and Technology Journal belongs to the Research and Development Institute, Valaya Alongkorn Rajabhat University under the Royal Patronage. Reproduction of the content, in whole or in part, is prohibited without prior written permission from the university.

Responsibility
The content published in the VRU Research and Development Journal Science and Technology Journal is the sole responsibility of the author(s). The journal does not assume responsibility for errors arising from the printing process.

References

กมลชนก สีพาติ่ง, นนทวัฒน์ เประนาม, วีระศักดิ์ สว่างโลก, และอนุพงศ์ สุขประเสริฐ(2566). การจำแนกภาพ MRI สำหรับการคัดกรองผู้ป่วยเนื้องอกในสมอง. วารสารวิทยาการสารสนเทศและเทคโนโลยีประยุกต์, 5(2), 100–115.

กฤษณะ ชินสาร, สุวรรณา รัศมีขวัญ, เบญจภรณ์ จันทรกองกุล, ภูสิต กุลเกษม, อัณณ์นุพันธ์ รอดทุกข์,

จรรยา อ้นปันส์ ธนินท์ อินทรมณี, และ Khoeun, R. (2559). โครงการการคัดเลือกด้วยวิธีการเรียนรู้รวมแบบปรับตัวได้สำหรับการตรวจพบอาการเส้นเลือดฝอยโป่งพองและมีจุดเลือดออกในตาสำหรับผู้ป่วยเบาหวานในระยะเริ่มต้น. รายงานการวิจัย, คณะวิทยาการสารสนเทศ มหาวิทยาลัยบูรพา.

คุณานนต์ อ่ำทอง, คชินทร์ บุญทรัพย์, กษิตินาถ บุญพันธ์, และพิศณุ คูมีชัย. (2563). การพัฒนาระบบคัดแยกของเนื้องอกในสมองโดยการใช้เมตริกซ์ของระดับสีเทาที่เกิดขึ้นร่วมกันและการแปลงเวฟเลส. การประชุมวิชาการระดับชาติ ประจำปี 2563 เรื่อง การจัดการในยุคเทคโนโลยีนำการเปลี่ยนแปลง (30 สิงหาคม 2561). มหาวิทยาลัยเกษตรศาสตร์ วิทยาเขตศรีราชา.

มาลีรัตน์ โสดานิล, และอมรศักดิ์ อมรธนานันท์. (2555). การรู้จำใบหน้าแบบหลายมุมมองโดยใช้เทคนิคผสมผสานการแบ่งภาพและการจับคู่ภาพมุมมองจริง. วารสารเทคโนโลยีสารสนเทศ, 8(2), 33–38.

อนุพงศ์ สุขประเสริฐ. (2564). คู่มือการทำเหมืองข้อมูลด้วยโปรแกรม RapidMiner Studio. มหาสารคาม: คณะการบัญชีและการจัดการ มหาวิทยาลัยมหาสารคาม.

อนุพงศ์ สุขประเสริฐ, น้ำฝน ดังโพนทอง, และธีรยุทธ จันทรา. (2567). การสร้างตัวแบบสำหรับการคัดกรองผู้ป่วยโรคนิ่วในไตโดยใช้เทคนิคการจำแนกประเภทข้อมูลภาพ. วารสารนวัตกรรมวิทยาศาสตร์เพื่อการพัฒนาอย่างยั่งยืน, 5(1), 12–23.

อรรถกร มากคงแก้ว. (2555). การประยุกต์ใช้ลักษณะเด่นเชิงพื้นผิวแบบจีแอลซีเอ็มในการนับนิวเคลียสเซลล์มะเร็งเต้านม (วิทยานิพนธ์ปริญญามหาบัณฑิต). มหาวิทยาลัยสงขลานครินทร์.

อรรศฎาวุธ เรืองสวัสดิ์, อนุพงศ์ สุขประเสริฐ, ทิวา สินธุกูฏ, และศิริลักษณ์ ไกยวินิจ. (2566). การเปรียบเทียบประสิทธิภาพของตัวแบบสำหรับการพยากรณ์โรคมะเร็งปอด. วารสารวิทยาศาสตร์ เทคโนโลยี และนวัตกรรม มหาวิทยาลัยกาฬสินธุ์, 2(2), 39–52.

อัจฉรา แก้วน้ำเชื้อ. (2564). การพยาบาลผู้ป่วยโรคเบาหวานขึ้นจอประสาทตา. วารสารสุขภาพและสิ่งแวดล้อมศึกษา, 6(4), 177–180.

Ayyanar, M., Jeganathan, S., Parthasarathy, S., Jayaraman, V., & Lakshminarayanan, A. R. (2022). Predicting cardiac diseases using SelectKBest method equipped Light Gradient Boosting Machine. In 2022 6th International Conference on Trends in Electronics and Informatics (ICOEI) (pp. 117–122). IEEE.

Boonsaen, T., Choksakunwong, S., & Lertwattanarak, R. (2021). Prevalence of and factors associated with diabetic retinopathy in patients with diabetes mellitus at Siriraj Hospital – Thailand’s largest national tertiary referral center. Diabetes, Metabolic Syndrome and Obesity:Targets and Therapy, 14, 4945–4957. https://doi.org/10.2147/DMSO.S346719

Chaudhari, R. (2020). Diabetic retinopathy dataset [Data set]. Kaggle. Retrieved from https://www.kaggle.com/datasets/rutujachaudhari/diabetic-retinopathy

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp.785–794). Association for Computing Machinery.

Chollet, F., et al. (2015). Keras: Deep learning for humans (Version 2.15.0) [Computer software]. https://keras.io

Gajaram, A. M. (2022). An approach to classify ocular diseases using machine learning and deep learning (Master’s thesis). National College of Ireland.

Hassan, D., Gill, H. M., Happe, M., Bhatwadekar, A. D., Hajrasouliha, A. R., & Janga, S. C. (2022). Combining transfer learning with retinal lesion features for accurate detection of diabetic retinopathy. Frontiers in Medicine, 9, 1050436. https://doi.org/10.3389/fmed.2022.1050436

Jiang, J., Zhang, X., & Yuan, Z. (2024). Feature selection for classification with Spearman’s rank correlation coefficient-based self-information in divergence-based fuzzy rough sets. Expert Systems with Applications, 249(Part B), 123633. https://doi.org/10.1016/j.eswa.2024.123633

Kalyani, D., Reddy, J., Ramesh, C., Saraf, S., Kaushal, A., Ben Slimane, J., & Maqbool, A. (2025). Hybrid statistical-textural and intensity feature-based approach for accurate discrimination of retinal diseases. Journal of Information Systems Engineering & Management, 10, 245–263. https://doi.org/10.52783/jisem.v10i3s.385

Kumar, A. (2022). Learning texture features from GLCM for classification of brain tumor MRI images using random forest classifier. WSEAS Transactions on Signal Processing, 18, 60–63. https://doi.org/10.37394/232014.2022.18.8

OpenCV Contributors. (2025). opencv-python (Version 4.11.0.86) [Computer software]. Retrieved from https://pypi.org/project/opencv-python/

Pandas Development Team. (2024). Pandas: Python data analysis library (Version 2.2.3) [Computer software]. Retrieved from https://pandas.pydata.org/

Scikit-Image Contributors. (2025). Scikit-image (Version 0.25.2) [Computer software]. Retrieved from https://pypi.org/project/scikit-image/

SciPy Community. (2025). SciPy (Version 1.15.3) [Computer software]. Retrieved from https://pypi.org/project/scipy

Wang, H. (2023). Research on the application of random forest-based feature selection algorithm in data mining experiments. International Journal of Advanced Computer Science and Applications, 14(10), 466–473. https://doi.org/10.14569/IJACSA.2023.0141054

Waskom, M. L. (2021). seaborn: Statistical data visualization. Journal of Open Source Software, 6(60), Article 3021. https://doi.org/10.21105/joss.0302

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

References