DIABETES CLASSIFICATION USING  MACHINE LEARNING TECHNIQUES

Please use this identifier to cite or link to this item: http://ithesis-ir.su.ac.th/dspace/handle/123456789/4450

Title:	DIABETES CLASSIFICATION USING MACHINE LEARNING TECHNIQUES การจำแนกการเป็นโรคเบาหวานโดยใช้เทคนิค Machine learning
Authors:	Methaporn PHONGYING เมธาพร ผ่องยิ่ง Sasiprapa Hiriote ศศิประภา หิริโอตป์ Silpakorn University Sasiprapa Hiriote ศศิประภา หิริโอตป์ HIRIOTE_S@SU.AC.TH HIRIOTE_S@SU.AC.TH
Keywords:	การเรียนรู้ด้วยเครื่อง โรคเบาหวาน ต้นไม้ตัดสินใจ ต้นไม้ป่าสุ่ม ซัพพอร์ตเวกเตอร์แมชชีน เพื่อนบ้านใกล้ที่สุด machine learning diabetes Decision tree Random forest Support Vector Machine K-Nearest neighbor
Issue Date:	4
Publisher:	Silpakorn University
Abstract:	Nowadays, Machine learning techniques play an increasingly prominent role in medical diagnosis because using these techniques can be analyzed to find patterns or facts that are difficult to explain, which contributes to making the diagnosis more accurate. The purpose of this research is to compare the efficiency of diabetic classification models with and without interaction using four machine learning techniques including Decision tree, Random forest, Support Vector Machine and K-Nearest neighbor. These models are compared base on accuracy, precision, recall, and F1-score. The results of this research showed that the models with interaction have better classification performance than those without interaction for all 4 machine learning techniques. Among models with interaction, Random forest classifiers had the best performance with 97.5% accuracy, 97.4% precision, 96.6% recall, and 97% F1-score. In the same way, Random forest also had the best classification performance among models without interaction with 88.2% accuracy, 92.2% precision, 89.3% recall, and 90.7% F1-score. The findings from this research can be further developed into a program to effectively screen diabetes patients. ปัจจุบันเทคนิค Machine learning ได้เข้ามามีบทบาททางการแพทย์ในการวินิจฉัยโรคมากขึ้น เนื่องจากเราสามารถใช้เทคนิค Machine learning ในการวิเคราะห์ข้อมูลขนาดใหญ่ทางการแพทย์ เพื่อค้นหารูปแบบหรือข้อเท็จจริงบางอย่างที่ยากต่อการอธิบาย ซึ่งมีส่วนช่วยให้การวินิจฉัยโรคทำได้อย่างแม่นยำมากยิ่งขึ้น โดยในงานวิจัยนี้มีวัตถุประสงค์เพื่อเปรียบเทียบประสิทธิภาพของเทคนิคที่ใช้ในการสร้างแบบจําลอง Machine learning สำหรับการจำแนกการเป็นโรคเบาหวานกรณีที่พิจารณาและไม่พิจารณาอิทธิพลร่วม 4 เทคนิค ได้แก่ เทคนิคต้นไม้ตัดสินใจ (Decision tree) เทคนิคต้นไม้ป่าสุ่ม (Random Forest) เทคนิคซัพพอร์ตเวกเตอร์แมชชีน (Support Vector Machine) และเทคนิคเพื่อนบ้านใกล้ที่สุด (K-Nearest Neighbor) โดยมีเกณฑ์ที่ใช้ในการทดสอบประสิทธิภาพของการจำแนก คือ ค่าความถูกต้อง (accuracy) ค่าความเที่ยง (precision) ค่าความครบถ้วน (recall) และค่าคะแนน F1 (F1-score) ที่ให้ค่ามากที่สุด ซึ่งผลของการวิจัยพบว่าแบบจำลองกรณีที่พิจารณาอิทธิพลร่วมมีประสิทธิภาพการจำแนกดีกว่าแบบจำลองกรณีที่ไม่พิจารณาอิทธิพลร่วมทั้ง 4 เทคนิค โดยที่แบบจำลองกรณีที่พิจารณาอิทธิพลร่วม เทคนิค Random forest มีประสิทธิภาพการจำแนกดีที่สุด ซึ่งให้ค่าความถูกต้องในการจำแนก 97.5% มีค่าความแม่นยำที่ 97.4% มีค่าความครบถ้วนที่ 96.6% และค่าคะแนน F1 ที่ 97% ในทางเดียวกัน แบบจำลองกรณีที่ไม่พิจารณาอิทธิพลร่วม เทคนิค Random forest มีประสิทธิภาพการจำแนกดีที่สุด ซึ่งให้ค่าความถูกต้องในการจำแนก 88.2% มีค่าความแม่นยำที่ 92.2% มีค่าความครบถ้วนที่ 89.3% และค่าคะแนน F1 ที่ 90.7% โดยผลการวิจัยที่ได้นี้สามารถนำไปใช้เป็นแนวทางในการพัฒนาโปรแกรมสำหรับการคัดกรองผู้ป่วยโรคเบาหวานได้อย่างมีประสิทธิภาพต่อไปในอนาคต
URI:	http://ithesis-ir.su.ac.th/dspace/handle/123456789/4450
Appears in Collections:	Science

Files in This Item:

File	Description	Size	Format
630720073.pdf		3.16 MB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets