Search Results

Found 133512 documents matching the query

Dian Puspita Sari

Klasifikasi sekuens protein Coronavirus penyebab COVID-19 menggunakan metode Particle Swarm Optimization-Support Vector Machine dan Seleksi Fitur Random Forest-Recursive Feature Elimination = Classification of coronavirus protein sequences cause COVID-19 disease using Particle Swarm Optimization-Support Vector Machine Method and Feature Selection of Random Forest-Recursive Feature Elimination

"Coronavirus yaitu kelompok virus yang menginfeksi sistem pernapasan yang dapat menyebabkan infeksi pernapasan ringan maupun berat. Salah satu virus yang termasuk ke dalam coronavirus adalah SARS-CoV-2. Penyakit yang disebabkan oleh virus SARS-CoV-2 disebut COVID-19. COVID-19 pertama kali terdeteksi pada tahun 2019 di Wuhan, China. Penyebaran COVID-19 sangat cepat dengan tingkat kematian yang tinggi terus terjadi di berbagai negara sehingga penyakit ini berstatus pandemi. Skripsi ini menyelesaikan masalah klasifikasi virus SARS-CoV-2 dengan menggunakan data sekuens protein coronavirus. Seleksi fitur pada data sekuens protein coronavirus menggunakan metode seleksi fitur Random Forest-Recurisive Feature Elimination (RF-RFE). Setelah dilakukan seleksi fitur, dilakukan klasifikasi menggunakan pendekatan machine learning dengan metode Support Vector Machine (SVM) dan Particle Swarm Optimization-Support Vector Machine (PSO-SVM). Hasil terbaik performa rata-rata akurasi, spesifisitas, dan sensitivitas untuk metode SVM berturut-turut adalah 93,43%, 98,06%, dan 88,84% pada data pelatihan sebesar 80%. Untuk metode PSO-SVM, hasil terbaik rata-rata akurasi dan spesifisitas adalah 98,48% dan 98,57% pada data pelatihan sebesar 80%, sedangkan hasil terbaik rata-rata sensitivitas adalah 98,96% pada data pelatihan sebesar 90%. Oleh karena itu, pada penelitian ini dapat disimpulkan bahwa metode PSO-SVM menghasilkan performa yang lebih baik dibandingkan dengan metode SVM.

Coronaviruses are a group of viruses that infect the respiratory system and can cause mild or severe respiratory infections. One of the viruses in this group is SARS-CoV-2, and the disease it causes is called COVID-19. COVID-19 was first detected in 2019 in Wuhan, China. It has spread very quickly, with a high mortality rate in many countries, so the disease has pandemic status. This thesis addresses the problem of classifying the SARS-CoV-2 virus using coronavirus protein sequence data. Features of the coronavirus protein sequence data are selected with the Random Forest-Recursive Feature Elimination (RF-RFE) method. After feature selection, classification is carried out using a machine learning approach with the Support Vector Machine (SVM) and Particle Swarm Optimization-Support Vector Machine (PSO-SVM) methods. The best average accuracy, specificity, and sensitivity for the SVM method are 93.43%, 98.06%, and 88.84%, respectively, with 80% of the data used for training. For the PSO-SVM method, the best average accuracy and specificity are 98.48% and 98.57% with 80% training data, while the best average sensitivity is 98.96% with 90% training data. This study therefore concludes that the PSO-SVM method performs better than the SVM method."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
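
The accuracy, specificity, and sensitivity figures reported in the record above are derived from confusion-matrix counts. The following is a minimal sketch, assuming binary labels in which 1 marks the positive class (for example SARS-CoV-2). The function name `binary_metrics` and the toy labels are illustrative assumptions and are not taken from the thesis.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return (accuracy, sensitivity, specificity) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = (tp + tn) / len(pairs)
    # Sensitivity: share of true positives that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: share of true negatives that were correctly rejected.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return accuracy, sensitivity, specificity


# Hypothetical toy labels: 4 positive samples, 6 negative samples.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
acc, sens, spec = binary_metrics(y_true, y_pred)
print(f"accuracy={acc:.2%} sensitivity={sens:.2%} specificity={spec:.2%}")
```

Reporting all three values matters because accuracy alone can hide a gap between the two classes, as in the SVM result above where specificity is much higher than sensitivity.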

Ghani Deori

Klasifikasi sekuens protein coronavirus penyebab COVID-19 menggunakan metode Naive Bayes dengan seleksi fitur Lasso = Classification of coronavirus protein sequences cause COVID-19 using Naive Bayes method with LASSO feature selection

"SARS-COV-2 merupakan jenis virus yang menyebabkan pandemi COVID-19. Pandemi COVID-19 pertama kali terdeteksi di Wuhan, Cina. Berdasarkan data World Health Organization (WHO), jumlah orang yang telah terpapar COVID-19 adalah 123.216.178 orang dan 2.714.517 orang meninggal akibat COVID-19 berdasarkan data www.who.int pada tanggal 23 Maret 2021. Pada skripsi ini, dilakukan klasifikasi untuk SARS-COV-2 dengan menggunakan sekuens protein dari SARS-COV-2. Sekuens protein SARS-COV- 2 di ekstraksi fitur dengan menggunakan package discere dari Python. Package discere akan menghasilkan 27 fitur, dimana fitur-fitur diseleksi dengan menggunakan metode LASSO (Least Absolute Shrinkage and Selection Operator). Setelah dilakukan seleksi fitur, dilakukan klasifikasi dengan menggunakan dua metode, yaitu metode Absolute Correlation Weighted Naïve Bayes dan metode Naïve Bayes. Rata-rata akurasi, sensitifitas, dan spesifisitas tertinggi untuk metode Absolute Correlation Weighted Naïve Bayes berturut-turut adalah 81,85%, 74,81%, dan 89,19%, sedangkan rata-rata akurasi, sensitifitas, dan spesifisitas tertinggi untuk metode Naïve Bayes berturut-turut adalah 81,44%, 74,58%, dan 88,24%. Terlihat bahwa metode Absolute Correlation Weighted Naïve Bayes mempunyai rata-rata akurasi, sensitifitas, dan spesifisitas yang lebih tinggi dibandingkan dengan metode Naïve Bayes.

SARS-CoV-2 is the virus that causes the COVID-19 pandemic. COVID-19 was first detected in Wuhan, China. According to World Health Organization (WHO) data from www.who.int as of March 23, 2021, 123,216,178 people had been exposed to COVID-19 and 2,714,517 people had died from it. In this thesis, SARS-CoV-2 is classified using its protein sequences. Features are extracted from the SARS-CoV-2 protein sequences using the discere package for Python. The discere package produces 27 features, which are then selected using the LASSO (Least Absolute Shrinkage and Selection Operator) method. After feature selection, classification is carried out using two methods, namely the Absolute Correlation Weighted Naïve Bayes method and the Naïve Bayes method. The highest average accuracy, sensitivity, and specificity for the Absolute Correlation Weighted Naïve Bayes method are 81.85%, 74.81%, and 89.19%, respectively, whereas those for the Naïve Bayes method are 81.44%, 74.58%, and 88.24%, respectively. The Absolute Correlation Weighted Naïve Bayes method thus has a higher average accuracy, sensitivity, and specificity than the Naïve Bayes method."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Amanda Rizki Bagasta

Klasifikasi data infark serebri menggunakan support vector machine dan seleksi fitur relief = Classification of cerebral infarction using support vector machine and relief feature selection / Amanda Rizki Bagasta

"ABSTRAK

Infark Serebri adalah kondisi dari suatu jaringan otak yang tidak teralirkan darah sehingga sel-sel otak tersebut kekurangan oksigen dan nutrisi. Hal ini dapat mengakibatkan kerusakan bahkan kematian sel-sel otak dan perlu dengan segera mendapatkan penanganan. Keadaan ini sering dikenal sebagai Stroke, dimana pada penulisan ini akan berfokus pada data stroke nonhemoragik (stroke tidak berdarah) yang diakibatkan penyumbatan pembuluh darah di otak. Biasanya penyakit ini dapat dikenali dari gejala kelumpuhan suatu bagian tubuh atau kesulitan menggunakan suatu alat indra. Menurut para ahli, penyakit ini harus dicegah sejak dini karena dapat berakibat fatal bagi keseluruhan fungsional tubuh. Salah satu tindakan yang dapat dilakukan sejak dini adalah mendeteksi kemungkinan penyakit agar dapat dilakukan penanganan secara tepat dan cepat. Dalam penelitian ini, Infark Serebri dideteksi dengan mengklasifikasi ada atau tidaknya sel abnormal pada jaringan otak pada hasil CT Scan otak pasien menggunakan Support Vector Machine dengan Seleksi Fitur RELIEF. Data yang digunakan berupa data numerik dari pasien yang melakukan pemeriksaan di RSUPN dr. Cipto Mangunkusumo Jakarta dalam bentuk hasil CT Scan otak. Terdapat Sembilan fitur indikator yang digunakan dan diproses dengan membandingkan Support Vector Machine dengan dan tanpa seleksi fitur RELIEF. Berdasarkan hasil uji coba, metode yang diusulkan mampu mencapai akurasi sebesar 95,23%. Sehingga, penggunaan seleksi fitur RELIEF pada SVM merupakan metode yang baik untuk menklasifikasi infark serebri.

ABSTRACT

Cerebral infarction is a condition in which brain tissue receives no blood flow, so that the brain cells lack oxygen and nutrients. This can damage or even kill brain cells and requires immediate treatment. The condition is often known as stroke; this work focuses on data on non-hemorrhagic (non-bleeding) strokes caused by blockage of blood vessels in the brain. The disease can usually be recognized by paralysis of a body part or difficulty using one of the senses. According to experts, the disease must be prevented early because it can be fatal to the functioning of the whole body. One action that can be taken early is to detect the possibility of the disease so that it can be handled appropriately and quickly. In this study, cerebral infarction is detected by classifying the presence or absence of abnormal cells in brain tissue in patients' brain CT scan results, using a Support Vector Machine with RELIEF feature selection. The data are numerical data from patients examined at RSUPN dr. Cipto Mangunkusumo Jakarta, in the form of brain CT scan results. Nine indicator features are used and processed by comparing the Support Vector Machine with and without RELIEF feature selection. Based on the experimental results, the proposed method achieves an accuracy of 95.23%. Thus, RELIEF feature selection with SVM is a good method for classifying cerebral infarction.

2019

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Situmeang, Jason Nimrod Joshua

Clustering Varian Sekuens Protein SARS-CoV-2 Menggunakan Algoritma BIRCH dengan Seleksi Fitur LASSO = Clustering of SARS-CoV-2 Protein Sequence Variants Using BIRCH Algorithm with LASSO Feature Selection

Penelitian ini bertujuan untuk melakukan pengelompokan varian virus SARS-CoV-2 melalui proses clustering menggunakan metode unsupervised learning. Data yang digunakan adalah sekuens protein SARS-CoV-2 yang diekstraksi fiturnya menggunakan paket Discere dalam bahasa pemrograman Python. Sebanyak 27 fitur dihasilkan dan diseleksi dengan metode seleksi fitur Least Absolute Shrinkage and Selection Operator (LASSO). Metode Elbow digunakan untuk menentukan jumlah cluster yang optimal. Dalam penelitian ini, digunakan metode clustering K-Means dan Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH). Evaluasi hasil clustering dilakukan menggunakan metrik evaluasi Silhouette Score dan Davies-Bouldin Index, serta memperhatikan waktu runtime untuk setiap simulasi. Hasil evaluasi kemudian dibandingkan untuk melihat perbedaan performa antara kedua metode clustering yang digunakan, serta pengaruh seleksi fitur terhadap performa clustering. Hasil terbaik diperoleh pada simulasi dengan metode clustering BIRCH + LASSO, dengan nilai Silhouette Score 0,74186 untuk jumlah cluster k=4 dan 0,73207 untuk k=5. Nilai Davies-Bouldin Index terbaik juga diperoleh pada simulasi tersebut, yaitu 0,42697 untuk k=4 dan 0,37949 untuk k=5. Waktu runtime terbaik tercatat pada simulasi dengan metode K-Means + LASSO, yaitu 0,21551 detik untuk k=4 dan 0,17539 detik untuk k=5. Dapat disimpulkan bahwa metode BIRCH menghasilkan cluster yang lebih baik berdasarkan metrik evaluasi, namun K-Means memberikan proses clustering yang lebih cepat. Seleksi fitur dengan metode LASSO juga membantu meningkatkan performa clustering.

This study aims to perform clustering of SARS-CoV-2 virus variants using unsupervised learning methods. The data used consists of SARS-CoV-2 protein sequences whose features are extracted using the Discere package in the Python programming language. A total of 27 features are generated and selected using the Least Absolute Shrinkage and Selection Operator (LASSO) feature selection method. The Elbow method is employed to determine the optimal number of clusters for the clustering process. The clustering methods used in this research are K-Means clustering and Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH). The clustering results are evaluated using the Silhouette Score and Davies-Bouldin Index metrics, while also considering the runtime for each simulation. The evaluation results are then compared to examine the performance differences between the two clustering methods and the impact of feature selection on clustering performance. The best Silhouette Score is obtained in the simulation using the BIRCH + LASSO clustering method, with a value of 0.74186 for k=4 and 0.73207 for k=5. The best Davies-Bouldin Index is also achieved in the same simulation, with values of 0.42697 for k=4 and 0.37949 for k=5. The fastest runtime is recorded in the simulation using the K-Means + LASSO method, with a time of 0.21551 seconds for k=4 and 0.17539 seconds for k=5. In conclusion, the BIRCH method yields better clustering results based on the evaluation metrics, while K-Means provides faster clustering processes. The LASSO feature selection method also aids in improving clustering performance.
"

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
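
The Davies-Bouldin Index used in the record above to compare K-Means and BIRCH rewards clusters that are compact and far apart, and lower values are better. The following is a minimal sketch, assuming Euclidean distance and clusters given as lists of points. The function names and the toy clusters are illustrative assumptions and do not come from the thesis.

```python
import math


def centroid(points):
    """Component-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def davies_bouldin(clusters):
    """Davies-Bouldin Index for a list of clusters (each a list of points)."""
    centers = [centroid(c) for c in clusters]
    # Scatter: mean distance of each cluster's points to its own centroid.
    scatter = [
        sum(math.dist(p, ctr) for p in c) / len(c)
        for c, ctr in zip(clusters, centers)
    ]
    k = len(clusters)
    total = 0.0
    for i in range(k):
        # Worst-case similarity of cluster i to any other cluster.
        total += max(
            (scatter[i] + scatter[j]) / math.dist(centers[i], centers[j])
            for j in range(k)
            if j != i
        )
    return total / k


# Hypothetical toy data: the same two clusters, far apart and close together.
far_apart = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 0.0), (10.0, 2.0)]]
close_together = [[(0.0, 0.0), (0.0, 2.0)], [(3.0, 0.0), (3.0, 2.0)]]
db_far = davies_bouldin(far_apart)
db_close = davies_bouldin(close_together)
print(db_far, db_close)
```

The well-separated clustering gets the lower index, which is the direction in which the values reported above should be read.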

Febiola Damayanti

Klasifikasi Sekuens Protein Coronavirus Penyebab COVID-19 Menggunakan Metode LightGBM dengan Seleksi Fitur Elastic Net = Coronavirus Protein Sequence Classification Causes of COVID-19 Using the LightGBM Method with Elastic Net Feature Selection

"Pandemi COVID-19 (coronavirus disease 2019) membuat para peneliti di seluruh dunia bekerja untuk memahaminya dengan menerapkan pendekatan machine learning. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) merupakan penyebab dari COVID-19. Penelitian ini membahas klasifikaasi sekuens protein SARS-CoV-2 menggunakan metode LightGBM dan Elastic Net. Metode LightGBM merupakan metode gradient boosting yang cepat dan memiliki high-performance berbasis decision tree untuk melakukan prediksi. Total data sekuens protein yang digunakan adalah 2000 data yang diambil dari situs Uniprot. Uniprot merupakan salah satu situs yang digunakan terkait bioinformatika atau sumber daya sekuens protein dan informasi fungsional yang memiliki kualitas tinggi, komprehensif dan dapat diakses secara bebas. Data tersebut memiliki perincian yaitu 1000 data sekuens protein SARS-CoV-2 dan 1000 data sekuens protein bukan SARS-CoV-2. Python package Discere digunakan untuk mengekstraksi 27 fitur sekuens protein. Selanjutnya, Elastic Net digunakan untuk memilih fitur-fitur yang optimal dan terpilih sebanyak 10 fitur. Terakhir, LightGBM digunakan sebagai metode klasifikasi sekuens protein SARS-CoV-2. Hasil evaluasi performa LightGBM diukur dari akurasi, sensitivitas, dan spesifisitas. Nilai rata-rata akurasi diperoleh 98,87%, nilai rata-rata sensitivitas diperoleh 99,02%, dan nilai rata-rata spesifisitas diperoleh 98,82%

The COVID-19 (coronavirus disease 2019) pandemic has researchers around the world working to understand it by applying machine learning approaches. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. This research discusses the classification of SARS-CoV-2 protein sequences using the LightGBM and Elastic Net methods. LightGBM is a fast, high-performance gradient boosting method based on decision trees for making predictions. A total of 2000 protein sequences taken from the UniProt site are used. UniProt is a high-quality, comprehensive, and freely accessible bioinformatics resource for protein sequences and functional information. The data consist of 1000 SARS-CoV-2 protein sequences and 1000 non-SARS-CoV-2 protein sequences. The Python package Discere is used to extract 27 protein sequence features. Next, Elastic Net is used to select the optimal features, and 10 features are selected. Finally, LightGBM is used as the classification method for SARS-CoV-2 protein sequences. The performance of LightGBM is evaluated by accuracy, sensitivity, and specificity. The average accuracy was 98.87%, the average sensitivity was 99.02%, and the average specificity was 98.82%."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Fiftitah Repfian Aszhari

"Klasifikasi Data Stroke Menggunakan Random Forest dengan Recursive Feature Elimination" = "Classification of Stroke Data Using Random Forest with Recursive Feature Elimination"

Stroke merupakan salah satu penyakit dengan risiko kematian dan kecacatan yang tinggi. Secara umum, stroke diklasifikasikan menjadi dua jenis, yaitu stroke iskemik dan stroke hemoragik. Klasifikasi jenis stroke secara cepat dan tepat diperlukan untuk menentukan jenis pengobatan dan tindakan yang tepat guna mencegah terjadinya dampak yang lebih fatal pada pasien stroke. Pada penelitian ini, klasifikasi stroke dilakukan menggunakan pendekatan machine learning. Adapun data penelitian yang digunakan adalah data stroke yang terdiri atas pemeriksaan laboratorium. Pada data penelitian tersebut, terdapat berbagai komponen pemeriksaan laboratorium yang dicatat serta memungkinkan adanya suatu pemeriksaan yang kurang relevan atau informatif dalam mengklasifikasi stroke. Apabila data tersebut tidak ditangani, akan mempengaruhi kinerja serta waktu komputasi model dalam mengklasifikasi stroke. Oleh karena itu, pada penelitian ini, Random Forest (RF) dengan seleksi fitur Recursive Feature Elimination (RFE) digunakan dalam mengklasifikasi data stroke. Dengan menerapkan metode tersebut, diperoleh kinerja model yang lebih baik saat melakukan klasifikasi menggunakan sejumlah fitur yang diperoleh dari hasil seleksi fitur, dibandingkan menggunakan keseluruhan fitur dalam data stroke. Selain itu, pada penerapan metode tersebut, diperoleh kinerja model yang baik dalam mengklasifikasi data kelas stroke iskemik, akan tetapi tidak cukup baik dalam mengklasifikasi data kelas stroke hemoragik. Hal ini dikarenakan proporsi jumlah data pada kelas stroke iskemik lebih banyak dibandingkan stroke hemoragik. Dalam hal ini dibutuhkan suatu metode penanganan agar kinerja model tetap optimal dalam mengklasifikasi data kelas stroke iskemik dan stroke hemoragik. Pada penelitian ini, Synthetic Minority Oversampling Technique (SMOTE) digunakan untuk menyeimbangkan kedua kelas data stroke guna memperoleh kinerja model yang optimal dalam mengklasifikasi kedua kelas data stroke. 
Berdasarkan penerapan metode RF dengan RFE serta SMOTE dalam mengklasifikasi data stroke, diperoleh kinerja model yang lebih baik dibandingkan melakukan klasifikasi pada data stroke yang tidak diseimbangkan dengan SMOTE.

Stroke is a disease with a high risk of death and disability. Stroke is generally classified into two types, namely ischemic stroke and hemorrhagic stroke. Quick and accurate classification of the stroke type is needed to determine the right treatment and prevent more severe effects on stroke patients. In this study, stroke classification is performed using a machine learning approach. The data used are stroke data consisting of laboratory examinations. The data record various laboratory examination components, so some components may be less relevant or less informative for classifying stroke. If such data are not handled, they may affect the performance and computation time of the model. Therefore, in this study, Random Forest (RF) with Recursive Feature Elimination (RFE) feature selection is used to classify the stroke data. With this method, classification using the features obtained from feature selection performs better than classification using all features in the stroke data. Moreover, the model performs well in classifying the ischemic stroke class but not well enough in classifying the hemorrhagic stroke class. This is because the ischemic stroke class contains more data than the hemorrhagic stroke class. A handling method is therefore needed so that the model performs well on both classes. In this study, the Synthetic Minority Oversampling Technique (SMOTE) is applied to balance the two classes of stroke data so that optimal classification performance can be obtained on both.
Applying RF with RFE together with SMOTE to the stroke data yields better model performance than classifying stroke data that is not balanced with SMOTE.

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2020

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
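
The core idea of SMOTE, used in the record above to balance the ischemic and hemorrhagic classes, is to create synthetic minority samples by interpolating between a minority sample and one of its nearest minority neighbours. The following is a simplified sketch of that idea and not the implementation used in the thesis. The function name `smote`, the neighbour count `k=2`, the fixed seed, and the toy points are all assumptions made for illustration.

```python
import math
import random


def smote(minority, n_new, k=2, seed=0):
    """Create n_new synthetic minority points by neighbour interpolation."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbours of the base point, excluding itself.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position on the segment from base to neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: 10 majority points and 4 minority points.
majority = [(float(x), 5.0) for x in range(10)]
minority = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
new_points = smote(minority, n_new=len(majority) - len(minority))
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))
```

Because every synthetic point lies on a segment between two real minority points, the new samples stay inside the region the minority class already occupies. Oversampling of this kind is normally applied to the training split only, so that the test data remain untouched.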

Mufarrido Husnah

Klasifikasi sekuens protein coronavirus menggunakan Metode K-Nearest Neighbor dan seleksi fitur algoritma genetika = Classification of coronavirus protein sequences using K-Nearest Neighbor method and feature selection genetic algorithm

"Coronavirus (CoV) adalah keluarga virus penyebab penyakit sistem pernapasan ringan hingga berat pada berbagai spesies hewan termasuk manusia. Salah satu spesies Coronavirus yang muncul pada akhir tahun 2019 yaitu SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2) dan menimbulkan penyakit baru bernama Covid-19 (Coronavirus disease-2019) kemudian berstatus pandemi. Penyebaran Covid-19 yang cepat dan dengan tingkat kematian yang tinggi terus terjadi di berbagai negara. Oleh karena itu, deteksi dini patogen perlu dilakukan secara cepat dengan menggunakan data sekuens protein Coronavirus. Sekuens protein merupakan data struktur primer dari suatu protein yang memiliki 27 fitur berdasarkan discere. Dalam penerapannya, tidak semua fitur relevan dengan data yang digunakan sehingga perlu seleksi fitur untuk menghindari dimensi data yang tinggi dan tidak optimal. Seleksi fitur algoritma genetika memberikan fitur-fitur optimal pada data dan metode K-Nearest Neighbor (KNN) melakukan klasifikasi data sekuens protein Coronavirus dengan fitur hasil seleksi fitur algoritma genetika. Seleksi fitur algoritma genetika menghasilkan 11 fitur optimal yang meningkatkan performa running time metode klasifikasi KNN menjadi 0,0541 detik. Fitur optimal diperoleh dari karakteristik AA-count , secondary structure fraction , isoelectric point dan instability index. Hasil terbaik performa akurasi, spesifisitas beserta sensitifitas secara berurutan yaitu 96,68%, 98,7% dan 94,4% yang diperoleh pada nilai parameter K=3.

Coronaviruses (CoV) are a family of viruses that cause mild to severe respiratory diseases in various animal species, including humans. One Coronavirus species, SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2), emerged at the end of 2019 and caused a new disease called Covid-19 (Coronavirus disease-2019), which then reached pandemic status. Covid-19 continues to spread rapidly, with a high death rate, in many countries. Therefore, early detection of the pathogen needs to be done quickly using Coronavirus protein sequence data. A protein sequence is the primary structure data of a protein and has 27 features, but not all of these features are relevant to the data used, so feature selection is necessary to avoid high and suboptimal data dimensionality. Genetic algorithm feature selection provides the optimal features, and the K-Nearest Neighbor (KNN) method classifies the Coronavirus protein sequence data using the selected features. The genetic algorithm feature selection produces 11 optimal features that improve the running time of the KNN classification method, with an average running time of 0.0541 seconds. The best accuracy, specificity, and sensitivity are 96.68%, 98.7%, and 94.4%, respectively, obtained at the parameter value K=3."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
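
The K-Nearest Neighbor step in the record above, with K=3, assigns a sample the majority label among its three closest training samples. The following is a minimal sketch, assuming Euclidean distance on numeric feature vectors. The function name `knn_predict`, the two-dimensional toy features, and the labels "A" and "B" are illustrative assumptions and not data from the thesis.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Predict the label of query by majority vote among its k nearest points."""
    ranked = sorted(
        zip(train_X, train_y),
        key=lambda pair: math.dist(pair[0], query),
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy training set with two well-separated classes.
train_X = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0), (6.0, 5.0)]
train_y = ["A", "A", "A", "B", "B", "B"]
label_near_a = knn_predict(train_X, train_y, (0.5, 0.5))
label_near_b = knn_predict(train_X, train_y, (5.5, 5.5))
print(label_near_a, label_near_b)
```

Every prediction computes a distance to each training sample over all features, which is one reason reducing the feature count, as the genetic algorithm selection does above, can shorten running time.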

Dhita Putri Pratama

Implementasi sistem klasifikasi dan diagnostik coronavirus disease 2019 (covid-19) dengan citra toraks = The implementation of classification and diagnostic system for coronavirus disease (covid-19) with thorax images.

"COVID-19 telah merenggut nyawa banyak manusia. Tercatat per tanggal 29 Juni 2021, sudah terdapat sekitar 3,923,238 pasien yang meninggal dunia akibat penyakit dengan tingkat penularan yang tinggi ini. Dengan semakin banyaknya orang yang terinfeksi COVID-19, persediaan alat untuk mendeteksi penyakit ini pun juga semakin terbatas yang dapat menyebabkan pandemi COVID-19 pun menjadi semakin tidak terkendali. Oleh karena itu, penting untuk mempertimbangkan metode deteksi COVID-19 lainnya yang dapat membantu para staf kesehatan untuk melakukan deteksi pasien positif COVID-19.

Metode deteksi COVID-19 lainnya yang bisa dipertimbangkan untuk dikembangkan adalah metode deteksi COVID-19 dengan artificial intelligence. Dengan metode tersebut, data-data seperti data gejala pasien, data citra toraks, serta data interpretasi citra berupa teks dapat dimanfaatkan untuk mengembangkan suatu model prediksi COVID-19. Ketiga tipe data yang berbeda tersebut dapat dikombinasikan sebagai data input untuk membangun suatu model klasifikasi COVID-19. Pengkombinasian data yang berbeda dapat dilakukan dengan cara melakukan konkatenasi pada tiap input layer yang menerima data gejala dan data teks dengan suatu layer dari arsitektur CNN. Beberapa arsitektur CNN yang dapat digunakan pada penelitian ini adalah ResNet, DenseNet, Inception-ResNet, DarkCovidNet, CoroNet, dan COVID-Net. Selain itu, metode Grad-CAM juga dipilih untuk proses deteksi persebaran coronavirus.

Hasil pengujian menunjukkan bahwa banyak model yang dihasilkan dari pendekatan kombinasi data gambar, data tabular, dan data teks memiliki nilai sensitivitas, akurasi, serta f1-score yang tinggi. Hal tersebut menunjukkan bahwa pendekatan tersebut secara umum menghasilkan model-model dengan performa yang tinggi juga seimbang. Namun, berdasarkan hasil pengujian pula, diketahui bahwa model yang memiliki performa tertinggi dicapai oleh model dari pendekatan klasifikasi gambar dengan data tabular yang menggunakan arsitektur DenseNet khususnya dengan nilai learning rate = 10-3. Model tersebut tercatat memiliki performa yang tinggi dan seimbang dengan nilai sensitivitasnya mencapai angka 1,00, akurasi mencapai angka 0,94, dan F1-Score mencapai angka 0,94.

COVID-19 has taken many lives. As of June 29, 2021, approximately 3,923,238 people had died from this highly contagious disease. As the number of infected people grows, supplies of COVID-19 detection tools are becoming limited, which can make the pandemic increasingly uncontrollable. Therefore, it is important to consider alternative COVID-19 detection methods that can help health workers detect COVID-19-positive patients.
Another COVID-19 detection method that can be considered for development is detection with artificial intelligence. With artificial intelligence, a COVID-19 prediction model can be built from available data such as patient symptom data, patient thorax images (especially chest X-rays), and thorax image interpretations in text form. These three types of data can be combined as input to build a COVID-19 classification model. The combination can be done by concatenating each input layer that receives tabular or text data with a layer from a CNN architecture. Six CNN architectures are used in this study: ResNet, DenseNet, Inception-ResNet, DarkCovidNet, CoroNet, and COVID-Net. In addition, the Grad-CAM technique is implemented for coronavirus detection purposes.
The results show that many of the models built from the combined image, tabular, and text data achieve high sensitivity, accuracy, and F1-score. This means that the combined approach generally produces models with high and balanced performance. However, according to the test results, the best-performing model comes from the combined image and tabular data approach with the DenseNet architecture and a learning rate of \(10^{-3}\). This model achieves an accuracy of 0.94, a sensitivity of 1.00, and an F1-score of 0.94.
"

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Muhamad Raihan Fikriansyah

Pengembangan Desain Fitur Pencegahan Dan Penanganan COVID-19 Yang Berkelanjutan Untuk Aplikasi PeduliLindungi Dengan Pendekatan Design Science Research = The Development of Sustainable COVID-19 Prevention and Handling Feature Designs for PeduliLindungi Application using Design Science Research Approach

"Sebagai upaya mengatasi pandemi COVID-19 di Indonesia, Pemerintah Republik Indonesia mewajibkan masyarakat untuk menggunakan aplikasi PeduliLindungi sebagai solusi utama dalam pencegahan dan penanganan COVID-19. PeduliLindungi dirancang untuk memenuhi segala kebutuhan terkait COVID-19, tetapi pada kenyataannya masih ditemukan permasalahan dari segi teknis dan user experience (UX) yang membuat implementasinya tidak sesuai yang diharapkan. Selain itu, kegunaan aplikasi ini dipertanyakan ketika pandemi COVID-19 berakhir di masa depan. Melihat publikasi terkait aplikasi PeduliLindungi sampai saat ini, belum ditemukan publikasi yang membahas inovasi yang dibutuhkan agar aplikasi ini dapat terus bermanfaat saat pandemi COVID-19 dan setelahnya. Oleh karena itu, penelitian ini bertujuan untuk menjadi penelitian pertama yang mengidentifikasi kebutuhan pengguna terkait aplikasi PeduliLindungi yang belum terpenuhi, serta merancang inovasi untuk membuat aplikasi ini bermanfaat secara berkelanjutan menyusul rencana transisi ke endemi COVID-19. Pendekatan Design Science Research (DSR) digunakan untuk mencapai tujuan tersebut karena telah terbukti kecakapannya pada penelitian-penelitian sebelumnya. Melalui penerapan DSR dalam tiga iterasi, penelitian ini akan (1) menganalisis faktor-faktor yang memengaruhi pengguna untuk menggunakan aplikasi PeduliLindungi dengan menguji model penelitian, (2) mengidentifikasi permasalahan dan peluang inovasi fitur baru untuk aplikasi PeduliLindungi menurut pendapat ahli, kemudian (3) merancang antarmuka fitur baru tersebut dan mengevaluasinya ke pengguna. Hasil Iterasi I menunjukkan bahwa perceived usefulness, perceived ease of use, trust in technology, perceived anxiety of infecting others, dan health information orientation memengaruhi niat penggunaan aplikasi PeduliLindungi. 
Hasil Iterasi II mengungkapkan permasalahan relevan terkait kebutuhan pengguna dan keberlanjutan aplikasi PeduliLindungi, seperti masyarakat yang mulai lengah dengan dampak dan perkembangan COVID-19 dan juga kurangnya kesadaran masyarakat akan risiko kesehatan mereka dalam pandemi COVID-19. Kemudian, pelaksanaan Iterasi III menghasilkan rekomendasi desain antarmuka untuk fitur yang paling layak, baik dari sisi usability maupun keberlanjutannya, berdasarkan evaluasi terhadap pengguna aplikasi PeduliLindungi. Melalui penelitian ini, aplikasi PeduliLindungi diharapkan dapat berkembang menjadi aplikasi andalan masyarakat tidak hanya untuk kebutuhan terkait COVID-19 saat ini, tetapi juga kebutuhan pelayanan kesehatan lain di masa depan. Penelitian ini juga diharapkan dapat membuka jalan bagi penelitian selanjutnya untuk membahas keberlanjutan aplikasi PeduliLindungi.

In an effort to overcome the COVID-19 pandemic in Indonesia, the Indonesian government mandates the use of the PeduliLindungi application as the main solution for COVID-19 prevention and handling. PeduliLindungi is designed to meet all public needs related to COVID-19, but in practice technical and user experience (UX) problems remain that keep its implementation from going as expected. Moreover, the usefulness of the app is called into question once the COVID-19 pandemic ends. Among the publications on PeduliLindungi to date, none has discussed the innovations needed for this application to remain beneficial during the COVID-19 pandemic and beyond. Thus, this study aims to be the first to identify unmet user needs related to PeduliLindungi and to design innovations that keep PeduliLindungi useful in a sustainable manner following the planned transition to the COVID-19 endemic phase. The Design Science Research (DSR) approach is used to achieve this goal because it has proven effective in previous studies. By applying DSR in three iterations, this research (1) analyses the factors that influence users to use PeduliLindungi by testing a research model, (2) identifies problems and opportunities for new-feature innovation in PeduliLindungi according to expert opinion, and (3) designs the interface of the new feature and evaluates it with users. The results of Iteration I show that perceived usefulness, perceived ease of use, trust in technology, perceived anxiety of infecting others, and health information orientation affect the intention to use PeduliLindungi. 
The results of Iteration II reveal relevant issues concerning user needs and the sustainability of PeduliLindungi, such as people becoming complacent about the impact and development of COVID-19 and a lack of public awareness of their own health risks during the COVID-19 pandemic. Iteration III then produces interface design recommendations for the most appropriate features, in terms of both usability and sustainability, based on evaluations with PeduliLindungi users. Through this research, PeduliLindungi is expected to develop into the public's mainstay application, not only for current needs related to COVID-19 but also for other health service needs in the future. This research is also expected to pave the way for further research on the sustainability of PeduliLindungi."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Alifah

Klasifikasi retinopati diabetik menggunakan Support Vector Machine (SVM) dengan metode seleksi fitur Recursive Feature Elimination (RFE) dan chi-square = Classification of retinopathy diabetic using Support Vector Machine (SVM) with feature selection method Recursive Feature Elimination (RFE) and chi-square

"Diabetes Melitus (DM) merupakan gangguan sistem metabolik akibat pankreas tidak memproduksi cukup insulin atau tubuh tidak mampu menggunakan insulin yang ada secara efektif. Menderita diabetes dalam jangka waktu panjang dapat mengakibatkan berbagai macam komplikasi salah satu di antaranya adalah Retinopati diabetik. Retinopati diabetik adalah kelainan pada bagian mata yang disebabkan oleh adanya kerusakan dan penyumbatan pada pembuluh darah di bagian belakang mata (retina). Pada penelitian kali ini akan di gunakan data retinopati diabetik dengan menggunakan metode seleksi fitur Recursive Feature Elimination (RFE) dan Chi-Square dan akan di klasifikasi menggunakan Support Vector Machine.

Diabetic retinopathy is a complication of diabetes: an eye disease that can cause blindness. It results from damage to the retina following long-term diabetes mellitus. Research on diabetic patients usually relies on image data. This paper discusses diabetic retinopathy data processed with feature selection. In this study, we use data from diabetic patients, from which features are extracted with a feature selection method. The feature selection methods used are Recursive Feature Elimination (RFE) and Chi-Square. Classification of diabetic retinopathy is performed with a Support Vector Machine (SVM). In experiments with various hyperparameter settings, the classification model achieves an accuracy between 97% and 100% for both methods."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2019

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
