Search Results

Found 103306 documents matching the query

Iing Fitria

Penerapan agglomerative clustering untuk menganalisa kekerabatan bakteri pada air liur = Application of agglomerative clustering for analyzing phylogenetically on bacterium of saliva / Iing Fitria

"ABSTRAK

Menganalisis populasi bakteri Streptococcus adalah penting karena spesies ini dapat menyebabkan karies gigi, periodental (plak), halitosis (bau mulut) dan masih banyak lagi masalah yang dapat ditimbulkan. Dalam tesis ini akan dibahas hubungan kekerabatan antara bakteri Streptococcus pada air liur dengan menggunakan pohon filogenetik dari metode agglomerative clustering. Dimulai dengan adanya barisan DNA bakteri Streptococcus yang diambil dari pangkalan data gen (GenBank) yang akan disejajarkan, proses pensejajaran yang dilakukan menggunakan Algoritma Needleman-Wuncsh untuk pensejajaran global. Hasil pensejajaran tersebut berupa skor optimal yang merupakan jarak antara dua barisan DNA bakteri Streptococcus. Skor-skor optimal dikumpulkan dalam satu matriks kemudian membuat pohon filogenetik dengan metode agglomerative clustering yang terdiri atas teknik single linkage,complete linkage dan average linkage. Pada setiap teknik, banyaknya kelompok sama dengan banyaknya individu spesies. Spesies yang paling mirip dikelompokkan sampai akhirnya kemiripan berkurang maka terbentuk kelompok tunggal. Hasil dari pengelompokan berupa pohon filogenetik dan cabang-cabang yang bergabung merupakan tingkatan jarak yang terbentuk. Semakin kecil jarak, maka semakin besar kemiripan spesies serta mengimplementasikannya dengan menggunakan perangkat lunak berbasis open source (Oktave).

ABSTRACT

Analyzing populations of Streptococcus bacteria is important because these species can cause dental caries, periodontal disease (plaque), halitosis (bad breath), and many other problems. This thesis discusses the phylogenetic relationships among Streptococcus bacteria in saliva using a phylogenetic tree built with agglomerative clustering. Streptococcus DNA sequences obtained from GenBank are aligned with the Needleman-Wunsch algorithm for global alignment. Each alignment yields an optimal score, which serves as the distance between two Streptococcus DNA sequences. The optimal scores are collected in a single matrix, and a phylogenetic tree is then built by agglomerative clustering using the single linkage, complete linkage, and average linkage techniques. In each technique, the number of groups initially equals the number of individual species. The most similar species are merged first, and merging continues at decreasing similarity until a single group is formed. The result of the grouping is a phylogenetic tree whose branch joins indicate the distance level at which groups merge: the smaller the distance, the greater the similarity between species. The method is implemented in Octave, an open source program."

2013

T35950

UI - Tesis Membership Universitas Indonesia Library
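
The pipeline summarized in the abstract above (pairwise global alignment, a distance matrix, then agglomerative merging) can be sketched roughly as follows. This is only an illustration and not the thesis's Octave implementation: the cost scheme (match 0, mismatch 1, gap 1), the toy sequences, and the function names `nw_distance` and `agglomerate` are assumptions made here so that the optimal alignment score can be read directly as a distance.

```python
from itertools import combinations


def nw_distance(a, b, mismatch=1, gap=1):
    """Needleman-Wunsch global alignment that minimizes total cost.

    Assumed costs: match 0, mismatch 1, gap 1, so the optimal score
    can be used directly as a distance between the two sequences.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        cur = [i * gap]
        for j in range(1, len(b) + 1):
            sub = prev[j - 1] + (0 if a[i - 1] == b[j - 1] else mismatch)
            cur.append(min(sub, prev[j] + gap, cur[j - 1] + gap))
        prev = cur
    return prev[-1]


def average(values):
    """Average linkage helper: mean of all pairwise distances."""
    values = list(values)
    return sum(values) / len(values)


def agglomerate(names, dist, linkage=min):
    """Start with one cluster per item and merge the closest pair each step.

    linkage=min is single linkage, max is complete linkage,
    average (above) is average linkage. Returns the merge history.
    """
    clusters = [(n,) for n in names]
    merges = []
    while len(clusters) > 1:
        def d(c1, c2):
            return linkage(dist[x][y] for x in c1 for y in c2)

        c1, c2 = min(combinations(clusters, 2), key=lambda pair: d(*pair))
        merges.append((c1, c2, d(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


# Hypothetical toy sequences, not GenBank data.
seqs = {"s1": "GATTACA", "s2": "GATTATA", "s3": "CCGGTTAA"}
names = list(seqs)
dist = {x: {y: nw_distance(seqs[x], seqs[y]) for y in names} for x in names}
merges = agglomerate(names, dist)  # single linkage by default
for left, right, height in merges:
    print(left, right, height)
```

Each entry in `merges` records the two clusters joined and the distance at which they joined, which is the information a dendrogram (phylogenetic tree) is drawn from. Passing `linkage=max` or `linkage=average` switches to complete or average linkage.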

Erica

Implementasi agglomerative clustering dan genetic algorithm pada optimisasi portofolio = Agglomerative clustering and genetic algorithm in portfolio optimization

"Diversifikasi portofolio telah dijadikan solusi dalam memitigasi risiko dalam berinvestasi. Tujuan utama diversifikasi portofolio adalah untuk mengurangi variansi return dibandingkan dengan investasi pada satu saham tertentu. Metode Clustering, misalnya Agglomerative Clustering, digunakan untuk mengelompokkan saham-saham ke dalam masing-masing klaster yang homogen berdasarkan risiko. Klaster-klaster yang terbentuk kemudian akan digunakan sebagai acuan diversifikasi portofolio. Objek yang digunakan dalam metode clustering adalah 7 skor rasio finansial PER, EPS, PEG, DER, ROE, Current Ratio dan Profit Margin dari setiap saham. Selanjutnya, proporsi dari setiap saham pembentuk portofolio ditentukan melalui aplikasi Genetic Algorithm ke masing-masing klaster.

Pada penelitian ini, metode Genetic Algorithm dibangun berdasarkan model MVCCPO sehingga membentuk metode Genetic Algorithm Constrained. Performa dari Agglomerative Clustering Genetic Algorithm Constrained yang dievaluasi menggunakan data aktual, menghasilkan portofolio yang mampu mengalahkan return portofolio pasar dan memiliki rata-rata return yang lebih besar dibandingkan dengan portofolio yang dikonstruksi dengan metode Genetic Algorithm saja. Namun, dengan hubungan linear antara risiko dan return, adalah masuk akal bahwa portofolio dengan return yang lebih besar akan memiliki risiko yang lebih besar pula.

The purpose of portfolio diversification is to reduce the variance of returns compared with a single-stock investment or an undiversified portfolio. The primary motivation of this research is to investigate portfolio selection strategies based on clustering and a genetic algorithm. Clustering groups assets with similar scores on seven financial ratios: EPS, PER, PEG, ROE, DER, Current Ratio, and Profit Margin. With a clustering method such as Agglomerative Clustering, stocks with similar risk profiles are clustered together, and the resulting clusters can be used to diversify the portfolio. A Genetic Algorithm is then applied to each resulting cluster to obtain the optimal proportion of each stock in the portfolio.
The Genetic Algorithm used in this study is built on the MVCCPO model, making it a Constrained Genetic Algorithm. The performance of the Constrained Genetic Algorithm combined with Agglomerative Clustering, evaluated on actual datasets, gives a portfolio that beats the market portfolio's return and has a higher average return than a portfolio constructed with the Genetic Algorithm alone. However, given the direct relationship between risk and return, it is reasonable to expect that a portfolio with a higher return also carries higher risk."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2017

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Asita Darma Irawati

Analisis Pengelompokan Universitas Terbaik Dunia Berdasarkan Komponen Biaya Pendidikan Program Doktor (S3) menggunakan metode K-Means = Clustering Analysis of The World’s Top Universities Based on Tuition Fee Components for Doctoral Program (S3) using K-Means method

"Pertimbangan finansial menjadi salah satu penentu utama apakah seseorang akan melanjutkan pendidikan ke tingkat yang lebih tinggi atau tidak, sehingga diperlukan beasiswa untuk membantu mahasiswa dalam menempuh pendidikan tinggi, terutama hingga tingkat doktor. Besar biaya yang dikeluarkan oleh lembaga penyedia beasiswa kepada penerima beasiswa tentunya diharapkan sepadan dengan kualitas ilmu yang diperoleh. Oleh karena itu, penelitian ini bertujuan untuk membahas analisis pengelompokan universitas terbaik dunia berdasarkan komponen biaya pendidikan program doktor dengan metode K-Means. Universitas pada penelitian ini diambil dari QS World University Rankings (WUR) 2022. Analisis eksploratori data dilakukan dan diperoleh bahwa terdapat 83 dari 472 universitas di dunia memberi bantuan dana penuh untuk studi program doktor. Nilai Silhouette sebesar 0,72 menunjukkan bahwa tiga merupakan jumlah kelompok yang optimal bagi data. Sehingga terbentuk kelompok A sebanyak 328 universitas, kelompok B sebanyak 108 universitas, dan kelompok C sebanyak 36 universitas. Kelompok A terdiri dari universitas dengan SPP dan biaya hidup per bulan relatif rendah, kelompok B sedang, dan kelompok C tinggi. Untuk biaya transportasi udara, kelompok B cenderung rendah, sedangkan kelompok A dan C relatif serupa dan lebih mahal dari kelompok B. Sementara untuk biaya visa, kelompok A cenderung lebih murah, sedangkan kelompok B dan C cenderung serupa dengan biaya lebih mahal. Berdasarkan analisis ini, penulis memberikan saran universitas yang bisa dipertimbangkan lembaga pemberi beasiswa sebagai perguruan tinggi tujuan.

Financial considerations are one of the main factors determining whether an individual pursues higher education, so scholarships are needed to help students complete their education, especially up to the doctoral level. The amount spent by a scholarship provider on an awardee is expected to be commensurate with the quality of education obtained. This study therefore performs a clustering analysis of the world’s top universities based on the cost components of doctoral programs using the K-Means method. The universities in this study are taken from the QS World University Rankings 2022. Exploratory data analysis shows that 83 of the 472 universities offer fully funded doctoral programs. A silhouette value of 0.72 indicates that three is the optimal number of clusters for the data. Groups A, B, and C consist of 328, 108, and 36 universities, respectively. Group A consists of universities with relatively low tuition fees and monthly living costs, Group B moderate, and Group C high. For air transportation costs, Group B tends to be cheaper, while Groups A and C are quite similar and more expensive than Group B. For visa costs, Group A tends to be cheaper, while Groups B and C are similar and more expensive. Based on these results, universities are recommended for scholarship providers to consider as destinations for doctoral study."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
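
The abstract above chooses the number of clusters from a silhouette value. As a rough illustration of how a mean silhouette can be computed for a given labeling, here is a minimal pure-Python sketch. The function name `silhouette_score`, the toy points, and the use of Euclidean distance are assumptions made for this example; the thesis does not state its implementation details.

```python
from math import dist


def silhouette_score(points, labels):
    """Mean silhouette over all points (assumes at least two clusters).

    For each point: a = mean distance to the other members of its own
    cluster, b = smallest mean distance to any other cluster, and the
    silhouette is (b - a) / max(a, b). Points in singleton clusters
    contribute 0 by convention. The toy implementation does not guard
    against max(a, b) == 0 (all points identical).
    """
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)

    total = 0.0
    for point, label in zip(points, labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # singleton cluster: silhouette taken as 0
        # dist(point, point) == 0, so dividing by len(own) - 1 excludes it.
        a = sum(dist(point, q) for q in own) / (len(own) - 1)
        b = min(
            sum(dist(point, q) for q in other) / len(other)
            for key, other in clusters.items()
            if key != label
        )
        total += (b - a) / max(a, b)
    return total / len(points)


# Hypothetical toy data: two well-separated pairs of points.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
good = silhouette_score(points, [0, 0, 1, 1])
bad = silhouette_score(points, [0, 1, 0, 1])
print(round(good, 3), round(bad, 3))
```

In practice, the score would be computed for several candidate values of k after running K-Means, and the k with the highest mean silhouette would be kept. Values close to 1 indicate compact, well-separated clusters.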

Thasya Dwiayu Maydina

Implementasi agglomerative clustering dan modified artificial bee colony algorithm pada optimisasi portofolio dengan kendala possibilistic = Implementation of agglomerative clustering and modified artificial bee colony algorithm on stock portfolio optimization with possibilistic constraints

"Optimisasi portofolio adalah masalah fundamental pada lingkungan keuangan, dimana investor membentuk portofolio yang sesuai dengan yang diharapkan dengan mendapatkan return optimal dan risiko minimal. Pada skripsi ini, membahas masalah optimisasi portofolio dengan kendala di bidang keuangan seperti biaya transaksi, kardinalitas, dan kuantitas dibawah asumsi bahwa return dari aset berisiko adalah bilangan fuzzy. Karena hal tersebut, digunakan model optimisasi portofolio yaitu, mixed integer model nonlinear programming problem. Pertama, data saham di diversifikasi berdasarkan 7 skor rasio finansial EPS, PER, PEG, ROE, DER, Current Ratio dan Profit Margin dengan Agglomerative Clustering untuk menghasikan klaster yang homogen berdasarkan risiko. Selanjutnya, setiap klaster dicari proporsi dalam portofolio dengan menggunakan algoritme heuristik yaitu modified artificial bee colony MABC algorithm, dimana pada algoritme tersebut terdapat proses inisialisasi populasi yang dibangun berdasarkan pendekatan chaotic initialization. Pada akhirnya, return yang dihasilkan dibandingkan dengan S P 500 index return 12,34 dan Sharpe ratio 2,7 . Hasil dari performa Agglomerative Clustering Modified Artificial Bee Colony Algoritm yang dievaluasi menggunakan data aktual, menghasilkan nilai tertinggi dari rata-rata return sebesar 29,96 dan Sharpe ratio sebesar 17,562.

Portfolio optimization is a fundamental problem in finance, in which investors form a satisfactory portfolio by seeking optimal return and minimal risk. This undergraduate thesis discusses the portfolio optimization problem with real-world constraints such as transaction costs, cardinality, and quantity, under the assumption that the returns of risky assets are fuzzy numbers. The problem is therefore modeled as a mixed-integer nonlinear programming problem. First, stocks are clustered with Agglomerative Clustering based on seven financial ratio scores (EPS, PER, PEG, ROE, DER, Current Ratio, and Profit Margin) to produce clusters that are homogeneous in risk. Next, the weight of each stock in the portfolio is determined using a heuristic, the modified artificial bee colony (MABC) algorithm, whose population initialization is built on a chaotic initialization approach. Finally, the return obtained is compared with the S&P 500 index, which has a return of 12.34 and a Sharpe ratio of 2.7. Evaluated on actual data, Agglomerative Clustering with the Modified Artificial Bee Colony Algorithm achieves a highest average return of 29.96 and a Sharpe ratio of 17.562."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2018

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Banjarnahor, Evander

Analisis Kekerabatan pada Barisan DNA SARS-Cov-2 Berdasarkan Pembentukan Pohon Filogenetik dengan Metode Hierarchical dan K-Means Clustering Menggunakan Multiple Encoding Vector dan K-Mer = Implementation of Hierarchical and K-Means Clustering Methods Using Multiple Encoding Vector in Analyzing Kinship in SARS-Cov-2 DNA Sequences

"Berdasarkan data WHO pada pertengahan Juli 2021 lebih dari 185,2 juta orang di seluruh dunia terinfeksi virus corona atau Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Virus ini menyerang penapasan manusia yang dapat mengakibatkan infeksi paru-paru pada manusia dan bahkan dapat menyebabkan kematian. Tercatat bahwa lebih dari 4 juta orang di seluruh dunia meninggal akibat terinfeksi virus corona. Di Indonesia sendiri pada pertengahan Juli 2021 tercatat lebih dari 2,4 juta orang ternfeksi virus corona dan lebih dari 65,4 ribu orang meninggal akibat terinfeksi virus corona. Berdasarkan data tersebut, perlu dilakukan analisis kekerabatan virus SARS-CoV-2 untuk mengurangi penyebaran dan memberikan batasan sosial dari negara satu dengan negara lainnya. Identifikasi kekerabatan dari virus covid-19 dan penyebarannya dapat dilakukan dengan cara pembentukan pohon filogenetik dan clustering. Pada penelitian ini pohon filogenetik akan dibangun berdasarkan metode Hierarchical Clustering dengan menggunakan metode Multiple Encoding Vector dan K-Mer berdasarkan translasi DNA kodon menjadi asam amino. Jarak Euclidean akan digunakan untuk menentukan matriks jarak. Penelitian ini selanjutnya menggunakan metode K- Means Clustering untuk melihat penyebarannya, dimana nilai k ditentukan dari jumlah centroid yang dihasilkan dari metode Hierarchical Clustering. Penelitian ini mengambil sampel barisan DNA SARS-CoV-2 dari beberapa negara yang tertular. Dari hasil simulasi, nenek moyang SARS-CoV-2 berasal dari China. Hasil analisis juga menunjukkan bahwa leluhur covid-19 yang paling dekat dengan Indonesia berasal dari India, Australia dan Spanyol. Selain itu dari hasil simulasi dihasilkan bahwa barisan DNA SARS-CoV-2 terdiri dari 9 cluster dan cluster keenam adalah kelompok yang memiliki anggota paling banyak. Hasil analisis juga menunjukkan bahwa metode ini sangat opitimal dalam pengelompokan data dengan nilai 97.4%.

According to WHO data from mid-July 2021, the coronavirus, or Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), had infected more than 185.2 million people worldwide. The virus attacks the human respiratory system, can cause lung infections, and can even cause death. More than 4 million people worldwide have died from coronavirus infection. In Indonesia alone, by mid-July 2021 more than 2.4 million people had been infected and more than 65.4 thousand had died. Given these data, a kinship analysis of SARS-CoV-2 is needed to help reduce its spread. The kinship of the COVID-19 virus and its spread can be identified by building a phylogenetic tree and by clustering. This study encodes sequences with the Multiple Encoding Vector method and K-mer, based on the translation of DNA codons into amino acids, and uses Euclidean distance to build the distance matrix. Hierarchical Clustering is used to build the phylogenetic tree and to determine the number of initial centroids and clusters, which are then used by K-Means Clustering to examine the spread of SARS-CoV-2 DNA sequences. The study samples SARS-CoV-2 DNA sequences from several infected countries. The simulation results indicate that the ancestor of SARS-CoV-2 came from China and that the closest ancestors of the virus found in Indonesia came from India, Australia, and Spain. The SARS-CoV-2 DNA sequences form 9 clusters, of which the sixth has the most members. The results also show that the method groups the data very well, with a reported value of 97.4%."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

T-pdf

UI - Tesis Membership Universitas Indonesia Library
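
The abstract above turns sequences into numeric vectors and compares them with Euclidean distance. The sketch below shows only the plain k-mer counting part of that idea; it does not reproduce the Multiple Encoding Vector method or the codon-to-amino-acid translation used in the thesis. The function names, the choice k = 2, and the toy sequences are assumptions for illustration.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_vector(seq, k=2, alphabet="ACGT"):
    """Count every length-k substring of seq, in a fixed k-mer order."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(kmer)] for kmer in product(alphabet, repeat=k)]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Hypothetical toy sequences, not real SARS-CoV-2 data.
seqs = {"x": "ACGTACGT", "y": "ACGTACGA", "z": "TTTTGGGG"}
vectors = {name: kmer_vector(s) for name, s in seqs.items()}
names = list(seqs)
distance_matrix = [
    [euclidean(vectors[a], vectors[b]) for b in names] for a in names
]
for row in distance_matrix:
    print([round(d, 2) for d in row])
```

A distance matrix of this form is the input that hierarchical clustering needs, and the vectors themselves can be passed to K-Means.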

Penerapan Metode Clustering Large Applications (CLARA) untuk Optimasi Matching Problem Pada Ridesharing dengan Memaksimumkan Total Adjusted Distance Proximity (ADP) Index = Application of Clustering Large Applications (CLARA) Method for Matching Problem Optimization on Ridesharing with Maximizing Total Adjusted Distance Proximity (ADP) Index

"Ridesharing adalah suatu model transportasi dimana seorang pengendara (driver) berbagi tumpangan dengan penumpang lain (rider) yang memiliki lokasi asal tujuan dan jadwal perjalanan yang hampir sama atau sama dengan driver. Masalah utama dalam ridesharing yaitu menentukan pasangan driver dan rider yang paling optimal untuk melakukan ridesharing. Sebagai proses awal untuk mencari pasangan tersebut, dalam penelitian ini digunakan metode Clustering Large Applications (CLARA) untuk melakukan clustering terhadap titik-titik koordinat origin dan destination dari masing-masing driver dan rider. Sesuai dengan cluster dari masing-masing titik lokasi, dibentuk himpunan yang mungkin untuk berpasangan. Selanjutnya, untuk driver dan rider yang berada dalam himpunan yang sama diperiksa kelayakan (feasibility) untuk berpasangan berdasarkan kendala waktu, sehingga dihasilkan himpunan pasangan driver dan rider yang feasible. Pada proses optimasi, fungsi obyektif yang dipertimbangkan yaitu memaksimumkan total Adjusted Distance Proximity (ADP) Index. ADP Index merupakan indikasi terjadinya efisiensi berdasarkan jarak tempuh dari tiap pasangan dalam melakukan ridesharing. Sesuai dengan hasil ADP index, dibentuk suatu weighted bipartite graph yang menggambarkan keterhubungan pasangan dalam himpunan feasible tersebut. Proses mendapatkan himpunan pasangan yang memaksimumkan fungsi objektif diselesaikan dengan algoritma Hungarian dalam menyelesaikan Maximum-Weighted Matching pada Bipartite Graph. Setelah seluruh proses dilakukan, dihasilkan himpunan pasangan yang akan melakukan ridesharing. Pada penelitian ini ditunjukkan bahwa dengan melakukan clustering, proses optimasi menjadi lebih efisien dengan adanya pereduksian pasangan yang diuji dari 400 pasangan menjadi 131 pasangan dan banyak pasangan dalam hasil optimasi dengan clustering sama dengan hasil optimasi tanpa clustering yaitu 13 pasangan.

Ridesharing is a transportation model in which a driver shares a ride with a rider whose origin, destination, and travel schedule are the same as or similar to the driver's. The main problem in ridesharing is determining the optimal pairs of drivers and riders. As an initial step in finding these pairs, this study uses the Clustering Large Applications (CLARA) method to cluster the origin and destination coordinates of each driver and rider. Based on the cluster of each location point, sets of possible matches are formed. Drivers and riders in the same set are then checked for feasibility of matching under time constraints, which produces a set of feasible driver-rider pairs. In the optimization process, the objective function is to maximize the total Adjusted Distance Proximity (ADP) Index. The ADP Index indicates the efficiency gained, based on the distance traveled by each pair in ridesharing. From the ADP Index values, a weighted bipartite graph is formed that describes the connections between the pairs in the feasible set. The set of pairs that maximizes the objective function is obtained with the Hungarian algorithm, which solves the Maximum-Weighted Matching problem on a bipartite graph. The result of the whole process is the set of pairs that will share rides. The study shows that clustering makes the optimization more efficient: the number of pairs examined is reduced from 400 to 131, while the number of matched pairs in the optimization result is the same with and without clustering, namely 13."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2020

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Gan, Guojun

Data clustering: theory, algorithms, and applications

"Cluster analysis is an unsupervised process that divides a set of objects into homogeneous groups. This book starts with basic information on cluster analysis, including the classification of data and the corresponding similarity measures, followed by the presentation of over 50 clustering algorithms in groups according to some specific baseline methodologies such as hierarchical, center-based, and search-based methods. As a result, readers and users can easily identify an appropriate algorithm for their applications and compare novel ideas with existing results.

The book also provides examples of clustering applications to illustrate the advantages and shortcomings of different clustering architectures and algorithms. Application areas include pattern recognition, artificial intelligence, information technology, image processing, biology, psychology, and marketing. Readers also learn how to perform cluster analysis with the C/C++ and MATLAB programming languages."

Philadelphia: Society for Industrial and Applied Mathematics, 2007

e20448780

eBooks Universitas Indonesia Library

Alfia Choirun Nisa

Perbandingan antara Hasil Biclustering dan Hasil Cluster-Biplot pada Data 119 Kabupaten/Kota di Pulau Jawa Tahun 2022 Berdasarkan Indikator Kesejahteraan Rakyat = Comparison between Biclustering Results and Cluster-Biplot Results on Data from 119 Regencies/Cities on Java Island in 2022 Based on People's Welfare Indicators

"Keberhasilan pembangunan suatu negara dapat dilihat dari kondisi kesejahteraan rakyatnya. Peningkatan kesejahteraan rakyat menjadi sasaran utama dalam kegiatan pembangunan yang dilaksanakan oleh pemerintah. Agar pembangunan yang dilakukan efektif dan tepat sasaran, perlu dilakukan pengelompokan untuk mengetahui karakteristik wilayah. Penelitian ini membahas mengenai pengelompokan kabupaten/kota di Pulau Jawa berdasarkan indikator kesejahteraan rakyat tahun 2022. Kesejahteraan yang diukur merupakan kesejahteraan materi. Variabel yang digunakan dalam penelitian ini adalah persentase penduduk miskin, PDRB per kapita atas dasar harga berlaku, rata-rata lama sekolah, harapan lama sekolah, persentase pengeluaran per kapita untuk makanan, tingkat pengangguran terbuka, jumlah penduduk, kepadatan penduduk, dan angka harapan hidup. Terdapat dua pendekatan yang digunakan dalam mengelompokkan kabupaten/kota beserta variabel-variabelnya. Pendekatan pertama adalah mengelompokkan kabupaten/kota dan variabel-variabelnya secara simultan dengan menggunakan metode biclustering plaid model. Pendekatan kedua adalah mengelompokkan kabupaten/kota menggunakan clustering metode Ward dan dilanjutkan dengan metode biplot. Tujuan penelitian ini adalah membandingkan hasil kedua pendekatan tersebut, yaitu hasil biclustering dan hasil cluster-biplot pada data 119 kabupaten/kota di Pulau Jawa pada tahun 2022 berdasarkan indikator kesejahteraan rakyat. Berdasarkan hasil penelitian, didapatkan jumlah kelompok dari kedua pendekatan tersebut adalah sebanyak 2 dengan kelompok 1 merupakan wilayah yang lebih sejahtera daripada kelompok 2. Ditinjau dari nilai standar deviasinya, kelompok hasil biclustering plaid model memiliki nilai standar deviasi yang lebih kecil dibanding kelompok hasil cluster-biplot. Dengan demikian, secara umum pendekatan pertama menghasilkan kelompok yang lebih baik karena lebih homogen dibandingkan dengan pendekatan kedua.

The success of a country's development can be known from the well-being of its people. Improving the welfare of the population is the main goal in the development activities carried out by government. To ensure that development is effective and targeted, grouping is needed to understand the characteristics of the region. This study discusses the grouping of regencies/cities in Java based on the people's welfare indicators in 2022. The measured welfare is material well-being. Variables used in this study are the percentage of the poor population, GDP per capita at current prices, average length of schooling, expected length of schooling, percentage of per capita expenditure on food, open unemployment rate, population, population density, and life expectancy. There are two approaches used in grouping regencies/cities along with their variables. The first approach is to group regencies/cities and their variables simultaneously using plaid model biclustering method. The second approach is to group regencies/cities using the Ward clustering method and then followed by the biplot method. The aim of this study is to compare the results of these two approaches, namely the biclustering results and the cluster-biplot results on data from 119 regencies/cities in Java in 2022 based on people's welfare indicators. Based on the results of this study, the number of groups from each approach is 2, with group 1 being more prosperous than group 2. Judging from the standard deviation values, the plaid model biclustering result groups have lower standard deviation values compared to the cluster-biplot result groups. Therefore, in general the first approach produces better groups as they are more homogeneous compared to the second approach."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Ja`far Hilmy Farhan

Analisa pengaruh perubahan pada input k-means clustering terhadap jumlah cluster optimal di kasus fet-pet = Analysis of the effect of changes in the k-means clustering input on the optimal cluster number in the fet-pet case

"Identifikasi suatu letak atau lokasi tumor beserta jaringan-jaringan sehat di sekitarnya secara akurat, merupakan salah satu langkah dalam melakukan perencanaan radioterapi atau disebut juga dengan istilah treatment planning. Setelah lokasinya ditentukan dan dipastikan, dokter akan mencoba membuat perencanaan berupa dosis radiasi yang dibutuhkan untuk diberikan kepada pasien sehingga dapat memberikan kerusakan yang maksimal pada tumor tanpa memberikan dampak negatif pada organ-organ sehat di sekitarnya. Salah satu solusi dalam melakukan tugas tersebut adalah berupa segmentasi otomatis suatu citra PET. Segmentasi otomatis ini tidak hanya memakan waktu yang cukup singkat, tetapi juga mempertimbangkan seluruh nilai time-activity curve (TAC) yang ada pada citra. Salah satu metode untuk menentukan lokasi tumor adalah dengan cara segmentasi otomatis menggunakan algoritma clustering k-means. Berdasarkan penelitian dari (Abualhaj, 2017), clustering k-means memberikan hasil segmentasi yang sangat baik dalam melalukan pendeteksian lokasi tumor. Namun, algoritma yang digunakan tidak sepenuhnya otomatis karena perlunya input parameter nilai yang diterapkan oleh pengguna. Penelitian ini bertujuan mengidentifikasi pengaruh error terhadap input dari algoritma k-means clustering yaitu apabila terjadi kesalahan dalam memasukkan nilai-nilai input tersebut. Terdapat lima paremeter yang akan diuji dengan masing-masing lima variasi masukan. Parameter tersebut masing-masing adalah maximum number of cluster, maximum iteration, maximum repetition time, total counts, dan random counts. Untuk semua parameter, hasil jumlah cluster optimalnya tidak berubah dan tidak dipengaruhi oleh variasi dari parameter tersebut yaitu berjumlah 4 cluster kecuali pada parameter pertama apabila nilainya di bawah 4. Pada parameter pertama, hasil bentuk segmentasi beserta kurva TAC nya berubah saat nilai parameternya di bawah 4. 
Adapun jika nilainya di atas 4, berdasarkan yang diteliti penulis, tidak ada perbedaan pada bentuk segmentasi maupun jumlah optimal clusternya. Pada parameter kedua, bentuk segmentasinya terlihat ada sedikit perbedaan beserta kurva TAC saat nilai parameternya diperkecil. Parameter ketiga memiliki hasil yang mirip dengan parameter kedua yang mana hasil segmentasinya memiliki sedikit perbedaan saat nilainya diperkecil begitu pula dengan kurva TAC nya. Parameter ketiga dan keempat memiliki hasil yang mirip ketika nilainya diperkecil maupun diperbesar yakni terlihat adanya sedikit perbedaan pada hasil segmentasinya.

Accurately identifying the location of a tumor and the surrounding healthy tissue is one of the steps in radiotherapy planning, also known as treatment planning. Once the location is determined and confirmed, the physician plans the radiation dose to be delivered to the patient so as to cause maximum damage to the tumor without harming the surrounding healthy organs. One solution for this task is automatic segmentation of a PET image. Automatic segmentation not only takes relatively little time but also considers all the time-activity curve (TAC) values in the image. One way to locate the tumor is automatic segmentation using the k-means clustering algorithm. According to Abualhaj (2017), k-means clustering gives excellent segmentation results in detecting tumor locations. However, the algorithm is not fully automatic, because it requires parameter values supplied by the user. This study aims to identify the effect of errors in the inputs of the k-means clustering algorithm, that is, of entering incorrect input values. Five parameters are tested, each with five input variations: maximum number of clusters, maximum iteration, maximum repetition time, total counts, and random counts. For all parameters, the optimal number of clusters stays at 4 regardless of the variation, except for the first parameter when its value is below 4. For the first parameter, the segmentation shape and the TAC curves change when the value is below 4. When the value is above 4, the author found no difference in the segmentation shape or the optimal number of clusters.
For the second parameter, the segmentation shape and the TAC curves show slight differences when the value is reduced. The third parameter behaves similarly to the second: the segmentation results and TAC curves differ slightly when the value is reduced. The third and fourth parameters give similar results whether the value is reduced or increased, with slight differences in the segmentation results."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Evan Haryowidyatna

Analisis Pengelompokan Kabupaten dan Kota di Pulau Jawa Sebagai Sasaran Industri Sepeda Motor dengan Metode Partitional Hard Clustering = Clustering Analysis of Districts and Cities in The Island of Java as Targets of Motorcycle Industry Using Partitional Hard Clustering Method

"As of 9 February 2023, motorcycles accounted for 87% of all private vehicles in Indonesia. Motorcycle distribution in Indonesia is densest on the island of Java, at 60%. This high motorcycle population, together with the fact that 80% of households in Java already own a motorcycle, is shrinking the motorcycle market. In the long run, this could harm an industry that wants to keep growing. This study clusters the regencies and cities of Java by their demographic characteristics and then, based on the resulting clusters, recommends decisions the motorcycle industry could take, so that manufacturers can focus their products on the regencies and cities with the best potential. Twelve demographic variables are used, divided into three categories: the economic conditions of the population, the living conditions of the population, and the demographic conditions of the region. The dataset was built by scraping data from trusted websites and then examined through Exploratory Data Analysis (EDA). Clustering was then performed with two partitional hard clustering methods, K-Means Clustering and K-Medoids Clustering. The clusters were evaluated with four metrics, the Silhouette Index, Dunn Index, Davies-Bouldin Index, and Calinski-Harabasz Index, to determine the most suitable method. The results show that K-Medoids Clustering with 5 clusters is the best method for grouping the regencies and cities of Java. Once the clusters were formed, each was given recommended decisions for the motorcycle industry. Four recommendations are available: spare parts distribution, workshop establishment, sales of mid- to high-end motorcycles, and sales of mid- to low-end motorcycles."
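
The clustering and evaluation steps named in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the toy points, the function names (`k_medoids`, `silhouette`), the Euclidean distance, and the choice to start from the first k points are all assumptions made for illustration, and only one of the four evaluation indices (Silhouette) is shown.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def assign(points, medoids):
    """Label each point with the index of its nearest medoid."""
    return [
        min(range(len(medoids)), key=lambda c: euclidean(p, points[medoids[c]]))
        for p in points
    ]


def k_medoids(points, k, max_iter=100):
    """Alternating k-medoids (assumption: the first k points seed the medoids)."""
    medoids = list(range(k))
    for _ in range(max_iter):
        labels = assign(points, medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:  # keep the old medoid if a cluster is empty
                new_medoids.append(medoids[c])
                continue
            # The new medoid is the member with the smallest total distance
            # to the other members of its cluster.
            best = min(
                members,
                key=lambda i: sum(euclidean(points[i], points[j]) for j in members),
            )
            new_medoids.append(best)
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, assign(points, medoids)


def silhouette(points, labels):
    """Mean silhouette coefficient; singleton clusters contribute 0."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue
        # a: mean distance to the other members of the point's own cluster
        a = sum(euclidean(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(euclidean(points[i], points[j]) for j in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


# Toy data (hypothetical, not from the thesis): two well-separated groups.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
toy_medoids, toy_labels = k_medoids(toy_points, k=2)
toy_score = silhouette(toy_points, toy_labels)
print(toy_medoids, toy_labels, round(toy_score, 3))
```

In a comparison like the one the abstract describes, the same score would be computed for each candidate method and number of clusters, and the configuration with the better index values would be preferred.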

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
