Incorporating stability and error-based constraints for a novel partitional clustering algorithm

K. Aparna, author

Incorporating stability and error-based constraints for a novel partitional clustering algorithm

Mydhili K. Nair (Faculty of Engineering, Universitas Indonesia, 2016)

Abstrak

Data clustering is one of the major areas in data mining. The bisecting clustering algorithm is one of the most widely used for high dimensional dataset. But its performance degrades as the dimensionality increases. Also, the task of selection of a cluster for further bisection is a challenging one. To overcome these drawbacks, we developed a novel partitional clustering algorithm called a HB-K-Means algorithm (High dimensional Bisecting K-Means). In order to improve the performance of this algorithm, we incorporate two constraints, such as a stability-based measure and a Mean Square Error (MSE) resulting in CHB-K-Means (Constraint-based High dimensional Bisecting K-Means) algorithm. The CHB-K-Means algorithm generates two initial partitions. Subsequently, it calculates the stability and MSE for each partition generated. Inference techniques are applied on the stability and MSE values of the two partitions to select the next partition for the re-clustering process. This process is repeated until K number of clusters is obtained. From the experimental analysis, we infer that an average clustering accuracy of 75% has been achieved. The comparative analysis of the proposed approach with the other traditional algorithms shows an achievement of a higher clustering accuracy rate and an increase in computation time.

Kata Kunci

bisecting k-means

constraints

high dimensionality

mean square error (mse)

partitional clustering

stability

Metadata

No. Panggil :	UI-IJTECH 7:4 (2016)
Entri utama-Nama orang :	K. Aparna, author





Subjek :	Data mining--Indonesia
Penerbitan :	Depok: Faculty of Engineering, Universitas Indonesia, 2016

Sumber Pengatalogan :	LibUI eng rda
ISSN :	20869614
Majalah/Jurnal :	International Journal of Technology
Volume :	Vol. 7, No. 4, April 2016: Hal. 691-700
Tipe Konten :	text
Tipe Media :	unmediated
Tipe Carrier :	volume
Akses Elektronik :	https://doi.org/10.14716/ijtech.v7i4.1579
Institusi Pemilik :	Universitas Indonesia
Lokasi :	Perpustakaan UI, Lantai 4 R. Koleksi Jurnal

Ketersediaan
Ulasan

No. Panggil	No. Barkod	Ketersediaan
UI-IJTECH 7:4 (2016)	08-23-83757297	TERSEDIA

Ulasan:

Tidak ada ulasan pada koleksi ini: 9999920533310

:: Artikel Jurnal :: Kembali

Artikel Jurnal :: Kembali

Incorporating stability and error-based constraints for a novel partitional clustering algorithm

Abstrak

Kata Kunci

Metadata