MCMSTClustering: defining non-spherical clusters by using minimum spanning tree over KD-tree-based micro-clusters

[ X ]

Tarih

2023

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Springer London Ltd

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Clustering is a technique for statistical data analysis and is widely used in many areas where class labels are not available. Major problems related to clustering algorithms are handling high-dimensional, imbalanced, and/or varying-density datasets, detecting outliers, and defining arbitrary-shaped clusters. In this study, we proposed a novel clustering algorithm named as MCMSTClustering (Defining Non-Spherical Clusters by using Minimum Spanning Tree over KD-Tree-based Micro-Clusters) to overcome mentioned issues simultaneously. Our algorithm consists of three parts. The first part is defining micro-clusters using the KD-Tree data structure with range search. The second part is constructing macro-clusters by using minimum spanning tree (MST) on defined micro-clusters, and the final part is regulating defined clusters to increase the accuracy of the algorithm. To state the efficiency of our algorithm, we performed some experimental studies on some state-of-the-art algorithms. The findings were presented in detail with tables and graphs. The success of the proposed algorithm using various performance evaluation criteria was confirmed. According to the experimental studies, MCMSTClustering outperformed competitor algorithms in aspects of clustering quality in acceptable run-time. Besides, the obtained results showed that the novel algorithm can be applied effectively in solving many different clustering problems in the literature.

Açıklama

Anahtar Kelimeler

Clustering, Kd-Tree, Minimum spanning tree, Micro-cluster, Arbitrary-shaped clusters

Kaynak

Neural Computing & Applications

WoS Q Değeri

Q2

Scopus Q Değeri

Q1

Cilt

35

Sayı

18

Künye