Journal article Open Access
Erdinc, Berfin; Kaya, Mahmut; Senol, Ali
<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
<leader>00000nam##2200000uu#4500</leader>
<datafield tag="909" ind1="C" ind2="O">
<subfield code="p">user-tubitak-destekli-proje-yayinlari</subfield>
<subfield code="o">oai:aperta.ulakbim.gov.tr:279365</subfield>
</datafield>
<datafield tag="520" ind1=" " ind2=" ">
<subfield code="a"><p>Stream clustering has emerged as a vital area for processing streaming data in real time, facilitating the extraction of meaningful information. While efficient approaches for defining and updating clusters based on similarity criteria have been proposed, outliers and noisy data in the stream pose a significant threat to the overall performance of clustering algorithms. Moreover, existing methods are limited in their ability to generate non-spherical clusters, which underscores the need for improved clustering quality. We propose a new stream clustering approach, MCMSTStream, to overcome these challenges. The algorithm defines micro-clusters using the KD-Tree data structure and then applies a minimum spanning tree (MST) to those micro-clusters to define macro-clusters. MCMSTStream is robust against outliers and noisy data and can define clusters with arbitrary shapes. Furthermore, the proposed algorithm exhibits notable speed and can handle high-dimensional data. The ARI and Purity indices are used to evaluate the clustering quality of MCMSTStream. The evaluation results reveal the superior performance of MCMSTStream compared to state-of-the-art stream clustering algorithms such as DenStream, DBSTREAM, and KD-AR Stream. The proposed method obtained a Purity value of 0.9780 and an ARI value of 0.7509, the highest scores for the KDD dataset. On the other 11 datasets, it obtained much higher results than its competitors. As a result, the proposed method is an effective stream clustering algorithm for datasets with outliers, high dimensionality, and arbitrary-shaped clusters. In addition, its runtime performance is quite reasonable.</p></subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">publication</subfield>
<subfield code="b">article</subfield>
</datafield>
<datafield tag="540" ind1=" " ind2=" ">
<subfield code="a">Creative Commons Attribution</subfield>
<subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
</datafield>
<datafield tag="100" ind1=" " ind2=" ">
<subfield code="a">Erdinc, Berfin</subfield>
<subfield code="u">Siirt Univ, Dept Comp Engn, Siirt, Turkiye</subfield>
</datafield>
<datafield tag="856" ind1="4" ind2=" ">
<subfield code="z">md5:4d97c8f723287fe85e10d886df55a2c7</subfield>
<subfield code="s">212</subfield>
<subfield code="u">https://aperta.ulakbim.gov.tr/record/279365/files/bib-ea59051c-c006-492a-98f6-ecd20ba4cbed.txt</subfield>
</datafield>
<controlfield tag="005">20250417200353.0</controlfield>
<datafield tag="260" ind1=" " ind2=" ">
<subfield code="c">2024-01-01</subfield>
</datafield>
<datafield tag="024" ind1=" " ind2=" ">
<subfield code="a">10.1007/s00521-024-09443-1</subfield>
<subfield code="2">doi</subfield>
</datafield>
<datafield tag="542" ind1=" " ind2=" ">
<subfield code="l">open</subfield>
</datafield>
<datafield tag="245" ind1=" " ind2=" ">
<subfield code="a">MCMSTStream: applying minimum spanning tree to KD-tree-based micro-clusters to define arbitrary-shaped clusters in streaming data</subfield>
</datafield>
<datafield tag="909" ind1="C" ind2="4">
<subfield code="c">18</subfield>
<subfield code="p">NEURAL COMPUTING &amp; APPLICATIONS</subfield>
</datafield>
<datafield tag="650" ind1="1" ind2="7">
<subfield code="a">cc-by</subfield>
<subfield code="2">opendefinition.org</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Kaya, Mahmut</subfield>
<subfield code="u">Firat Univ, Dept Artificial Intelligence &amp; Data Engn, TR-23119 Elazig, Turkiye</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Senol, Ali</subfield>
<subfield code="u">Tarsus Univ, Dept Comp Technol, Mersin, Turkiye</subfield>
</datafield>
<controlfield tag="001">279365</controlfield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">user-tubitak-destekli-proje-yayinlari</subfield>
</datafield>
</record>
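The abstract in field 520 says that macro-clusters are obtained by applying a minimum spanning tree to micro-clusters. The following is a minimal sketch of that general idea only, not the authors' implementation. It assumes the micro-cluster centers are already available (the KD-Tree step is omitted), and the function names and the `max_edge` cut-off are hypothetical, introduced here purely for illustration.

```python
import math
from typing import List, Sequence, Tuple

Point = Sequence[float]


def mst_edges(centers: List[Point]) -> List[Tuple[float, int, int]]:
    """Prim's algorithm on the complete Euclidean graph over the centers.

    Returns the MST as (length, u, v) tuples.
    """
    n = len(centers)
    if n == 0:
        return []
    in_tree = [False] * n
    best = [math.inf] * n   # cheapest known distance from the tree to each node
    parent = [-1] * n
    best[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the node outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=best.__getitem__)
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((best[u], parent[u], u))
        # Relax distances of the remaining nodes through u.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(centers[u], centers[v])
                if d < best[v]:
                    best[v], parent[v] = d, u
    return edges


def macro_clusters(centers: List[Point], max_edge: float) -> List[int]:
    """Cut MST edges longer than max_edge (hypothetical threshold) and
    label each remaining connected component as one macro-cluster."""
    n = len(centers)
    root = list(range(n))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while root[i] != i:
            root[i] = root[root[i]]
            i = root[i]
        return i

    for d, u, v in mst_edges(centers):
        if d <= max_edge:
            root[find(u)] = find(v)

    labels = {}
    return [labels.setdefault(find(i), len(labels)) for i in range(n)]


if __name__ == "__main__":
    # Two well-separated groups of hypothetical micro-cluster centers.
    centers = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 10.0), (11.0, 10.0)]
    print(macro_clusters(centers, max_edge=1.5))
```

Because components follow chains of short MST edges rather than distance to a single centroid, a grouping of this kind can trace elongated or curved shapes, which is the property the abstract attributes to MCMSTStream.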