Journal article Open Access

Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data

Peng, Peter; Addam, Omer; Elzohbi, Mohamad; Ozyer, Sibel T.; Elhajj, Ahmad; Gao, Shang; Liu, Yimin; Ozyer, Tansel; Kaya, Mehmet; Ridley, Mick; Rokne, Jon; Alhajj, Reda


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">user-tubitak-destekli-proje-yayinlari</subfield>
    <subfield code="o">oai:zenodo.org:61525</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">Clustering is an essential research problem which has received considerable attention in the research community for decades. It is a challenge because there is no unique solution that fits all problems and satisfies all applications. We aim to obtain the most appropriate clustering solution for a given application domain. In other words, clustering algorithms in general need prior specification of the number of clusters, and this is hard even for domain experts to estimate, especially in a dynamic environment where the data changes and/or becomes available incrementally. In this paper, we describe and analyze the effectiveness of a robust clustering algorithm which integrates a multi-objective genetic algorithm into a framework capable of producing alternative clustering solutions; it is called Multi-objective K-Means Genetic Algorithm (MOKGA). We investigate its application for clustering a variety of datasets, including microarray gene expression data. The reported results are promising. Though we concentrate on gene expression and mostly cancer data, the proposed approach is general enough and works equally well for clustering other datasets, as demonstrated by the two datasets Iris and Ruspini. After running MOKGA, a Pareto-optimal front is obtained, which gives the optimal number of clusters as a solution set. The achieved clustering results are then analyzed and validated using several cluster validity techniques proposed in the literature. As a result, the optimal clusters are ranked for each validity index. We apply majority voting to decide on the most appropriate set of validity indexes applicable to every tested dataset. The proposed clustering approach is tested by conducting experiments using seven well-cited benchmark datasets. The obtained results are compared with those reported in the literature to demonstrate the applicability and effectiveness of the proposed approach. (C) 2013 Elsevier B.V. All rights reserved.</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Creative Commons Attribution</subfield>
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Peng, Peter</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:00d3ac93eb19596fa41a3e6fcf4f884e</subfield>
    <subfield code="s">329</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.trrecord/61525/files/bib-1fcf7c26-9be4-4d8d-bc82-60774c4274c5.txt</subfield>
  </datafield>
  <controlfield tag="005">20210316011433.0</controlfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2014-01-01</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1016/j.knosys.2013.11.003</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="v">56</subfield>
    <subfield code="c">108-122</subfield>
    <subfield code="p">KNOWLEDGE-BASED SYSTEMS</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Addam, Omer</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Elzohbi, Mohamad</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ozyer, Sibel T.</subfield>
    <subfield code="u">Cankaya Univ, Dept Comp Engn, Ankara, Turkey</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Elhajj, Ahmad</subfield>
    <subfield code="u">Univ Bradford, Dept Comp, Bradford BD7 1DP, W Yorkshire, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Gao, Shang</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Liu, Yimin</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ozyer, Tansel</subfield>
    <subfield code="u">TOBB Univ, Dept Comp Engn, Ankara, Turkey</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Kaya, Mehmet</subfield>
    <subfield code="u">Firat Univ, Dept Comp Engn, TR-23119 Elazig, Turkey</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ridley, Mick</subfield>
    <subfield code="u">Univ Bradford, Dept Comp, Bradford BD7 1DP, W Yorkshire, England</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Rokne, Jon</subfield>
    <subfield code="u">Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Alhajj, Reda</subfield>
  </datafield>
  <controlfield tag="001">61525</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
</record>
Views 55
Downloads 9
Data volume 3.0 kB
Unique views 52
Unique downloads 9
