Journal article Open Access

Efficient community identification and maintenance at multiple resolutions on distributed datastores

Aksu, Hidayet; Canim, Mustafa; Chang, Yuan-Chi; Korpeoglu, Ibrahim; Ulusoy, Ozgur


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">user-tubitak-destekli-proje-yayinlari</subfield>
    <subfield code="o">oai:zenodo.org:78935</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">The topic of network community identification at multiple resolutions is of great interest in practice to learn high cohesive subnetworks about different subjects in a network. For instance, one might examine the interconnections among web pages, blogs and social content to identify pockets of influencers on subjects like 'Big Data', 'smart phone' or 'global warming'. With dynamic changes to its graph representation and content, the incremental maintenance of a community poses significant challenges in computation. Moreover, the intensity of community engagement can be distinguished at multiple levels, resulting in a multi-resolution community representation that has to be maintained over time. In this paper, we first formalize this problem using the k-core metric projected at multiple k-values, so that multiple community resolutions are represented with multiple k-core graphs. Recognizing that large graphs and their even larger attributed content cannot be stored and managed by a single server, we then propose distributed algorithms to construct and maintain a multi-k-core graph, implemented on the scalable Big Data platform Apache HBase. Our experimental evaluation results demonstrate orders of magnitude speedup by maintaining multi-k-core incrementally over complete reconstruction. Our algorithms thus enable practitioners to create and maintain communities at multiple resolutions on multiple subjects in rich network content simultaneously. (C) 2015 Elsevier B.V. All rights reserved.</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Creative Commons Attribution</subfield>
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Aksu, Hidayet</subfield>
    <subfield code="u">Bilkent Univ, Dept Comp Engn, Ankara, Turkey</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:f55284c774866cd72140d78b5299cd55</subfield>
    <subfield code="s">218</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.tr/record/78935/files/bib-4882f087-dd0f-4b01-b86c-7699a90a75c6.txt</subfield>
  </datafield>
  <controlfield tag="005">20210316051334.0</controlfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2015-01-01</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1016/j.datak.2015.06.001</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Efficient community identification and maintenance at multiple resolutions on distributed datastores</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="v">100</subfield>
    <subfield code="c">133-147</subfield>
    <subfield code="p">DATA &amp; KNOWLEDGE ENGINEERING</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Canim, Mustafa</subfield>
    <subfield code="u">IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY USA</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Chang, Yuan-Chi</subfield>
    <subfield code="u">IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY USA</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Korpeoglu, Ibrahim</subfield>
    <subfield code="u">Bilkent Univ, Dept Comp Engn, Ankara, Turkey</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Ulusoy, Ozgur</subfield>
    <subfield code="u">Bilkent Univ, Dept Comp Engn, Ankara, Turkey</subfield>
  </datafield>
  <controlfield tag="001">78935</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
</record>
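
The abstract in field 520 describes representing community resolutions as k-core graphs at several k-values. As a rough illustration of that idea only, the sketch below computes core numbers on a single machine by repeatedly peeling the lowest-degree vertex, then reads off one vertex set per requested k. It is not the paper's distributed HBase algorithm and does no incremental maintenance; the function names `core_numbers` and `multi_k_core` and the edge-list input format are assumptions made for this example.

```python
from collections import defaultdict


def core_numbers(edges):
    """Return {vertex: core number} for an undirected graph given as edge pairs.

    Simple in-memory peeling: repeatedly remove a vertex of minimum
    remaining degree. Illustrative only; quadratic in the number of vertices.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)

    degree = {v: len(neighbours) for v, neighbours in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        v = min(remaining, key=degree.__getitem__)
        # The core number never decreases as peeling proceeds.
        k = max(k, degree[v])
        core[v] = k
        remaining.remove(v)
        for w in adj[v]:
            if w in remaining:
                degree[w] -= 1
    return core


def multi_k_core(edges, ks):
    """Return {k: set of vertices in the k-core} for each k in ks."""
    core = core_numbers(edges)
    return {k: {v for v, c in core.items() if c >= k} for k in ks}
```

Because the k-cores are nested, a single pass over the core numbers is enough to produce every resolution; a vertex with core number c belongs to the k-core for every k up to c.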