Journal article Open Access

Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs

Komecoglu, Basak Buluz; Yilmaz, Burcu


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">user-tubitak-adresli-yayinlar</subfield>
    <subfield code="o">oai:aperta.ulakbim.gov.tr:275735</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;In an era of exponential growth in online news sources, the need for intelligent digital solutions capable of efficiently analyzing and organizing large amounts of news content has become crucial. This paper presents a graph-based methodology designed to enhance Topic Detection and Tracking (TDT) tasks in natural language processing by efficiently clustering news events into coherent stories. The proposed approach leverages a novel event graph model that captures not only the characteristics of individual news events but also their collective narrative context. Using Named Entity Centred Frequent Subgraphs, the model excels in identifying recurring patterns of events and thus provides a framework for learning a robust, language-independent, and structured representation for structuring news stories, which represents a significant advance in the refinement of traditional clustering algorithms. Empirical experiments using a multilingual benchmark dataset, the News Clustering Dataset, highlight the superior clustering performance of our approach compared to state-of-the-art monolingual document clustering techniques, particularly in English and the competitive results in Spanish. To underline the adaptability of the methodology to low-resource languages, the Turkish 'Story-Based News Dataset' developed specifically for this study also promises to serve as an important resource for a wide range of natural language processing tasks.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Creative Commons Attribution</subfield>
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Komecoglu, Basak Buluz</subfield>
    <subfield code="u">Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:893341b1830d7bdb6a6cfa4276b914fd</subfield>
    <subfield code="s">147</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.tr/record/275735/files/bib-e4f5e267-fec5-424e-8fd1-24d75aff878d.txt</subfield>
  </datafield>
  <controlfield tag="005">20250417124232.0</controlfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2024-01-01</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1109/ACCESS.2024.3435343</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="v">12</subfield>
    <subfield code="c">20</subfield>
    <subfield code="p">IEEE ACCESS</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Yilmaz, Burcu</subfield>
    <subfield code="u">Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye</subfield>
  </datafield>
  <controlfield tag="001">275735</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-adresli-yayinlar</subfield>
  </datafield>
</record>