Dergi makalesi Açık Erişim

Investigation of Luhn's claim on information retrieval

Kocabas, Ilker; Dincer, Bekir Taner; Karaoglan, Bahar


Dublin Core

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Kocabas, Ilker</dc:creator>
  <dc:creator>Dincer, Bekir Taner</dc:creator>
  <dc:creator>Karaoglan, Bahar</dc:creator>
  <dc:date>2011-01-01</dc:date>
  <dc:description>In this study, we show how Luhn's claim about the degree of importance of a word in a document can be related to information retrieval. His basic idea is transformed into z -scores as the weights of terms for the purpose of modeling terra frequency (If) within documents. The Luhn-based models represented in this paper are considered as the TF component of proposed TF x IDF weighing schemes. Moreover, the final term weighting functions appropriate for the TF x IDF weighting scheme are applied to TREC-6, -7, and -8 databases. The experimental results show relevance to Luhn's claim by having high mean average precision (MAP) for the terms with frequencies around the mean frequency of terms within a document. On the other hand, the weighting, which significantly discriminates the importance between low/high frequencies and medium frequencies, degrades the retrieval performance. Therefore, any weighting scheme (TF) that is directly proportional to If has a probability of high retrieval performance, if this can optimally indicate the difference of the importance regarding tf values and also optimally eliminate the terms that have high frequencies.</dc:description>
  <dc:identifier>https://aperta.ulakbim.gov.trrecord/22341</dc:identifier>
  <dc:identifier>oai:zenodo.org:22341</dc:identifier>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>http://www.opendefinition.org/licenses/cc-by</dc:rights>
  <dc:source>TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES 19(6) 993-1004</dc:source>
  <dc:title>Investigation of Luhn's claim on information retrieval</dc:title>
  <dc:type>info:eu-repo/semantics/article</dc:type>
  <dc:type>publication-article</dc:type>
</oai_dc:dc>
24
5
görüntülenme
indirilme
Görüntülenme 24
İndirme 5
Veri hacmi 925 Bytes
Tekil görüntülenme 24
Tekil indirme 5

Alıntı yap