Published January 1, 2017 | Version v1
Journal article
Open
Using latent semantic analysis for automated keyword extraction from large document corpora
Description
In this study, we describe a keyword extraction technique that uses latent semantic analysis (LSA) to identify semantically important single topic words, or keywords. We compare our method against two other automated keyword extractors, TF-IDF (term frequency-inverse document frequency) and MetaMap, using human-annotated keywords as a reference. Our results suggest that the LSA-based keyword extraction method performs comparably to the other techniques. Therefore, in an incremental update setting, the LSA-based method can be preferred over existing keyword extraction methods for extracting keywords from text descriptions in big data.
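The description does not spell out how the LSA scores are computed, so the following is only a minimal sketch of one common way to rank terms with LSA, not the authors' method. It assumes raw term counts as weights, a rank-`k` truncated SVD obtained by power iteration with deflation, and a score equal to the length of each term's vector in the latent space. The function name `lsa_keywords`, its parameters, and the toy corpus are all hypothetical.

```python
import math
import random


def lsa_keywords(docs, k=2, top_n=3, iters=200, seed=0):
    """Rank terms by the length of their vector in a k-dimensional LSA space.

    docs is a list of token lists. Raw counts are used as weights, which is
    an assumption of this sketch and not taken from the paper.
    """
    vocab = sorted({t for d in docs for t in d})
    row = {t: i for i, t in enumerate(vocab)}
    n_terms, n_docs = len(vocab), len(docs)

    # Term-document count matrix A (terms x documents).
    A = [[0.0] * n_docs for _ in range(n_terms)]
    for j, d in enumerate(docs):
        for t in d:
            A[row[t]][j] += 1.0

    def at_times(u):  # A^T u, one value per document
        return [sum(A[i][j] * u[i] for i in range(n_terms)) for j in range(n_docs)]

    def a_times(y):  # A y, one value per term
        return [sum(A[i][j] * y[j] for j in range(n_docs)) for i in range(n_terms)]

    rng = random.Random(seed)
    components = []  # (singular value, left singular vector) pairs
    for _comp in range(min(k, n_terms, n_docs)):
        u = [rng.random() + 0.1 for _ in range(n_terms)]
        for _step in range(iters):
            # Deflation: remove the directions that were already found.
            for _sigma, p in components:
                dot = sum(a * b for a, b in zip(u, p))
                u = [a - dot * b for a, b in zip(u, p)]
            w = a_times(at_times(u))  # one power-iteration step on A A^T
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            u = [x / norm for x in w]
        sigma = math.sqrt(sum(y * y for y in at_times(u)))
        if sigma < 1e-9:  # the matrix has no further non-zero components
            break
        components.append((sigma, u))

    # Score of a term: norm of its row in U_k * Sigma_k.
    scores = {
        t: math.sqrt(sum((s * u[row[t]]) ** 2 for s, u in components))
        for t in vocab
    }
    ranked = sorted(vocab, key=lambda t: (-scores[t], t))
    return ranked[:top_n]


# Toy corpus, invented for illustration only.
docs = [
    "gene expression gene protein".split(),
    "gene protein binding".split(),
    "gene expression analysis".split(),
    "market price stock".split(),
]
print(lsa_keywords(docs, k=2, top_n=3))
```

In practice a library SVD routine and a TF-IDF or log-entropy weighting would normally replace the hand-written power iteration and raw counts; the pure-Python version is used here only to keep the sketch dependency-free.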
Files
| Name | Size |
|---|---|
| 10-3906-elk-1511-203.pdf (md5:d217f68af4915b833a25825285ba147f) | 483.1 kB |