Waste Not: Meta-Embedding of Word and Context Vectors

Degirmenci, Selin; Gerek, Aydin; Ganiz, Murat Can

doi:10.1007/978-3-030-23281-8_35

Published January 1, 2019 | Version v1

Conference paper, Open access

1. Marmara Univ, TR-34730 Istanbul, Turkey

The word2vec and fastText models train two vectors per word: a word vector and a context vector. The context vectors are typically discarded after training, even though they may contain useful information for different NLP tasks. We therefore combine word and context vectors in the framework of meta-embeddings. Our experiments show performance increases on several NLP tasks such as text classification, semantic similarity, and analogy. In conclusion, this approach can be used to increase performance on downstream tasks while requiring minimal additional computational resources.
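
As an illustration of the idea in the abstract, the following minimal Python sketch combines a word vector and a context vector for each word by concatenation and by averaging, two common meta-embedding schemes. The abstract does not state which combination method the authors use, so both functions, the toy vectors, and all names here are assumptions for illustration only, not the authors' implementation.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def concat_meta(word_vec, context_vec):
    """Meta-embedding by concatenating the normalized word and context vectors."""
    return l2_normalize(word_vec) + l2_normalize(context_vec)


def average_meta(word_vec, context_vec):
    """Meta-embedding by averaging the normalized word and context vectors."""
    w = l2_normalize(word_vec)
    c = l2_normalize(context_vec)
    return [(a + b) / 2 for a, b in zip(w, c)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy vectors; in practice these would come from a trained
# word2vec or fastText model (input matrix = word vectors, output matrix =
# context vectors).
word_vectors = {"cat": [3.0, 4.0], "dog": [4.0, 3.0]}
context_vectors = {"cat": [0.0, 2.0], "dog": [2.0, 0.0]}

# Build one meta-embedding per word from both vector types.
meta = {
    word: concat_meta(word_vectors[word], context_vectors[word])
    for word in word_vectors
}

similarity = cosine(meta["cat"], meta["dog"])
```

Concatenation doubles the dimensionality but keeps both sources intact, while averaging keeps the original dimensionality at the cost of mixing the two spaces. Either way, the only extra work is a pass over vectors that training already produced.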

Files

Name	Size
bib-25a74055-4dd9-4f9c-8285-06384aa7ecdf.txt (md5:cf9b014298a06ef4b655bddbc09b965c)	164 Bytes

	All versions	This version
Views	65	65
Downloads	25	25
Data volume	4.1 kB	4.1 kB

Created

March 16, 2021

Resource type

Conference paper

Conference

NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019)

Rights

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

TÜBİTAK ULAKBİM