Published January 1, 2019 | Version v1
Conference paper Open

Waste Not: Meta-Embedding of Word and Context Vectors

  • 1. Marmara Univ, TR-34730 Istanbul, Turkey

Description

The word2vec and fastText models train two vectors per word: a word vector and a context vector. The context vectors are typically discarded after training, even though they may contain information useful for NLP tasks. We therefore combine word and context vectors in the framework of meta-embeddings. Our experiments show performance gains on several NLP tasks, including text classification, semantic similarity, and analogy. This approach can thus improve performance on downstream tasks while requiring minimal additional computational resources.
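As a rough illustration of combining the two vectors per word, the sketch below builds meta-embeddings by concatenation and by averaging. Both are common meta-embedding baselines, but the description does not state which combination methods the paper uses, so treat them as assumptions. The toy vectors and the names `word_vecs`, `context_vecs`, `meta_concat`, and `meta_average` are hypothetical and chosen only for this example.

```python
import math

# Toy vectors invented for illustration; real ones would come from a trained
# word2vec or fastText model that kept its context (output) vectors.
word_vecs = {"cat": [1.0, 0.0], "dog": [0.8, 0.2]}
context_vecs = {"cat": [0.0, 1.0], "dog": [0.2, 0.6]}


def l2_normalize(v):
    """Scale a vector to unit length so neither source dominates the result."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def meta_concat(word):
    """Assumed baseline: concatenate normalized word and context vectors."""
    return l2_normalize(word_vecs[word]) + l2_normalize(context_vecs[word])


def meta_average(word):
    """Assumed baseline: average normalized word and context vectors."""
    w = l2_normalize(word_vecs[word])
    c = l2_normalize(context_vecs[word])
    return [(a + b) / 2 for a, b in zip(w, c)]


def cosine(u, v):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    print(meta_concat("cat"))
    print(meta_average("cat"))
    print(cosine(meta_concat("cat"), meta_concat("dog")))
```

Concatenation doubles the dimensionality, while averaging keeps it unchanged; either way the only extra cost is retaining vectors that training already produced, which fits the claim of minimal additional computation.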

Files

bib-25a74055-4dd9-4f9c-8285-06384aa7ecdf.txt

Files (164 Bytes)

md5:cf9b014298a06ef4b655bddbc09b965c