Few-Shot Representation Learning for Out-Of-Vocabulary Words

Hu, Ziniu; Chen, Ting; Chang, Kai-Wei; Sun, Yizhou

doi:10.18653/v1/P19-1402

Citation Details

Few-Shot Representation Learning for Out-Of-Vocabulary Words

Existing approaches for learning word embedding often assume there are sufficient occurrences for each word in the corpus, such that the representation of words can be accurately estimated from their contexts. However, in real-world scenarios, out-of-vocabulary (a.k.a. OOV) words that do not appear in training corpus emerge frequently. How to learn accurate representations of these words to augment a pre-trained embedding by only a few observations is a challenging research problem. In this paper, we formulate the learning of OOV embedding as a few-shot regression problem by fitting a representation function to predict an oracle embedding vector (defined as embedding trained with abundant observations) based on limited contexts. Specifically, we propose a novel hierarchical attention network-based embedding framework to serve as the neural regression function, in which the context information of a word is encoded and aggregated from K observations. Furthermore, we propose to use Model-Agnostic Meta-Learning (MAML) for adapting the learned model to the new corpus fast and robustly. Experiments show that the proposed approach significantly outperforms existing methods in constructing an accurate embedding for OOV words and improves downstream tasks when the embedding is utilized. more »

Award ID(s):: 1760523

PAR ID:: 10144869

Author(s) / Creator(s):: Hu, Ziniu; Chen, Ting; Chang, Kai-Wei; Sun, Yizhou

Date Published:: 2019-01-01

Journal Name:: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics

Page Range / eLocation ID:: 4102 to 4112

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/P19-1402

More Like this