NSF PAR Search | NSF Public Access Repository


Dynamic Augmentation Data Selection for Few-shot Text Classification

Liu, Guangliang; Jin, Lifeng; Yuan, Owen; Zhou, Jiayu (December 2022, Findings of the Association for Computational Linguistics: EMNLP 2022)

Data augmentation has been a popular method for fine-tuning pre-trained language models to increase model robustness and performance. With augmentation data coming from modifying gold train data (in-sample augmentation) or being harvested from general domain unlabeled data (out-of-sample augmentation), the quality of such data is the key to successful fine-tuning. In this paper, we propose a dynamic data selection method to select effective augmentation data from different augmentation sources according to the model’s learning stage, by identifying a set of augmentation samples that optimally facilitates the learning process of the most current model. The method first filters out augmentation samples with noisy pseudo labels through a curriculum learning strategy, then estimates the effectiveness of the retained augmentation data by its influence scores on the current model at every update, allowing the data selection process to be tightly tailored to model parameters. The two-stage augmentation strategy considers in-sample augmentation and out-of-sample augmentation in different learning stages. Experiments with both kinds of augmentation data on a variety of sentence classification tasks show that our method outperforms strong baselines, demonstrating its effectiveness. Analysis confirms the dynamic nature of the data effectiveness and the importance of model learning stages in the utilization of augmentation data.
Full Text Available
Salience Allocation as Guidance for Abstractive Summarization

Wang, Fei; Song, Kaiqiang; Zhang, Hongming; Jin, Lifeng; Cho, Sangwoo; Yao, Wenlin; Wang, Xiaoyang; Chen, Muhao; Yu, Dong (January 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Abstractive summarization models typically learn to capture the salient information from scratch implicitly. Recent literature adds extractive summaries as guidance for abstractive summarization models to provide hints of salient content and achieves better performance. However, extractive summaries as guidance could be over strict, leading to information loss or noisy signals. Furthermore, it cannot easily adapt to documents with various abstractiveness. As the number and allocation of salience content pieces varies, it is hard to find a fixed threshold deciding which content should be included in the guidance. In this paper, we propose a novel summarization approach with a flexible and reliable salience guidance, namely SEASON (SaliencE Allocation as Guidance for Abstractive SummarizatiON). SEASON utilizes the allocation of salience expectation to guide abstractive summarization and adapts well to articles in different abstractiveness. Automatic and human evaluations on two benchmark datasets show that the proposed method is effective and reliable. Empirical results on more than one million news articles demonstrate a natural fifteen-fifty salience split for news article sentences, providing a useful insight for composing news articles.
Full Text Available
Grounded PCFG Induction with Images

Jin, Lifeng; Schuler, William (December 2020, Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing)
Full Text Available
Character-based PCFG Induction for Modeling the Syntactic Acquisition of Morphologically Rich Languages

https://doi.org/10.18653/v1/2021.findings-emnlp.371

Jin, Lifeng; Oh, Byung-Doh; Schuler, William (January 2021, Findings of the Association for Computational Linguistics: EMNLP 2021)

Full Text Available
Depth-Bounded Statistical PCFG Induction as a Model of Human Grammar Acquisition

https://doi.org/10.1162/coli_a_00399

Jin, Lifeng; Schwartz, Lane; Doshi-Velez, Finale; Miller, Timothy; Schuler, William (March 2021, Computational Linguistics)
This article describes a simple PCFG induction model with a fixed category domain that predicts a large majority of attested constituent boundaries, and predicts labels consistent with nearly half of attested constituent labels on a standard evaluation data set of child-directed speech. The article then explores the idea that the difference between simple grammars exhibited by child learners and fully recursive grammars exhibited by adult learners may be an effect of increasing working memory capacity, where the shallow grammars are constrained images of the recursive grammars. An implementation of these memory bounds as limits on center embedding in a depth-specific transform of a recursive grammar yields a significant improvement over an equivalent but unbounded baseline, suggesting that this arrangement may indeed confer a learning advantage.
Full Text Available
The Importance of Category Labels in Grammar Induction with Child-directed Utterances

https://doi.org/10.18653/v1/2020.iwpt-1.15

Jin, Lifeng; Schuler, William (January 2020, Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task)

Full Text Available
Memory-bounded Neural Incremental Parsing for Psycholinguistic Prediction

Jin, Lifeng; Schuler, William (January 2020, Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task)

Full Text Available
Variance of Average Surprisal: A Better Predictor for Quality of Grammar from Unsupervised PCFG Induction

Jin, Lifeng; Schuler, William (January 2019, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics)

In unsupervised grammar induction, data likelihood is known to be only weakly correlated with parsing accuracy, especially at convergence after multiple runs. In order to find a better indicator for quality of induced grammars, this paper correlates several linguistically- and psycholinguistically-motivated predictors to parsing accuracy on a large multilingual grammar induction evaluation data set. Results show that variance of average surprisal (VAS) better correlates with parsing accuracy than data likelihood and that using VAS instead of data likelihood for model selection provides a significant accuracy boost. Further evidence shows VAS to be a better candidate than data likelihood for predicting word order typology classification. Analyses show that VAS seems to separate content words from function words in natural language grammars, and to better arrange words with different frequencies into separate classes that are more consistent with linguistic theory.
Full Text Available
Unsupervised Learning of PCFGs with Normalizing Flow

Jin, Lifeng; Doshi-Velez, Finale; Miller, Timothy; Schwartz, Lane; Schuler, William (January 2019, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics)

Unsupervised PCFG inducers hypothesize sets of compact context-free rules as explanations for sentences. PCFG induction not only provides tools for low-resource languages, but also plays an important role in modeling language acquisition (Bannard et al., 2009; Abend et al., 2017). However, current PCFG induction models, using word tokens as input, are unable to incorporate semantics and morphology into induction, and may encounter issues of sparse vocabulary when facing morphologically rich languages. This paper describes a neural PCFG inducer which employs context embeddings (Peters et al., 2018) in a normalizing flow model (Dinh et al., 2015) to extend PCFG induction to use semantic and morphological information. Linguistically motivated sparsity and categorical distance constraints are imposed on the inducer as regularization. Experiments show that the PCFG induction model with normalizing flow produces grammars with state-of-the-art accuracy on a variety of different languages. Ablation further shows a positive effect of normalizing flow, context embeddings and proposed regularizers.
Full Text Available
