NSF PAR Search | NSF Public Access Repository


Transformer Embeddings of Irregularly Spaced Events and Their Participants

Chenghao Yang; Hongyuan Mei; Jason Eisner (April 2022, Proceedings of the Tenth International Conference on Learning Representations (ICLR))

Full Text Available
On the Uncomputability of Partition Functions In Energy-Based Sequence Models

Chu-Cheng Lin; Arya McCarthy (April 2022, Proceedings of the Tenth International Conference on Learning Representations (ICLR))

Full Text Available
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Sabrina J. Mielke; Zaid Alyafeai; Elizabeth Salesky; Colin Raffel; Manan Dey; Matthias Gallé; Arun Raja; Chenglei Si; Wilson Y. Lee; Benoît Sagot; et al. (December 2021, Computing Research Repository (arXiv))

What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating them as discrete, atomic tokens. Starting with byte-pair encoding (BPE), however, subword-based approaches have become dominant in many areas, enabling small vocabularies while still allowing for fast inference. Is the end of the road character-level modeling or byte-level processing? In this survey, we connect several lines of work from the pre-neural and neural eras by showing how hybrid approaches combining words and characters, as well as subword-based approaches built on learned segmentation, have been proposed and evaluated. We conclude that there is not, and likely never will be, a single silver-bullet solution for all applications, and that thinking seriously about tokenization remains important for many of them.
Full Text Available
Limitations of Autoregressive Models and Their Alternatives

https://doi.org/10.18653/v1/2021.naacl-main.405

Lin, Chu-Cheng; Jaech, Aaron; Li, Xin; Gormley, Matthew R.; Eisner, Jason (June 2021, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT))

Full Text Available
SIGTYP 2021 Shared Task: Robust Spoken Language Identification

https://doi.org/10.18653/v1/2021.sigtyp-1.11

Salesky, Elizabeth; Abdullah, Badr M.; Mielke, Sabrina; Klyachko, Elena; Serikov, Oleg; Ponti, Edoardo Maria; Kumar, Ritesh; Cotterell, Ryan; Vylomova, Ekaterina (June 2021, Proceedings of the Third Workshop on Computational Typology and Multilingual NLP)

Full Text Available
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts

https://doi.org/10.18653/v1/2021.naacl-main.410

Qin, Guanghui; Eisner, Jason (June 2021, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT))

Full Text Available
Noise-Contrastive Estimation for Multivariate Point Processes

Hongyuan Mei; Tom Wan; Jason Eisner (December 2020, Advances in Neural Information Processing Systems)

Full Text Available
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

https://doi.org/10.18653/v1/2020.acl-main.149

Bugliarello, Emanuele; Mielke, Sabrina J.; Anastasopoulos, Antonios; Cotterell, Ryan; Okazaki, Naoaki (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Full Text Available
A Corpus for Large-Scale Phonetic Typology

https://doi.org/10.18653/v1/2020.acl-main.415

Salesky, Elizabeth; Chodroff, Eleanor; Pimentel, Tiago; Wiesner, Matthew; Cotterell, Ryan; Black, Alan W; Eisner, Jason (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL))

Full Text Available
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

https://doi.org/10.18653/v1/2020.sigmorphon-1.1

Vylomova, Ekaterina; White, Jennifer; Salesky, Elizabeth; Mielke, Sabrina J.; Wu, Shijie; Ponti, Edoardo Maria; Hall Maudslay, Rowan; Zmigrod, Ran; Valvoda, Josef; Toldova, Svetlana; et al. (July 2020, Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology)

Full Text Available
