Follow the leader: Documents on the leading edge of semantic change get more citations

Soni, Sandeep; Lerman, Kristina; Eisenstein, Jacob

doi:10.1002/asi.24421

Citation Details

Follow the leader: Documents on the leading edge of semantic change get more citations

Abstract

Diachronic word embeddings—vector representations of words over time—offer remarkable insights into the evolution of language and provide a tool for quantifying sociocultural change from text documents. Prior work has used such embeddings to identify shifts in the meaning of individual words. However, simply knowing that a word has changed in meaning is insufficient to identify the instances of word usage that convey the historical meaning or the newer meaning. In this study, we link diachronic word embeddings to documents, by situating those documents as leaders or laggards with respect to ongoing semantic changes. Specifically, we propose a novel method to quantify the degree of semantic progressiveness in each word usage, and then show how these usages can be aggregated to obtain scores for each document. We analyze two large collections of documents, representing legal opinions and scientific articles. Documents that are scored as semantically progressive receive a larger number of citations, indicating that they are especially influential. Our work thus provides a new technique for identifying lexical semantic leaders and demonstrates a new link between progressive use of language and influence in a citation network.

NSF-PAR ID:: 10236541

Author(s) / Creator(s):: Soni, Sandeep ; Lerman, Kristina ; Eisenstein, Jacob

Publisher / Repository:: Wiley Blackwell (John Wiley & Sons)

Date Published:: 2020-10-22

Journal Name:: Journal of the Association for Information Science and Technology

Volume:: 72

Issue:: 4

ISSN:: 2330-1635

Page Range / eLocation ID:: p. 478-492

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1002/asi.24421

More Like this