Title: What we talk about when we talk about colors
Names for colors vary widely across languages, but color categories are remarkably consistent. Shared mechanisms of color perception help explain consistent partitions of visible light into discrete color vocabularies. But the mappings from colors to words are not identical across languages, which may reflect communicative needs: how often speakers must refer to objects of different color. Here we quantify the communicative needs of colors in 130 different languages by developing an inference algorithm for this problem. We find that communicative needs are not uniform: Some regions of color space exhibit 30-fold greater demand for communication than other regions. The regions of greatest demand correlate with the colors of salient objects, including ripe fruits in primate diets. Our analysis also reveals a hidden diversity in the communicative needs of colors across different languages, which is partly explained by differences in geographic location and the local biogeography of linguistic communities. Accounting for language-specific, nonuniform communicative needs improves predictions for how a language maps colors to words, and how these mappings vary across languages. Our account closes an important gap in the compression theory of color naming, while opening directions to study cross-cultural variation in the need to communicate different colors and its impact on the cultural evolution of color categories.
Award ID(s):
1946882
PAR ID:
10404244
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
118
Issue:
39
ISSN:
0027-8424
Sponsoring Org:
National Science Foundation
More Like this
  1. Nölle, J; Raviv, L; Graham, E; Hartmann, S; Jadoul, Y; Josserand, M; Matzinger, T; Mudd, K; Pleyer, M; Slonimska, A (Ed.)
    Why are some words more frequent than others? Surprisingly, the obvious answers to this seemingly simple question, e.g., that frequent words reflect greater communicative needs, are either wrong or incomplete. We show that a word’s frequency is strongly associated with its position in a semantic association network. More centrally located words are more frequent. But is a word’s centrality in a network merely a reflection of inherent centrality of the word’s meaning? Through cross-linguistic comparisons, we found that differences in the frequency of translation-equivalents are predicted by differences in the word’s network structures in the different languages. Specifically, frequency was linked to how many connections a word had and to its capacity to bridge words that are typically not linked. This hints that a word’s frequency (and with it, its meaning) may change as a function of the word’s association with other words. 
  2. We investigated the emergence of dispersion in phonological systems using an established experimental paradigm in which pairs of participants play a non-linguistic communication game, taking turns to select discrete colors from a continuous underlying space and send them to each other to communicate animal silhouettes. Over time participants established sets of signals made up of combinatorial color units, analogous to the phonemes of natural language. This allowed us to investigate the role of interactive pressures on the emergence of organizational structure in phonological inventories, principally dispersion. We manipulated minimum signal length (as a means of investigating the role of coarticulation) and the presence of probabilistic noise. We also manipulated the nature of the underlying color space. There was an effect of color space but not of noise or minimum signal length. However, dispersion occurred at above-chance levels in all conditions. Our results provide evidence for the role of communicative interaction in the emergence and cultural evolution of phonological structure.
  3. A growing body of research shows that both signed and spoken languages display regular patterns of iconicity in their vocabularies. We compared iconicity in the lexicons of American Sign Language (ASL) and English by combining previously collected ratings of ASL signs (Caselli, Sevcikova Sehyr, Cohen-Goldberg, & Emmorey, 2017) and English words (Winter, Perlman, Perry, & Lupyan, 2017) with the use of data-driven semantic vectors derived from English. Our analyses show that models of spoken language lexical semantics drawn from large text corpora can be useful for predicting the iconicity of signs as well as words. Compared to English, ASL has a greater number of regions of semantic space with concentrations of highly iconic vocabulary. There was an overall negative relationship between semantic density and the iconicity of both English words and ASL signs. This negative relationship disappeared for highly iconic signs, suggesting that iconic forms may be more easily discriminable in ASL than in English. Our findings contribute to an increasingly detailed picture of how iconicity is distributed across different languages.
  4. In language evolution, formation of conceptual categories preceded formation of linguistic semantic categories (Hurford, 2007). The mapping from concepts to semantics is non-isomorphic, however, as particular languages categorize conceptual space in divergent ways (e.g. English put in is acceptable for both tight-fit and loose-fit relations, while Korean kkita encodes tight-fit relationships only; Choi & Bowerman, 1991). Despite this variation, are there crosslinguistic patterns in how words lexicalize conceptual space? We address this question by analyzing how child homesigners from four different cultures describe instrumental events (e.g. cutting bread with a knife). Homesigners are congenitally deaf individuals who have not been taught a signed language. Despite growing up without structured linguistic input, these individuals use a gestural system ("homesign") to communicate (Goldin-Meadow, 2003). We find that homesign descriptions of instrumental events reflect categories present in adult English, Spanish and Mandarin, suggesting crosslinguistic biases for how verbs encode the conceptual space of events, biases which may have been present over the course of language evolution.
  5. NLP research has increasingly focused on subjective tasks such as emotion analysis. However, existing emotion benchmarks suffer from two major shortcomings: (1) they largely rely on keyword-based emotion recognition, overlooking crucial cultural dimensions required for deeper emotion understanding, and (2) many are created by translating English-annotated data into other languages, leading to potentially unreliable evaluation. To address these issues, we introduce Cultural Lenses on Emotion (CuLEmo), the first benchmark designed to evaluate culture-aware emotion prediction across six languages: Amharic, Arabic, English, German, Hindi, and Spanish. CuLEmo comprises 400 crafted questions per language, each requiring nuanced cultural reasoning and understanding. We use this benchmark to evaluate several state-of-the-art LLMs on culture-aware emotion prediction and sentiment analysis tasks. Our findings reveal that (1) emotion conceptualizations vary significantly across languages and cultures, (2) LLM performance likewise varies by language and cultural context, and (3) prompting in English with explicit country context often outperforms in-language prompts for culture-aware emotion and sentiment understanding. The dataset and evaluation code are publicly available.