skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Are cross-linguistically frequent systems easier to learn? The case of evidentiality.
It is often assumed that cross-linguistically more prevalent distinctions are easier to learn (Typological Prevalence Hypothesis - TPH). Prior work supports this hypothesis in phonology, morphology and syntax but has not addressed semantics. Using an Artificial Language Learning paradigm, we explore the learnability of semantic distinctions within the domain of evidentiality (i.e. the linguistic encoding of information sources). Our results support the TPH, since the most prevalent evidential system was learned best while the most rare evidentiality system yielded the worst learnability results. Furthermore, our results indicate that, cross-linguistically, indirect information sources seem to be marked preferentially (and acquired more easily) compared to direct sources. We explain this pattern in terms of the pragmatic need to mark indirect, potentially more unreliable sources over direct sources of information. The Document  more » « less
Award ID(s):
1632849
PAR ID:
10147297
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Proceedings of the Annual Conference of the Cognitive Science Society
ISSN:
1069-7977
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: The uniform information density (UID) hypothesis, which holds that information should be spread evenly throughout an utterance. Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically. To this end, we use computational models to test whether real orders lead to greater information uniformity than counterfactual orders. In our empirical study of 10 typologically diverse languages, we find that: (i) among SVO languages, real word orders consistently have greater uniformity than reverse word orders, and (ii) only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders. These findings are compatible with a pressure for information uniformity in the development and usage of natural languages.1 
    more » « less
  2. Abstract Since the advent of the internet, communication paradigms have continuously evolved, resulting in a present-day landscape where the dynamics of information dissemination have undergone a complete transformation compared to the past. In this study, we challenge the conventional two-step flow communication model, a long-standing paradigm in the field. Our approach introduces a more intricate multi-step and multi-actor model that effectively captures the complexities of modern information spread. We test our hypothesis by examining the spread of information on the Twitter platform. Our findings support the multi-step and multi-actor model hypothesis. In this framework, influencers (individuals with a significant presence in social media) emerge as new central figures and partially take on the role previously attributed to opinion leaders. However, this does not apply to opinion leaders who adapt and reaffirm their influential position on social media, here defined as opinion-leading influencers. Additionally, we note a substantial number of adopters directly accessing information sources, suggesting a potential decline in influence in both opinion leaders and influencers. Finally, we found distinctions in the diffusion patterns of left-/right-leaning groups, indicating variations in the underlying structure of information dissemination across different ideologies. 
    more » « less
  3. null (Ed.)
    We examined the coaching practices of three mathematics teacher educators as they engaged in real-time coaching with secondary mathematics preservice teachers. Situated in a novel early field experience and under close supervision, preservice teachers instructed undergraduate students in an introductory mathematics course; teacher educators coached in real time during these teaching episodes. Forty-four preservice teachers participated in this study, resulting in a data corpus of 44 videos of their teaching. Findings indicate that direct coaching was used more than indirect coaching, and pacing was the most prevalent focus of direct coaching. 
    more » « less
  4. This tutorial targets researchers and practitioners who are interested in AI and ML technologies for structural information extraction (IE) from unstructured textual sources. Particularly, this tutorial will provide audience with a systematic introduction to recent advances of IE, by answering several important research questions. These questions include (i) how to develop an robust IE system from noisy, insufficient training data, while ensuring the reliability of its prediction? (ii) how to foster the generalizability of IE through enhancing the system’s cross-lingual, cross-domain, cross-task and cross-modal transferability? (iii) how to precisely support extracting structural information with extremely fine-grained, diverse and boundless labels? (iv) how to further improve IE by leveraging indirect supervision from other NLP tasks, such as NLI, QA or summarization, and pre-trained language models? (v) how to acquire knowledge to guide the inference of IE systems? We will discuss several lines of frontier research that tackle those challenges, and will conclude the tutorial by outlining directions for further investigation. 
    more » « less
  5. Summary Causal mediation analysis has historically been limited in two important ways: (i) a focus has traditionally been placed on binary exposures and static interventions and (ii) direct and indirect effect decompositions have been pursued that are only identifiable in the absence of intermediate confounders affected by exposure. We present a theoretical study of an (in)direct effect decomposition of the population intervention effect, defined by stochastic interventions jointly applied to the exposure and mediators. In contrast to existing proposals, our causal effects can be evaluated regardless of whether an exposure is categorical or continuous and remain well-defined even in the presence of intermediate confounders affected by exposure. Our (in)direct effects are identifiable without a restrictive assumption on cross-world counterfactual independencies, allowing for substantive conclusions drawn from them to be validated in randomized controlled trials. Beyond the novel effects introduced, we provide a careful study of nonparametric efficiency theory relevant for the construction of flexible, multiply robust estimators of our (in)direct effects, while avoiding undue restrictions induced by assuming parametric models of nuisance parameter functionals. To complement our nonparametric estimation strategy, we introduce inferential techniques for constructing confidence intervals and hypothesis tests, and discuss open-source software, the $$\texttt{medshift}$$  $$\texttt{R}$$ package, implementing the proposed methodology. Application of our (in)direct effects and their nonparametric estimators is illustrated using data from a comparative effectiveness trial examining the direct and indirect effects of pharmacological therapeutics on relapse to opioid use disorder. 
    more » « less