Title: Primum Non Nocere: Before working with Indigenous data, the ACL must confront ongoing colonialism
In this paper, we challenge the ACL community to reckon with historical and ongoing colonialism by adopting a set of ethical obligations and best practices drawn from the Indigenous studies literature. While the vast majority of NLP research focuses on a very small number of very high resource languages (English, Chinese, etc.), some work has begun to engage with Indigenous languages. No research involving Indigenous language data can be considered ethical without first acknowledging that Indigenous languages are not merely very low resource languages. The toxic legacy of colonialism permeates every aspect of interaction between Indigenous communities and outside researchers. To this end, we propose that the ACL draft and adopt an ethical framework for NLP researchers and computational linguists wishing to engage in research involving Indigenous languages.
Award ID(s):
1761680 2243445
PAR ID:
10347128
Author(s) / Creator(s):
Date Published:
Journal Name:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics
Volume:
2
Page Range / eLocation ID:
724-731
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Serikov, O.; Voloshina, E.; Postnikova, A.; Klyachko, E.; Neminova, E.; Vylomova, E.; Shavrina, T.; Le Ferrand, E.; Tyers, F. (Ed.)
    In recent times, there has been a growing number of research studies focused on addressing the challenges posed by low-resource languages and the transcription bottleneck phenomenon. This phenomenon has driven the development of speech recognition methods to transcribe regional and Indigenous languages automatically. Although there is much talk about bridging the gap between speech technologies and field linguistics, there is a lack of documented efficient communication between NLP experts and documentary linguists. The models created for low-resource languages often remain within the confines of computer science departments, while documentary linguistics remain attached to traditional transcription workflows. This paper presents the early stage of a collaboration between NLP experts and field linguists, resulting in the successful transcription of Kréyòl Gwadloupéyen using speech recognition technology. 
  2. This material is primarily based upon work supported by the National Science Foundation Graduate Research Fellowship (grant no. DGE-1321845). Addressing complex social-ecological issues requires all relevant sources of knowledge and data, especially those held by communities who remain close to the land. Centuries of oppression, extractive research practices, and misrepresentation have hindered balanced knowledge exchange with Indigenous communities and inhibited innovation and problem-solving capacity in all scientific fields. A recent shift in the research landscape reflects a growing interest in engaging across diverse communities and ways of knowing. Scientific discussions increasingly highlight the inherent value of Indigenous environmental ethics frameworks and processes as the original roadmaps for sustainable development planning, including their potential in addressing the climate crisis and related social and environmental concerns. Momentum in this shift is also propelled by an increasing body of research evidencing the role of Indigenous land stewardship for maintaining ecological health and biodiversity. However, a key challenge straining this movement lies rooted in colonial residue and ongoing actions that suppress and co-opt Indigenous knowledge systems. Scientists working with incomplete datasets privilege a handful of narratives, conceptual understandings, languages, and historical contexts, while failing to engage thousands of collective bodies of intergenerational, place-based knowledge systems. The current dominant colonial paradigm in scientific research risks continued harmful impacts to Indigenous communities that sustain diverse knowledge systems. Here, we outline how ethical standards in researcher practice can be raised in order to reconcile colonial legacies and ongoing settler colonial practices. 
We synthesize across Indigenous and community-based research protocols and frameworks, transferring knowledge across disciplines, and ground truthing methods and processes in our own practice, to present a relational science working model for supporting Indigenous rights and reconciliation in research. We maintain that core Indigenous values of integrity, respect, humility, and reciprocity should shape researcher responsibilities and methods applied in order to raise ethical standards and long-term relational accountability regarding Indigenous lands, rights, communities, and our shared futures. 
  3. Michael Lachney and Aman Yadav, Special issue (Ed.)
    This article offers Ancestral Computing for Sustainability (ACS) to dismantle the logics of settler colonialism that affect accessibility, identities, and epistemologies of computer science education (CSE). ACS centers Indigenous epistemologies in researching CSE across four public universities in the United States. This paper describes Ancestral Computing for Sustainability and explores reflections of two students engaging as researchers in ACS inquiry. Drawing on Indigenous methodologies and Participatory Action Research, they share their reflections as co-researchers in ACS through storywork. These critical reflections include their relationship to computing, observations of the interdependent work within ACS, ethics and sustainability, and their experiences within the focus groups. The article ends with recommendations for furthering ACS as a decolonial approach that centers Indigenous epistemologies in CSE. Recommendations for CSE include incorporating Ancestral Knowledge Systems, adding sustainability as a topic within computing education pathways, and building student-faculty relationships based on trust to foster students’ academic and personal growth within CSE and research.
  4. Africa has over 2000 indigenous languages, but they are under-represented in NLP research due to a lack of datasets. In recent years, there has been progress in developing labelled corpora for African languages. However, they are often available in a single domain and may not generalize to other domains. In this paper, we focus on the task of sentiment classification for cross-domain adaptation. We create a new dataset, NollySenti, based on Nollywood movie reviews for five languages widely spoken in Nigeria (English, Hausa, Igbo, Nigerian-Pidgin, and Yorùbá). We provide an extensive empirical evaluation using classical machine learning methods and pre-trained language models. Leveraging transfer learning, we compare the performance of cross-domain adaptation from the Twitter domain and cross-lingual adaptation from English. Our evaluation shows that transfer from English in the same target domain leads to more than 5% improvement in accuracy compared to transfer from Twitter in the same language. To further mitigate the domain difference, we leverage machine translation (MT) from English to other Nigerian languages, which leads to a further improvement of 7% over cross-lingual evaluation. While MT to low-resource languages is often of low quality, through human evaluation, we show that most of the translated sentences preserve the sentiment of the original English reviews.
  5.
    A growing body of literature has argued for the reconceptualization of Latin America as a settler colony. Contrary to the self-proclaimed decolonization of Latin American states upon their independence two centuries ago, the settlers who came to Latin America stayed and preserved the structure of settler colonialism to the present day. This article analyzes the case of Nicaragua through the conceptual frame of settler colonialism and examines an apt case study: the Indigenous and Afrodescendant communities of the Rama-Kriol Territory in southeastern Nicaragua, where I have conducted activist ethnographic research since 2014. The ongoing colonization of the Rama-Kriol Territory exhibits not only failures of the state to enforce legal protections of multicultural rights, but also the extension of a colonial logic of dispossession and elimination. The case of the Rama-Kriol Territory demonstrates the entanglements of Nicaraguan settler colonialism with international institutions, development banks, multinational corporations, and settler colonial projects around the world. I conclude that social science researchers should attend to continuing and emergent forms of Indigenous sovereignty in Nicaragua. Amid the fading backdrop of liberal multiculturalism in Latin America, these assertions of sovereignty pose a political horizon of decolonization and an end to settler violence, dispossession, and domination. 