skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A dataset for the study of identity at scale: Annual Prevalence of American Twitter Users with specified Token in their Profile Bio 2015–2020
Personally expressed identity is who or what an individual themselves says they are, and it should be studied at scale. At scale means with data on millions of individuals, which is newly available and comes timestamped and geocoded. This work introduces a dataset for the study of identity at scale and describes the method for collecting and aggregating such data. Further, tools and theory for working with the data are presented. A demonstration analysis provides evidence that personal, individual development and changing cultural norms can be observed with these data and methods.  more » « less
Award ID(s):
1927227
PAR ID:
10326618
Author(s) / Creator(s):
Editor(s):
Cotfas, Liviu-Adrian
Date Published:
Journal Name:
PLOS ONE
Volume:
16
Issue:
11
ISSN:
1932-6203
Page Range / eLocation ID:
e0260185
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Simon, Ralph (Ed.)
    Animals can actively encode different types of identity information in learned communication signals, such as group membership or individual identity. The social environments in which animals interact may favor different types of information, but whether identity information conveyed in learned signals is robust or responsive to social disruption over short evolutionary timescales is not well understood. We inferred the type of identity information that was most salient in vocal signals by combining computational tools, including supervised machine learning, with a conceptual framework of “hierarchical mapping”, or patterns of relative acoustic convergence across social scales. We used populations of a vocal learning species as a natural experiment to test whether the type of identity information emphasized in learned vocalizations changed in populations that experienced the social disruption of introduction into new parts of the world. We compared the social scales with the most salient identity information among native and introduced range monk parakeet (Myiopsitta monachus) calls recorded in Uruguay and the United States, respectively. We also evaluated whether the identity information emphasized in introduced range calls changed over time. To place our findings in an evolutionary context, we compared our results with another parrot species that exhibits well-established and distinctive regional vocal dialects that are consistent with signaling group identity. We found that both native and introduced range monk parakeet calls displayed the strongest convergence at the individual scale and minimal convergence within sites. We did not identify changes in the strength of acoustic convergence within sites over time in the introduced range calls. These results indicate that the individual identity information in learned vocalizations did not change over short evolutionary timescales in populations that experienced the social disruption of introduction. Our findings point to exciting new research directions about the robustness or responsiveness of communication systems over different evolutionary timescales. 
    more » « less
  2. Abstract Significant advances in computational ethology have allowed the quantification of behaviour in unprecedented detail. Tracking animals in social groups, however, remains challenging as most existing methods can either capture pose or robustly retain individual identity over time but not both.To capture finely resolved behaviours while maintaining individual identity, we built NAPS (NAPS is ArUco Plus SLEAP), a hybrid tracking framework that combines state‐of‐the‐art, deep learning‐based methods for pose estimation (SLEAP) with unique markers for identity persistence (ArUco). We show that this framework allows the exploration of the social dynamics of the common eastern bumblebee (Bombus impatiens).We provide a stand‐alone Python package for implementing this framework along with detailed documentation to allow for easy utilization and expansion. We show that NAPS can scale to long timescale experiments at a high frame rate and that it enables the investigation of detailed behavioural variation within individuals in a group.Expanding the toolkit for capturing the constituent behaviours of social groups is essential for understanding the structure and dynamics of social networks. NAPS provides a key tool for capturing these behaviours and can provide critical data for understanding how individual variation influences collective dynamics. 
    more » « less
  3. null (Ed.)
    Motivation: Recent efforts to expand K-12 computer science education highlight the great need for well-prepared computer science (CS) teachers. Teacher identity theory offers a particular conceptual lens for us to understand computer science teacher preparation and professional development. The emerging literature suggests that teacher identity is central to sustaining motivation, efficacy, job satisfaction, and commitment, and these attributes are crucial in determining teacher retention. While the benefits associated with a strong sense of teacher identity are great, teachers face unique challenges and tensions in developing their professional identity for teaching computer science. Objectives: This exploratory study attempts to operationalize computer science teacher identity through discussing the potential domains, proposing and testing a quantitative instrument for assessing computer science teachers’ professional identity. Method: We first discussed the potential domains of computer science teacher identity based on recent teacher identity literature and considerations on some unique challenges for computer science teachers. Then we proposed the computer science teacher identity scale, which was piloted through a national K-12 computer science teacher survey with 3,540 completed responses. The survey results were analyzed with a series of factor analyses to test the internal structure of the computer science teacher identity scale. Results: Our analyses reveal a four-factor solution for the computer science teacher identity scale, which is composed of CS teaching commitment, CS pedagogical confidence, confidence to engage students, and sense of community/belonging. There were significant differences among the teachers with different computer science teaching experiences. In general, teachers with more computer science teaching experience had higher computer science teacher identity scores on all four factors. Discussion: The four-factor model along with a large national dataset invites a deeper analysis of the data and can provide important benchmarks. Such an instrument can be used to explore developmental patterns in computer science teacher identity, and function as a pedagogical tool to provoke discussion and reflection among teachers about their professional development. This study may also contribute to understanding computer science teachers’ professional development needs and inform efforts to prepare, develop, and retain computer science teachers. 
    more » « less
  4. Identity, or how people choose to define themselves, is gaining traction as an explanation for who pursues and persists in engineering. A number of quantitative studies have developed scales for predicting engineering identity in undergraduate students. However, the outcome measure of identity is sometimes based on a single item. In this paper, we present the results of a new two-item scale. The scale is adapted from an existing measure of identification with an organization that was developed by Bergami and Bagozzi [1] and refined by Bartel [2]. The measure focuses on the “cognitive (i.e., self-categorization) component of identification” (p. 556), and has been found to have high convergent validity with another, rigorous measure of identification with an organization or other entity created by Mael and Ashforth [3]. This measure utilizes one primarily visual and one verbal item to assess the extent to which an individual cognitively categorizes himself or herself as an engineer. The scale was administered to 1528 engineering undergraduate students during the 2016-2017 academic year. Internal consistency of the new engineering identity scale, as measured by Cronbach’s alpha, is 0.84. This new scale is an important step toward refining quantitative measures of, and the study of, engineering identity development in undergraduate students and other populations. 
    more » « less
  5. In animal societies, identity signals are common, mediate interactions within groups, and allow individuals to discriminate group-mates from out-group competitors. However, individual recognition becomes increasingly challenging as group size increases and as signals must be transmitted over greater distances. Group vocal signatures may evolve when successful in-group/out-group distinctions are at the crux of fitness-relevant decisions, but group signatures alone are insufficient when differentiated within-group relationships are important for decision-making. Spotted hyenas are social carnivores that live in stable clans of less than 125 individuals composed of multiple unrelated matrilines. Clan members cooperate to defend resources and communal territories from neighbouring clans and other mega carnivores; this collective defence is mediated by long-range (up to 5 km range) recruitment vocalizations, called whoops. Here, we use machine learning to determine that spotted hyena whoops contain individual but not group signatures, and that fundamental frequency features which propagate well are critical for individual discrimination. For effective clan-level cooperation, hyenas face the cognitive challenge of remembering and recognizing individual voices at long range. We show that serial redundancy in whoop bouts increases individual classification accuracy and thus extended call bouts used by hyenas probably evolved to overcome the challenges of communicating individual identity at long distance. 
    more » « less