skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Structural causes of citation gaps
The social identity of a researcher can affect their position in a community, as well as the uptake of their ideas. In many fields, members of underrepresented or minority groups are less likely to be cited, leading to citation gaps. Though this empirical phenomenon has been well-studied, empirical work generally does not provide insight into the causes of citation gaps. I will argue, using mathematical models, that citation gaps are likely due in part to the structure of academic communities. The existence of these ‘structural causes’ has implications for attempts to lessen citation gaps, and for proposals to make academic communities more efficient (e.g. by eliminating pre-publication peer review). These proposals have the potential to create feedback loops, amplifying current structural inequities.  more » « less
Award ID(s):
2045007
PAR ID:
10331294
Author(s) / Creator(s):
Date Published:
Journal Name:
Philosophical Studies
ISSN:
0031-8116
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Accurate prediction of scientific impact is important for scientists, academic recommender systems, and granting organizations alike. Existing approaches rely on many years of leading citation values to predict a scientific paper’s citations (a proxy for impact), even though most papers make their largest contributions in the first few years after they are published. In this paper, we tackle a new problem: predicting a new paper’s citation time series from the date of publication (i.e., without leading values). We propose HINTS, a novel end-to-end deep learning framework that converts citation signals from dynamic heterogeneous information networks (DHIN) into citation time series. HINTS imputes pseudo-leading values for a paper in the years before it is published from DHIN embeddings, and then transforms these embeddings into the parameters of a formal model that can predict citation counts immediately after publication. Empirical analysis on two real-world datasets from Computer Science and Physics show that HINTS is competitive with baseline citation prediction models. While we focus on citations, our approach generalizes to other “cold start” time series prediction tasks where relational data is available and accurate prediction in early timestamps is crucial. 
    more » « less
  2. Benjamin, Paaßen; Carrie, Demmans Epp (Ed.)
    This paper was written with the help of ChatGPT. Recent advancements in the development and deployment of large generative language models to power generative AI tools, including OpenAIż˝fs ChatGPT, have led to their broad usage across virtually all fields of study. While the tools have been trained to generate human-like-dialogue in response to questions or prompts, they are similarly used to compose larger, more complex artifacts, including social media posts, essays, and even research articles. Although this abstract has been written entirely by a human without any input, consultation, or revision from a generative language model, it would likely be difficult to discern any difference as a reader. In light of this, there is growing debate and concern regarding using these models to aid the writing process, particularly concerning publication. Aside from some notable risks, including the unintentional generation of false information, citation of non-existing research articles, or plagiarism by generating text that is sampled from another source without proper citation, there are additional questions pertaining to the originality of ideas expressed in a work has been partially-written or revised by a generative language model. We present this paper as both a case study into the usage of generative models to aid in the writing of academic research articles but also as an example of how transparency and open science practices may help in addressing several issues that have been raised in other contexts and communities. While this paper neither attempts to promote nor contest the use of these language models in any writing task, it is the goal of this work to provide insight and potential guidance into the ethical and effective usage of these models within this domain. 
    more » « less
  3. Searching for relevant literature is a fundamental part of academic research. The search for relevant literature is becoming a more difficult and time-consuming task as millions of articles are published each year. As a solution, recommendation systems for academic papers attempt to help researchers find relevant papers quickly. This paper focuses on graph-based recommendation systems for academic papers using citation networks. This type of paper recommendation system leverages a graph of papers linked by citations to create a list of relevant papers. In this study, we explore recommendation systems for academic papers using citation networks incorporating citation relations. We define citation relation based on the number of times the origin paper cites the reference paper, and use this citation relation to measure the strength of the relation between the papers. We created a weighted network using citation relation as citation weight on edges. We evaluate our proposed method on a real-world publication data set, and conduct an extensive comparison with three state-of-the-art baseline methods. Our results show that citation network-based recommendation systems using citation weights perform better than the current methods. 
    more » « less
  4. Cameron, Carrie (Ed.)
    Grant writing is an essential skill to develop for academic and other career success but providing individual feedback to large numbers of trainees is challenging. In 2014, we launched the Stanford Biosciences Grant Writing Academy to support graduate students and postdocs in writing research proposals. Its core program is a multi-week Proposal Bootcamp designed to increase the feedback writers receive as they develop and refine their proposals. The Proposal Bootcamp consisted of two-hour weekly meetings that included mini lectures and peer review. Bootcamp participants also attended faculty review workshops to obtain faculty feedback. Postdoctoral trainees were trained and hired as course teaching assistants and facilitated weekly meetings and review workshops. Over the last six years, the annual Bootcamp has provided 525 doctoral students and postdocs with multi-level feedback (peer and faculty). Proposals from Bootcamp participants were almost twice as likely to be funded than proposals from non-Bootcamp trainees. Overall, this structured program provided opportunities for feedback from multiple peer and faculty reviewers, increased the participants’ confidence in developing and submitting research proposals, while accommodating a large number of participants. 
    more » « less
  5. Abstract Research in affective science includes over one hundred thousand articles, the vast majority of which have been published in only the past two decades. The size and rapid growth of this field have led to unique challenges for the twenty-first-century scientist including how to develop both breadth and depth of scholarship, curb siloing and promote integrative and interdisciplinary framework, and represent and monitor the field in its entirety. Here, we help address these issues by compactly mapping out this enormous field using citation network analysis (CNA). We generated a citation matrix of over 100,000 publications and over 1 million citations since the seminal works on emotion by Charles Darwin (1872) and William James (1884). Using graph theory metric and content analysis of titles and abstracts, we identified and characterized the contents of 69 research communities, their most influential articles, and their interconnectedness with each other. We further identified potential “missed connections” between communities that share similar content but do not have strong citation-based connections. In doing so, we establish the first, low-dimensional representation, or field-wide map, of a substantial portion of the affective sciences literature. This panoramic view of the field provides affective and non-affective scientists alike with the means to rapidly survey dozens of major research communities and topics in the field, guide scholarship development, and identify gaps and connections for developing an integrative science. 
    more » « less