skip to main content


Title: Text Recycling in Scientific Writing
Text recycling, often called “self-plagiarism”, is the practice of reusing textual material from one’s prior documents in a new work. The practice presents a complex set of ethical and practical challenges to the scientific community, many of which have not been addressed in prior discourse on the subject. This essay identifies and discusses these factors in a systematic fashion, concluding with a new definition of text recycling that takes these factors into account. Topics include terminology, what is not text recycling, factors affecting judgements about the appropriateness of text recycling, and visual materials.  more » « less
Award ID(s):
1737093
NSF-PAR ID:
10063216
Author(s) / Creator(s):
Date Published:
Journal Name:
Science and Engineering Ethics
ISSN:
1353-3452
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Because science advances incrementally, scientists often need to repeat material included in their prior work when composing new texts. Such “text recycling” is a common but complex writing practice, so authors and editors need clear and consistent guidance about what constitutes appropriate practice. Unfortunately, publishers’ policies on text recycling to date have been incomplete, unclear, and sometimes internally inconsistent. Building on 4 years of research on text recycling in scientific writing, the Text Recycling Research Project has developed a model text recycling policy that should be widely applicable for research publications in scientific fields. This article lays out the challenges text recycling poses for editors and authors, describes key factors that were addressed in developing the policy, and explains the policy’s main features. 
    more » « less
  2. When writing journal articles, STEM researchers produce a number of other genres such as grant proposals and conference posters, and their articles routinely build directly on their own prior work. As a result, STEM authors often reuse material from their completed documents in producing new documents. While this practice, known as text recycling (or self-plagiarism), is a debated issue in publishing and research ethics, little is known about researchers’ beliefs about what constitutes appropriate practice. This article presents results of from an exploratory, survey-based study on beliefs and attitudes toward text recycling among STEM “experts” (faculty researchers) and “novices” (graduate students and post docs). While expert and novice researchers are fairly consistent in distinguishing between text recycling and plagiarism, there is considerable disagreement about appropriate text recycling practice. 
    more » « less
  3. Abstract

    Because research in science, engineering and medical fields advances incrementally, researchers routinely write papers that build directly on their prior work. While each new research article is expected to make a novel contribution, researchers often need to repeat some material—method details, background and so on—from their previous articles, a practice called ‘text recycling’. While increasing awareness of text recycling has led to the proliferation of policies, journal editorials and scholarly articles addressing the practice, these documents tend to employ inconsistent terminology—using different terms to name the same key ideas and, even more problematic, using the same terms with different meanings. These inconsistencies make it difficult for readers to know precisely how the ideas or expectations articulated in one document relate to those of others. This paper first clarifies the problems with current terminology, showing how key terms are used inconsistently across publisher policies for authors, guidelines for editors and textbooks on research ethics. It then offers a new taxonomy of text‐recycling practices with terms designed to align with the acceptability of these practices in common research writing and publishing contexts.

     
    more » « less
  4. Background: Text recycling (hereafter TR)—the reuse of one’s own textual materials from one document in a new document—is a common but hotly debated and unsettled practice in many academic disciplines, especially in the context of peer-reviewed journal articles. Although several analytic systems have been used to determine replication of text—for example, for purposes of identifying plagiarism—they do not offer an optimal way to compare documents to determine the nature and extent of TR in order to study and theorize this as a practice in different disciplines. In this article, we first describe TR as a common phenomenon in academic publishing, then explore the challenges associated with trying to study the nature and extent of TR within STEM disciplines. We then describe in detail the complex processes we used to create a system for identifying TR across large corpora of texts, and the sentence-level string-distance lexical methods used to refine and test the system (White & Joy, 2004). The purpose of creating such a system is to identify legitimate cases of TR across large corpora of academic texts in different fields of study, allowing meaningful cross-disciplinary comparisons in future analyses of published work. The findings from such investigations will extend and refine our understanding of discourse practices in academic and scientific settings. Literature Review: Text-analytic methods have been widely developed and implemented to identify reused textual materials for detecting plagiarism, and there is considerable literature on such methods. (Instead of taking up space detailing this literature, we point readers to several recent reviews: Gupta, 2016; Hiremath & Otari, 2014; and Meuschke & Gipp, 2013). Such methods include fingerprinting, term occurrence analysis, citation analysis (identifying similarity in references and citations), and stylometry (statistically comparing authors’ writing styles; see Meuschke & Gipp, 2013). Although TR occurs in a wide range of situations, recent debate has focused on recycling from one published research paper to another—particularly in STEM fields (see, for example, Andreescu, 2013; Bouville, 2008; Bretag & Mahmud, 2009; Roig, 2008; Scanlon, 2007). An important step in better understanding the practice is seeing how authors actually recycle material in their published work. Standard methods for detecting plagiarism are not directly suitable for this task, as the objective is not to determine the presence or absence of reuse itself, but to study the types and patterns of reuse, including materials that are syntactically but not substantively distinct—such as “patchwriting” (Howard, 1999). In the present account of our efforts to create a text-analytic system for determining TR, we take a conventional alphabetic approach to text, in part because we did not aim at this stage of our project to analyze non-discursive text such as images or other media. However, although the project adheres to conventional definitions of text, with a focus on lexical replication, we also subscribe to context-sensitive approaches to text production. The results of applying the system to large corpora of published texts can potentially reveal varieties in the practice of TR as a function of different discourse communities and disciplines. Writers’ decisions within what appear to be canonical genres are contingent, based on adherence to or deviation from existing rules and procedures if and when these actually exist. Our goal is to create a system for analyzing TR in groups of texts produced by the same authors in order to determine the nature and extent of TR, especially across disciplinary areas, without judgment of scholars’ use of the practice. 
    more » « less
  5. Key points

    Text recycling is the reuse of material from an author's own prior work in a new document.

    While the ethical aspects of text recycling have received considerable attention, the legal aspects have been largely ignored or inaccurately portrayed.

    Copyright laws and publisher contracts are difficult to interpret and highly variable, making it difficult for authors or editors to know when text recycling in research writing is legal or illegal.

    We argue that publishers should revise their author contracts to make text recycling explicitly legal as long as authors follow ethics‐based guidelines.

     
    more » « less