skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Characterizing the Vector Data Ecosystem
Abstract A growing body of information on vector-borne diseases has arisen as increasing research focus has been directed towards the need for anticipating risk, optimizing surveillance, and understanding the fundamental biology of vector-borne diseases to direct control and mitigation efforts. The scope and scale of this information, in the form of data, comprising database efforts, data storage, and serving approaches, means that it is distributed across many formats and data types. Data ranges from collections records to molecular characterization, geospatial data to interactions of vectors and traits, infection experiments to field trials. New initiatives arise, often spanning the effort traditionally siloed in specific research disciplines, and other efforts wane, perhaps in response to funding declines, different research directions, or lack of sustained interest. Thusly, the world of vector data – the Vector Data Ecosystem – can become unclear in scope, and the flows of data through these various efforts can become stymied by obsolescence, or simply by gaps in access and interoperability. As increasing attention is paid to creating FAIR (Findable Accessible Interoperable, and Reusable) data, simply characterizing what is ‘out there’, and how these existing data aggregation and collection efforts interact, or interoperate with each other, is a useful exercise. This study presents a snapshot of current vector data efforts, reporting on level of accessibility, and commenting on interoperability using an illustration to track a specimen through the data ecosystem to understand where it occurs for the database efforts anticipated to describe it (or parts of its extended specimen data).  more » « less
Award ID(s):
2021909 2213854 2016265 2016282
PAR ID:
10398324
Author(s) / Creator(s):
; ;
Editor(s):
Faraji, Ary
Date Published:
Journal Name:
Journal of Medical Entomology
ISSN:
0022-2585
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. A growing body of information on vector-borne diseases has arisen as increasing research focus has been directed towards the need for anticipating risk, optimizing surveillance, and understanding the fundamental biology of vector-borne diseases to direct efforts to control and mitigation. The scope and scale of this information, in the form of data, comprising database efforts, data storage, and serving approaches, mean that it is distributed across many formats and data types. Data ranges from collections records to molecular characterization, geospatial data to interactions of vectors and traits, infection experiments to field trials. New initiatives arise, often spanning the effort traditionally siloed in specific research disciplines, and other efforts wane, perhaps in response to funding declines, different research directions, or lack of sustained interest. Thusly, the world of vector data - the Vector Data Ecosystem - can become unclear in scope, and the flows of data through these various efforts can become stymied by obsolescence, or simply by gaps in access and interoperability. As increasing attention is paid to creating FAIR (Findable Accessible Interoperable, and Reusable) data, simply characterizing what is ‘out there’, and how these existing data aggregation and collection efforts interact, or interoperate with each other, is a useful exercise. This website and related project presents a list of vector data curation efforts, a brief description of their stated scope and purpose, and level of accessibility. The Vector Data Ecosystem by the University of Notre Dame Center for Research Computing, and is being developed and maintained as part of the NSF funded VectorByte Initiative (www.vectorbyte.org). 
    more » « less
  2. Abstract BackgroundTick-borne diseases are a growing public health threat in the United States. Despite the prevalence and rising burden of tick-borne diseases, there are major gaps in baseline knowledge and surveillance efforts for tick vectors, even among vector control districts and public health agencies. To address this issue, an online tick training course (OTTC) was developed through the Southeastern Center of Excellence in Vector-Borne Diseases (SECOEVBD) to provide a comprehensive knowledge base on ticks, tick-borne diseases, and their management. MethodsThe OTTC consisted of training modules covering topics including tick biology, tick identification, tick-borne diseases, and public health, personal tick safety, and tick surveillance. The course was largely promoted to vector control specialists and public health employees throughout the Southeastern US. We collected assessment and survey data on participants to gauge learning outcomes, perceptions of the utility of knowledge gained, and barriers and facilitators to applying the knowledge in the field. ResultsThe OTTC was successful in increasing participants’ baseline knowledge across all course subject areas, with the average score on assessment increasing from 62.6% (pre-course) to 86.7% (post-course). More than half of participants (63.6%) indicated that they would definitely use information from the course in their work. Barriers to using information identified in the delayed assessment included lack of opportunities to apply skills (18.5%) and the need for additional specialized training beyond what the OTTC currently offers (18.5%), while the main facilitator (70.4%) for applying knowledge was having opportunities at work, such as an existing tick surveillance program. ConclusionsOverall, this OTTC demonstrated capacity to improve knowledge in a necessary and underserved public health field, and more than half of participants use or plan to use the information in their work. The geographic reach of this online resource was much larger than simply for the Southeastern region for which it was designed, suggesting a much broader need for this resource. Understanding the utility and penetrance of training programs such as these is important for refining materials and assessing optimal targets for training. 
    more » « less
  3. Over the last decade, the United States paleontological collections community has invested heavily in the digitization of specimen-based data, including over 10 million USD funded through the National Science Foundation’s Advancing Digitization of Biodiversity Collections program. Fossil specimen data—9.0 million records and counting (Global Biodiversity Information Facility 2024)—are now accessible on open science platforms such as the Global Biodiversity Information Facility (GBIF). However, the full potential of this data is far from realized due to fundamental challenges associated with mobilization, discoverability, and interoperability of paleontological information within the existing cyberinfrastructure landscape and data pipelines. Additionally, it can be difficult for individuals with varying expertise to develop a comprehensive understanding of the existing landscape due to its breadth and complexity. Here, we present preliminary results from a project aiming to explore how we might address these problems. Funding from the US National Science Foundation (NSF) to the University of Colorado Museum of Natural History, Smithsonian National Museum of Natural History, and Arizona State University will result in, among other products, an “ecosystem map” for the paleontological collections community. This map will be an information-rich visualization of entities (e.g. concepts, systems, platforms, mechanisms, drivers, tools, documentation, data, standards, people, organizations) operating in, intersecting with, or existing in parallel to our domain. We are inspired and informed by similar efforts to map the biodiversity informatics landscape (Bingham et al. 2017) and the research infrastructure landscape (Distributed System of Scientific Collections 2024), as well as by many ongoing metadata cataloging projects, e.g. re3data and the Global Registry of Scientific Collections (GRSciColl). Our strategy for developing this ecosystem map is to model the existing information and systems landscape by characterizing entities, e.g. potentially in a graph database as nodes with relationships to other nodes. The ecosystem map will enable us to provide guidance for communities workingacrossdifferent sectors of the landscape, promoting a shared understanding of the ecosystem that everyone works in together. We can also use the map to identify points of entry and engagement at various stages of the paleontological data process, and to engage diverse memberswithinthe paleontological community. We see three primary user types for this map: people new(er) to the community, people with expertise in a subset of the community, and people working to integrate initiatives and systems across communities. Each of these user types needs tailored access to the ecosystem map and its community knowledge. By promoting shared knowledge with the map, users will be able to identify their own space within the ecosystem and the connections or partnerships that they can utilize to expand their knowledge or resources, relieving the burden on any single individual to hold a comprehensive understanding. For example, the flow of taxonomic information between publications, collections, digital resources, and biodiversity aggregators is not straightforward or easy to understand. A person with expertise in collections care may want to use the ecosystem map to understand why taxonomic identifications associated with their specimen occurrence records are showing up incorrectly when published to GBIF. We envision that our final ecosystem map will visualize the flow of taxonomic information and how it is used to interpret specimen occurrence data, thereby highlighting to this user where problems may be happening and whom to ask for help in addressing them (Fig. 1). Ultimately, development of this map will allow us to identify mobilization pathways for paleontological data, highlight core cyberinfrastructure resources, define cyberinfrastructure gaps, strategize future partnerships, promote shared knowledge, and engage a broader array of expertise in the process. Contributing domain-based evidence FAIRly*2 requires expertise that bridges the content (e.g. paleontology) and the mechanics (e.g. informatics). By centering the role of humans in open science cyberinfrastructure throughout our process, we hope to develop systems that create and sustain such expertise. 
    more » « less
  4. Abstract Vector‐borne diseases (VBDs) are embedded within complex socio‐ecological systems. While research has traditionally focused on the direct effects of VBDs on human morbidity and mortality, it is increasingly clear that their impacts are much more pervasive. VBDs are dynamically linked to feedbacks between environmental conditions, vector ecology, disease burden, and societal responses that drive transmission. As a result, VBDs have had profound influence on human history. Mechanisms include: (1) killing or debilitating large numbers of people, with demographic and population‐level impacts; (2) differentially affecting populations based on prior history of disease exposure, immunity, and resistance; (3) being weaponised to promote or justify hierarchies of power, colonialism, racism, classism and sexism; (4) catalysing changes in ideas, institutions, infrastructure, technologies and social practices in efforts to control disease outbreaks; and (5) changing human relationships with the land and environment. We use historical and archaeological evidence interpreted through an ecological lens to illustrate how VBDs have shaped society and culture, focusing on case studies from four pertinent VBDs: plague, malaria, yellow fever and trypanosomiasis. By comparing across diseases, time periods and geographies, we highlight the enormous scope and variety of mechanisms by which VBDs have influenced human history. 
    more » « less
  5. Abstract Vector-borne diseases pose a persistent and increasing challenge to human, animal, and agricultural systems globally. Mathematical modeling frameworks incorporating vector trait responses are powerful tools to assess risk and predict vector-borne disease impacts. Developing these frameworks and the reliability of their predictions hinge on the availability of experimentally derived vector trait data for model parameterization and inference of the biological mechanisms underpinning transmission. Trait experiments have generated data for many known and potential vector species, but the terminology used across studies is inconsistent, and accompanying publications may share data with insufficient detail for reuse or synthesis. The lack of data standardization can lead to information loss and prohibits analytical comprehensiveness. Here, we present MIReVTD, a Minimum Information standard for Reporting Vector Trait Data. Our reporting checklist balances completeness and labor- intensiveness with the goal of making these important experimental data easier to find and reuse, without onerous effort for scientists generating the data. To illustrate the standard, we provide an example reproducing results from anAedes aegyptimosquito study. 
    more » « less