Title: popler: An R package for extraction and synthesis of population time series from the long‐term ecological research (LTER) network
Abstract

Population dynamics play a central role in the historical and current development of fundamental and applied ecological science. The nascent culture of open data promises to increase the value of population dynamics studies to the field of ecology. However, synthesis of population data is constrained by the difficulty in identifying relevant datasets, by the heterogeneity of available data and by access to raw (as opposed to aggregated or derived) observations.

To obviate these issues, we built a relational database, popler, and its R client, the library popler. popler accommodates the vast majority of population data under a common structure, and without the need for aggregating raw observations. The popler R library is designed for users unfamiliar with the structure of the database and with the SQL language. This R library allows users to identify, download, explore and cite datasets salient to their needs.

We implemented popler as a PostgreSQL instance, where we stored population data originating from the United States Long Term Ecological Research (LTER) Network. Our focus on the US LTER data aims to leverage the potential of this vast open data resource. The database currently contains 305 datasets from 25 LTER sites. popler is designed to accommodate automatic updates of existing datasets, and to accommodate additional datasets from LTER as well as non‐LTER studies.

The combination of the online database and the R library popler is a resource for data synthesis efforts in population ecology. The common structure of popler simplifies comparative analyses, and the availability of raw data confers flexibility in data analysis. The popler R library maximizes these opportunities by providing a user‐friendly interface to the online database.

 
Award ID(s):
1655499
NSF-PAR ID:
10457227
Publisher / Repository:
Wiley-Blackwell
Journal Name:
Methods in Ecology and Evolution
Volume:
11
Issue:
2
ISSN:
2041-210X
Page Range / eLocation ID:
p. 258-264
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Comprehensive, time‐scaled phylogenies provide a critical resource for many questions in ecology, evolution and biodiversity. Methodological advances have increased the breadth of taxonomic coverage in phylogenetic data; however, accessing and reusing these data remain challenging.

    We introduce the Fish Tree of Life website and associated R package fishtree to provide convenient access to sequences, phylogenies, fossil calibrations and diversification rate estimates for the most diverse group of vertebrate organisms, the ray‐finned fishes. The Fish Tree of Life website presents subsets and visual summaries of phylogenetic and comparative data, and is complemented by the R package, which provides flexible programmatic access to the same underlying data source for advanced users wishing to extend or reanalyse the data.

    We demonstrate functionality with an overview of the website, and show three examples of advanced usage through the R package. First, we test for the presence of long branch attraction artefacts across the fish tree of life. The second example examines the effects of habitat on diversification rate in the pufferfishes. The final example demonstrates how a community phylogenetic analysis could be conducted with the package.

    This resource makes a large comparative vertebrate dataset easily accessible via the website, while the R package enables the rapid reuse and reproducibility of research results via its ability to easily integrate with other R packages and software for molecular biology and comparative methods.

     
  2. Abstract

    Matrix population models (MPMs) are an important tool for biologists seeking to understand the causes and consequences of variation in vital rates (e.g. survival, reproduction) across life cycles. Empirical MPMs describe the age‐ or stage‐structured demography of organisms and usually represent the life history of a population during a particular time frame at a specific geographical location.

    The COMPADRE Plant Matrix Database and COMADRE Animal Matrix Database are the most extensive resources for MPM data, collectively containing >12,000 individual projection matrices for >1,100 species globally. Although these databases represent an unparalleled resource for researchers, land managers and educators, the current computational tools available to answer questions with MPMs impose significant barriers to potential COM(P)ADRE database users by requiring advanced knowledge to handle diverse data structures and program custom analysis functions.

    To close this knowledge gap, we present two interrelated R packages designed to (a) facilitate the use of these databases by providing functions to acquire, quality control and manage both the MPM data contained in COMPADRE and COMADRE, and a user's own MPM data (Rcompadre) and (b) present a range of functions to calculate life‐history traits from MPMs in support of ecological and evolutionary analyses (Rage). We provide examples to illustrate the use of both.

    Rcompadre and Rage will facilitate demographic analyses using MPM data and contribute to the improved replicability of studies using these data. We hope that this new functionality will allow researchers, land managers and educators to unlock the potential behind the thousands of MPMs and ancillary metadata stored in the COMPADRE and COMADRE matrix databases, and in their own MPM data.

     
  3. Abstract

    The diversity and distribution of marine species in eastern Australia are influenced by one of the world's strongest western boundary currents, the East Australia Current, which propels water and propagules poleward, a flow intensifying due to climate change.

    Population genetic structure of the asterinid sea star Meridiastra calcar was investigated across its range in eastern Australia (12° of latitude, 2,500 km) from northern New South Wales to its poleward‐extending range in Tasmania at the southern edge of the influence of the East Australia Current.

    Population structure and connectivity of M. calcar were examined across six bioregions using six microsatellite loci (nuclear DNA) and the control region (mitochondrial DNA). The potential influence of the extent of M. calcar's intertidal rock platform habitat was also assessed.

    Genetic structure analysis indicated that the Hawkesbury Shelf contained distinct genetic clusters, whereas the two sites in the Batemans Shelf differed from each other, with Jervis Bay Marine Park having just one genetic cluster. The Manning Shelf, Twofold Shelf, and Bruny bioregions all had similar genetic composition.

    Strong self‐seeding (68–98%) was indicated by microsatellite loci for all bioregions, with lower (0.3–6.5%) migration between bioregions. Poleward (New South Wales to Tasmania) migration was low except from the Manning Shelf (30%).

    Contemporary population connectivity and genetic structure of M. calcar appear to be influenced by ocean currents, habitat distribution, and its short planktonic larval duration, which was a minimum of 12–14 days, depending on availability of a settlement cue.

    The dominance of unique genetic groups in the Hawkesbury bioregion shows the importance of this region for M. calcar and possibly a diversity of co‐distributed rock platform species. This highlights how important it is to have a large marine park in the Hawkesbury bioregion, which is presently lacking.

     
  4. Abstract

    Phenotypic data are crucial for understanding genotype–phenotype relationships, assessing the tree of life and revealing trends in trait diversity over time. Large‐scale description of whole organisms for quantitative analyses (phenomics) presents several challenges, and technological advances in the collection of genomic data outpace those for phenomic data. Reasons for this disparity include the time‐consuming and expensive nature of collecting discrete phenotypic data and mining previously published data on a given species (both often requiring anatomical expertise across taxa), and computational challenges involved with analysing high‐dimensional datasets.

    One approach to building approximations of organismal phenomes is to combine published datasets of discrete characters assembled for phylogenetic analyses into a phenomic dataset. Despite a wealth of legacy datasets in the literature for many groups, relatively few methods exist for automating the assembly, analysis, and visualization of phenomic datasets in phylogenetic contexts. Here, we introduce a new R package phenotools for integrating (fusing original or legacy datasets), curating (finding and removing duplicates) and visualizing phenomic datasets.

    We demonstrate the utility of the proposed toolkit with a morphological dataset for flightless birds and two morphological datasets for theropod dinosaurs and provide recommendations for character construction to maximize accessibility in future workflows. Visualization tools allow rapid identification of anatomical subregions with difficult or problematic histories of homology.

    We anticipate these tools aiding automation of the assembly and visualization of phenomic datasets to inform evolutionary relationships and rates of phenotypic evolution.

     
  5. Abstract

    Many important demographic processes are seasonal, including survival. For many species, mortality risk is significantly higher at certain times of the year than at others, whether because resources are scarce, susceptibility to predators or disease is high, or both. Despite the importance of survival modelling in wildlife sciences, no tools are available to estimate the peak, duration and relative importance of these ‘seasons of mortality’.

    We present cyclomort, an R package that estimates the timing, duration and intensity of any number of mortality seasons with reliable confidence intervals. The package includes a model selection approach to determine the number of mortality seasons and to test whether seasons of mortality vary across discrete grouping factors.

    We illustrate the periodic hazard function model and workflow of cyclomort with simulated data. We then estimate mortality seasons of two caribou Rangifer tarandus populations that have strikingly different mortality patterns, including different numbers and timing of mortality peaks, and a marked change in one population over time.

    The cyclomort package was developed to estimate mortality seasons for wildlife, but the package can model any time‐to‐event processes with a periodic component.

     