Title: popler: An R package for extraction and synthesis of population time series from the long‐term ecological research (LTER) network
Abstract

Population dynamics play a central role in the historical and current development of fundamental and applied ecological science. The nascent culture of open data promises to increase the value of population dynamics studies to the field of ecology. However, synthesis of population data is constrained by the difficulty in identifying relevant datasets, by the heterogeneity of available data and by access to raw (as opposed to aggregated or derived) observations.

To obviate these issues, we built a relational database, popler, and its R client, the library popler. popler accommodates the vast majority of population data under a common structure, and without the need for aggregating raw observations. The popler R library is designed for users unfamiliar with the structure of the database and with the SQL language. This R library allows users to identify, download, explore and cite datasets salient to their needs.

We implemented popler as a PostgreSQL instance, where we stored population data originating from the United States Long Term Ecological Research (LTER) Network. Our focus on the US LTER data aims to leverage the potential of this vast open data resource. The database currently contains 305 datasets from 25 LTER sites. popler is designed to accommodate automatic updates of existing datasets, and to accommodate additional datasets from LTER as well as non‐LTER studies.

The combination of the online database and the R library popler is a resource for data synthesis efforts in population ecology. The common structure of popler simplifies comparative analyses, and the availability of raw data confers flexibility in data analysis. The popler R library maximizes these opportunities by providing a user‐friendly interface to the online database.

 
Award ID(s):
1655499
NSF-PAR ID:
10457227
Publisher / Repository:
Wiley-Blackwell
Journal Name:
Methods in Ecology and Evolution
Volume:
11
Issue:
2
ISSN:
2041-210X
Page Range / eLocation ID:
p. 258-264
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Matrix population models (MPMs) are an important tool for biologists seeking to understand the causes and consequences of variation in vital rates (e.g. survival, reproduction) across life cycles. Empirical MPMs describe the age‐ or stage‐structured demography of organisms and usually represent the life history of a population during a particular time frame at a specific geographical location.

    The COMPADRE Plant Matrix Database and COMADRE Animal Matrix Database are the most extensive resources for MPM data, collectively containing >12,000 individual projection matrices for >1,100 species globally. Although these databases represent an unparalleled resource for researchers, land managers and educators, the current computational tools available to answer questions with MPMs impose significant barriers to potential COM(P)ADRE database users by requiring advanced knowledge to handle diverse data structures and program custom analysis functions.

    To close this knowledge gap, we present two interrelated R packages designed to (a) facilitate the use of these databases by providing functions to acquire, quality control and manage both the MPM data contained in COMPADRE and COMADRE, and a user's own MPM data (Rcompadre) and (b) present a range of functions to calculate life‐history traits from MPMs in support of ecological and evolutionary analyses (Rage). We provide examples to illustrate the use of both.

    Rcompadre and Rage will facilitate demographic analyses using MPM data and contribute to the improved replicability of studies using these data. We hope that this new functionality will allow researchers, land managers and educators to unlock the potential behind the thousands of MPMs and ancillary metadata stored in the COMPADRE and COMADRE matrix databases, and in their own MPM data.

     
  2. Abstract

    Comprehensive, time‐scaled phylogenies provide a critical resource for many questions in ecology, evolution and biodiversity. Methodological advances have increased the breadth of taxonomic coverage in phylogenetic data; however, accessing and reusing these data remain challenging.

    We introduce the Fish Tree of Life website and associated R package fishtree to provide convenient access to sequences, phylogenies, fossil calibrations and diversification rate estimates for the most diverse group of vertebrate organisms, the ray‐finned fishes. The Fish Tree of Life website presents subsets and visual summaries of phylogenetic and comparative data, and is complemented by the R package, which provides flexible programmatic access to the same underlying data source for advanced users wishing to extend or reanalyse the data.

    We demonstrate functionality with an overview of the website, and show three examples of advanced usage through the R package. First, we test for the presence of long branch attraction artefacts across the fish tree of life. The second example examines the effects of habitat on diversification rate in the pufferfishes. The final example demonstrates how a community phylogenetic analysis could be conducted with the package.

    This resource makes a large comparative vertebrate dataset easily accessible via the website, while the R package enables the rapid reuse and reproducibility of research results via its ability to easily integrate with other R packages and software for molecular biology and comparative methods.

     
  3. Abstract

    Projects focused on movement behaviour and home range are commonplace, but beyond a focus on choosing appropriate research questions, there are no clear guidelines for such studies. Without these guidelines, designing an animal tracking study to produce reliable estimates of space‐use and movement properties (necessary to answer basic movement ecology questions) is often done in an ad hoc manner.

    We developed ‘movedesign’, a user‐friendly Shiny application, which can be utilized to investigate the precision of three estimates regularly reported in movement and spatial ecology studies: home range area, speed and distance travelled. Conceptually similar to statistical power analysis, this application enables users to assess the degree of estimate precision that may be achieved with a given sampling design; that is, the choices regarding data resolution (sampling interval) and battery life (sampling duration).

    Leveraging the ‘ctmm’ R package, we utilize two methods proven to handle many common biases in animal movement datasets: autocorrelated kernel density estimators (AKDEs) and continuous‐time speed and distance (CTSD) estimators. Longer sampling durations are required to reliably estimate home range areas via the detection of a sufficient number of home range crossings. In contrast, speed and distance estimation requires a sampling interval short enough to ensure that a statistically significant signature of the animal's velocity remains in the data.

    This application addresses key challenges faced by researchers when designing tracking studies, including the trade‐off between long battery life and high resolution of GPS locations collected by the devices, which may result in a compromise between reliably estimating home range or speed and distance. ‘movedesign’ has broad applications for researchers and decision‐makers, supporting them to focus efforts and resources in achieving the optimal sampling design strategy for their research questions, prioritizing the correct deployment decisions for insightful and reliable outputs, while understanding the trade‐off associated with these choices.

     
  4. Abstract

    Plant transpiration is the largest evaporative flux from most vegetated ecosystems, playing a dominant role in energy balance, water and element cycling, ecosystem services and water security. Quantification of plant‐level transpiration, for example sap flux, is essential to land managers and scientists. Thermal dissipation probes (TDP) are reliable and affordable tools for measuring sap flux, but difficulties in replicable data processing often serve as a barrier to their use and interpretation of data.

    AquaFlux is an R package designed to efficiently process and analyse TDP data. This program maximizes data collection by continually importing raw TDP values and alerting the user of any malfunctioning sensors. Data processing is expedited through a user‐friendly graphical interface, predictive algorithms and data recovery options. AquaFlux's post‐processing options address gapfilling, radial trends in sap flux across sapwood and rescaling from points to whole stems.

    To ensure reproducibility and transparency, all data processing steps are automatically documented, highlighting the impact of user decisions. AquaFlux conforms to emerging best practices in data science and TDP data processing and analyses.

    Understanding spatiotemporal patterns of sap flux and how they relate to plant traits is essential for enhancing agricultural productivity, optimizing land management planning, ecological studies and improving climate modelling. AquaFlux provides a robust tool to facilitate predictive understanding of plant transpiration.

     
  5. Summary

    Leaf angle distribution (LAD) in forest canopies affects estimates of leaf area, light interception, and global‐scale photosynthesis, but is often simplified to a single theoretical value. Here, we present TLSLeAF (Terrestrial Laser Scanning Leaf Angle Function), an automated open‐source method of deriving LADs from terrestrial laser scanning.

    TLSLeAF produces canopy‐scale leaf angle and LADs by relying on gridded laser scanning data. The approach increases processing speed, improves angle estimates, and requires minimal user input. Key features are automation, leaf–wood classification, beta parameter output, and implementation in R to increase accessibility for the ecology community.

    TLSLeAF precisely estimates leaf angle with minimal distance effects on angular estimates while rapidly producing LADs on a consumer‐grade machine. We challenge the popular spherical LAD assumption, showing sensitivity to ecosystem type in plant area index and foliage profile estimates that translate to c. 25% and c. 11% increases in canopy net photosynthesis and solar‐induced chlorophyll fluorescence, respectively.

    TLSLeAF can now be applied to the vast catalog of laser scanning data already available from ecosystems around the globe. The ease of use will enable widespread adoption of the method outside of remote‐sensing experts, allowing greater accessibility for addressing ecological hypotheses and large‐scale ecosystem modeling efforts.

     