skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Prescreening-Based Subset Selection for Improving Predictions of Earth System Models With Application to Regional Prediction of Red Tide
We present the ensemble method of prescreening-based subset selection to improve ensemble predictions of Earth system models (ESMs). In the prescreening step, the independent ensemble members are categorized based on their ability to reproduce physically-interpretable features of interest that are regional and problem-specific. The ensemble size is then updated by selecting the subsets that improve the performance of the ensemble prediction using decision relevant metrics. We apply the method to improve the prediction of red tide along the West Florida Shelf in the Gulf of Mexico, which affects coastal water quality and has substantial environmental and socioeconomic impacts on the State of Florida. Red tide is a common name for harmful algal blooms that occur worldwide, which result from large concentrations of aquatic microorganisms, such as dinoflagellate Karenia brevis , a toxic single celled protist. We present ensemble method for improving red tide prediction using the high resolution ESMs of the Coupled Model Intercomparison Project Phase 6 (CMIP6) and reanalysis data. The study results highlight the importance of prescreening-based subset selection with decision relevant metrics in identifying non-representative models, understanding their impact on ensemble prediction, and improving the ensemble prediction. These findings are pertinent to other regional environmental management applications and climate services. Additionally, our analysis follows the FAIR Guiding Principles for scientific data management and stewardship such that data and analysis tools are findable, accessible, interoperable, and reusable. As such, the interactive Colab notebooks developed for data analysis are annotated in the paper. This allows for efficient and transparent testing of the results’ sensitivity to different modeling assumptions. Moreover, this research serves as a starting point to build upon for red tide management, using the publicly available CMIP, Coordinated Regional Downscaling Experiment (CORDEX), and reanalysis data.  more » « less
Award ID(s):
1939994
PAR ID:
10379676
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Earth Science
Volume:
10
ISSN:
2296-6463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This second consensus document builds on the first, providing updates on actions to address the initial recommendations and identifying additional actions that will advance management of red tide. The HAB Task Force continues to recommend actions that create improved understanding of red tide and translate it into enhanced management. Like its predecessor, this document is not intended to provide an exhaustive list of useful actions. The recommendations are meant to complement and support other efforts to set long-term goals and implement specific actions that minimize the harmful effects of red tide as well as a variety of other HABs that impact Florida, such as the work of the Blue-Green Algae Task Force. 
    more » « less
  2. Machine learning algorithms are often used to model and predict animal habitat selection—the relationships between animal occurrences and habitat characteristics. For broadly distributed species, habitat selection often varies among populations and regions; thus, it would seem preferable to fit region- or population-specific models of habitat selection for more accurate inference and prediction, rather than fitting large-scale models using pooled data. However, where the aim is to make range-wide predictions, including areas for which there are no existing data or models of habitat selection, how can regional models best be combined? We propose that ensemble approaches commonly used to combine different algorithms for a single region can be reframed, treating regional habitat selection models as the candidate models. By doing so, we can incorporate regional variation when fitting predictive models of animal habitat selection across large ranges. We test this approach using satellite telemetry data from 168 humpback whales across five geographic regions in the Southern Ocean. Using random forests, we fitted a large-scale model relating humpback whale locations, versus background locations, to 10 environmental covariates, and made a circumpolar prediction of humpback whale habitat selection. We also fitted five regional models, the predictions of which we used as input features for four ensemble approaches: an unweighted ensemble, an ensemble weighted by environmental similarity in each cell, stacked generalization, and a hybrid approach wherein the environmental covariates and regional predictions were used as input features in a new model. We tested the predictive performance of these approaches on an independent validation dataset of humpback whale sightings and whaling catches. These multiregional ensemble approaches resulted in models with higher predictive performance than the circumpolar naive model. These approaches can be used to incorporate regional variation in animal habitat selection when fitting range-wide predictive models using machine learning algorithms. This can yield more accurate predictions across regions or populations of animals that may show variation in habitat selection. 
    more » « less
  3. null (Ed.)
    Abstract The California Current System (CCS) sustains economically valuable fisheries and is particularly vulnerable to ocean acidification, due to its natural upwelling of carbon-enriched waters that generate corrosive conditions for local ecosystems. Here we use a novel suite of retrospective, initialized ensemble forecasts with an Earth system model (ESM) to predict the evolution of surface pH anomalies in the CCS. We show that the forecast system skillfully predicts observed surface pH variations a year in advance over a naive forecasting method, with the potential for skillful prediction up to five years in advance. Skillful predictions of surface pH are mainly derived from the initialization of dissolved inorganic carbon anomalies that are subsequently transported into the CCS. Our results demonstrate the potential for ESMs to provide skillful predictions of ocean acidification on large scales in the CCS. Initialized ESMs could also provide boundary conditions to improve high-resolution regional forecasting systems. 
    more » « less
  4. The circular economy (CE) seeks to maintain products and materials at their highest utility and value. The organisational and governmental policy have seised onto the CE philosophy to advance socio-economic and environmental development. CE remains an essentially contested concept – making its utilisation as a foundation for managerial and policy decisions challenging. Circularity assessment has not been systematically adopted, especially within supply chain management. Using critical scholarly and practical evidential foundation, we proposed a comprehensive set of metrics that can be utilised in supplier selection, monitoring, and development for circularity. These metrics include the macro, meso, and micro levels. A group decision-making method integrating best-worst method (BWM), regret theory (RT), and dual hesitant fuzzy sets (DHFS) for circular economy and circularity (CEC) supplier evaluation and selection is introduced – providing instrumental value for the identified metrics typology. The proposed BWM-DHFE-RT integrative analytical method can accommodate decisionmaker psychological behavior under uncertainty while simultaneously capturing divergent or conflicting opinions of different decision-makers. An illustrative business scenario is utilized to demonstrate the application of the proposed method. Though the proposed CE performance metrics and methodology are used for CEC supplier management reasons they have broader applicability. Future research and application directions are discussed. 
    more » « less
  5. Abstract Environmental decisions with substantial social and environmental implications are regularly informed by model predictions, incurring inevitable uncertainty. The selection of a set of model predictions to inform a decision is usually based on model performance, measured by goodness‐of‐fit metrics. Yet goodness‐of‐fit metrics have a questionable relationship to a model's value to end users, particularly when validation data are themselves uncertain. For example, decisions based on flow frequency models are not necessarily improved by adopting models with the best overall goodness of fit. We propose an alternative model evaluation approach based on the conditional value of sample information, first defined in 1961, which has found extensive use in sampling design optimization but which has not previously been used for model evaluation. The metric uses observations from a validation set to estimate the expected monetary costs associated with model prediction uncertainties. A model is only considered superior to alternatives if (i) its predictions reduce these costs and (ii) sufficient validation data are available to distinguish its performance from alternative models. By describing prediction uncertainties in monetary terms, the metric facilitates the communication of prediction uncertainty by end users, supporting the inclusion of uncertainty analysis in decision making. 
    more » « less