Title: Large-Scale High-Resolution Coastal Mangrove Forests Mapping Across West Africa With Machine Learning Ensemble and Satellite Big Data
Coastal mangrove forests provide important ecosystem goods and services, including carbon sequestration, biodiversity conservation, and hazard mitigation. However, they are being destroyed at an alarming rate by human activities. To characterize mangrove forest changes, evaluate their impacts, and support relevant protection and restoration decision making, accurate and up-to-date mangrove extent mapping at large spatial scales is essential. Available large-scale mangrove extent data products typically apply a single machine learning method to 30 m Landsat imagery, and significant inconsistencies remain among these data products. Given the huge amounts of satellite data involved and the heterogeneity of land surface characteristics across large geographic areas, finding the most suitable method for large-scale high-resolution mangrove mapping is a challenge. The objective of this study is to evaluate the performance of a machine learning ensemble for mangrove forest mapping at 20 m spatial resolution across West Africa using Sentinel-2 (optical) and Sentinel-1 (radar) imagery. The machine learning ensemble integrates three machine learning methods commonly used in land cover and land use mapping: Random Forest (RF), Gradient Boosting Machine (GBM), and Neural Network (NN). The cloud-based big geospatial data processing platform Google Earth Engine (GEE) was used for pre-processing Sentinel-2 and Sentinel-1 data. Extensive validation has demonstrated that the machine learning ensemble can generate mangrove extent maps at high accuracies for all study regions in West Africa (92%–99% Producer’s Accuracy, 98%–100% User’s Accuracy, 95%–99% Overall Accuracy). This is the first time that mangrove extent has been mapped at a 20 m spatial resolution across West Africa. The machine learning ensemble has the potential to be applied to other regions of the world and is therefore capable of producing high-resolution mangrove extent maps at global scales periodically.
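The three accuracy figures quoted in the abstract follow standard remote sensing definitions and can be illustrated with a small sketch. The function name `accuracy_metrics` and the confusion-matrix counts below are hypothetical and are not taken from the study; they only show how Producer's Accuracy (correct pixels divided by reference pixels of a class), User's Accuracy (correct pixels divided by pixels mapped as that class), and Overall Accuracy (all correct pixels divided by all pixels) relate to one another.

```python
def accuracy_metrics(matrix, class_index):
    """Compute producer's, user's, and overall accuracy.

    matrix[i][j] is the number of validation samples whose reference
    (ground truth) class is i and whose mapped (predicted) class is j.
    """
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    # Row sum: all reference samples of the class of interest.
    reference_total = sum(matrix[class_index])
    # Column sum: all samples the map assigned to the class of interest.
    mapped_total = sum(row[class_index] for row in matrix)
    hit = matrix[class_index][class_index]
    return {
        "producers": hit / reference_total,
        "users": hit / mapped_total,
        "overall": correct / total,
    }


# Hypothetical two-class validation counts (illustration only, not study data).
confusion = [
    [95, 5],   # reference mangrove: 95 mapped as mangrove, 5 as other
    [1, 99],   # reference other: 1 mapped as mangrove, 99 as other
]
metrics = accuracy_metrics(confusion, class_index=0)
print(metrics)
```

In this made-up example the mangrove class has a Producer's Accuracy of 95/100, a User's Accuracy of 95/96, and an Overall Accuracy of 194/200. The sketch assumes every class has at least one reference sample and one mapped sample; otherwise the divisions would need a guard.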
Award ID(s):
1841403
NSF-PAR ID:
10213803
Journal Name:
Frontiers in Earth Science
Volume:
8
ISSN:
2296-6463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Expansion of large-scale tree plantations for commodity crop and timber production is a leading cause of tropical deforestation. While automated detection of plantations across large spatial scales and with high temporal resolution is critical to inform policies to reduce deforestation, such mapping is technically challenging. Thus, most available plantation maps rely on visual inspection of imagery, and many of them are limited to small areas for specific years. Here, we present an automated approach, which we call Plantation Analysis by Learning from Multiple Classes (PALM), for mapping plantations on an annual basis using satellite remote sensing data. Due to the heterogeneity of land cover classes, PALM utilizes ensemble learning to simultaneously incorporate training samples from multiple land cover classes over different years. After the ensemble learning, we further improve the performance by post-processing using a Hidden Markov Model. We implement the proposed automated approach using MODIS data in Sumatra and Indonesian Borneo (Kalimantan). To validate the classification, we compare plantations detected using our approach with existing datasets developed through visual interpretation. Based on random sampling and comparison with high-resolution images, the user’s accuracy and producer’s accuracy of our generated map are around 85% and 80% in our study region. 
  2. In September of 2017, Hurricane Irma made landfall within the Rookery Bay National Estuarine Research Reserve of southwest Florida (USA) as a category 3 storm with winds in excess of 200 km h⁻¹. We mapped the extent of the hurricane’s impact on coastal land cover with a seasonal time series of satellite imagery. Very high-resolution (i.e., <5 m pixel) satellite imagery has proven effective to map wetland ecosystems, but challenges in data acquisition and storage, algorithm training, and image processing have prevented large-scale and time-series mapping of these data. We describe our approach to address these issues to evaluate Rookery Bay ecosystem damage and recovery using 91 WorldView-2 satellite images collected between 2010 and 2018 mapped using automated techniques and validated with a field campaign. Land cover was classified seasonally at 2 m resolution (i.e., healthy mangrove, degraded mangrove, upland, soil, and water) with an overall accuracy of 82%. Digital change detection methods show that hurricane-related degradation was 17% of mangrove forest (~5 km²). Approximately 35% (1.7 km²) of this loss recovered one year after Hurricane Irma. The approach completed the mapping approximately 200 times faster than existing methods, illustrating the ease with which regional high-resolution mapping may be accomplished efficiently.
  3. Urban flooding is a major natural disaster that poses a serious threat to the urban environment. Near real-time mapping of flood extent is in high demand for disaster rescue and relief missions, reconstruction efforts, and financial loss evaluation. Many efforts have been made to identify flooding zones with remote sensing data and image processing techniques. Unfortunately, the near real-time production of accurate flood maps over impacted urban areas has not been well investigated due to three major issues. (1) Satellite imagery with high spatial resolution over urban areas usually has a nonhomogeneous background due to different types of objects such as buildings, moving vehicles, and road networks. As such, classical machine learning approaches can hardly model the spatial relationship between sample pixels in the flooding area. (2) Handcrafted features associated with the data are usually required as input for conventional flood mapping models, which may not be able to fully utilize the underlying patterns in the large amount of available data. (3) High-resolution optical imagery often has varied pixel digital numbers (DNs) for the same ground objects as a result of highly inconsistent illumination conditions during a flood. Accordingly, traditional methods of flood mapping have major limitations in generalization based on testing data. To address the aforementioned issues in urban flood mapping, we developed a patch similarity convolutional neural network (PSNet) using satellite multispectral surface reflectance imagery before and after flooding with a spatial resolution of 3 meters. We used spectral reflectance instead of raw pixel DNs so that the influence of inconsistent illumination caused by varied weather conditions at the time of data collection can be greatly reduced. Such consistent spectral reflectance data also enhance the generalization capability of the proposed model.
Experiments on the high resolution imagery before and after the urban flooding events (i.e., the 2017 Hurricane Harvey and the 2018 Hurricane Florence) showed that the developed PSNet can produce urban flood maps with consistently high precision, recall, F1 score, and overall accuracy compared with baseline classification models including support vector machine, decision tree, random forest, and AdaBoost, which were often poor in either precision or recall. The study paves the way to fuse bi-temporal remote sensing images for near real-time precision damage mapping associated with other types of natural hazards (e.g., wildfires and earthquakes). 
  4. Improving high-resolution (meter-scale) mapping of snow-covered areas in complex and forested terrains is critical to understanding the responses of species and water systems to climate change. Commercial high-resolution imagery from Planet Labs, Inc. (Planet, San Francisco, CA, USA) can be used in environmental science, as it has both high spatial (0.7–3.0 m) and temporal (1–2 day) resolution. Deriving snow-covered areas from Planet imagery using traditional radiometric techniques has limitations due to the lack of a shortwave infrared band that is needed to fully exploit the difference in reflectance to discriminate between snow and clouds. However, recent work demonstrated that snow cover area (SCA) can be successfully mapped using only the PlanetScope 4-band (Red, Green, Blue and NIR) reflectance products and a machine learning (ML) approach based on convolutional neural networks (CNN). To evaluate how additional features improve the existing model performance, we: (1) build on previous work to augment a CNN model with additional input data including vegetation metrics (Normalized Difference Vegetation Index) and DEM-derived metrics (elevation, slope and aspect) to improve SCA mapping in forested and open terrain, (2) evaluate the model performance at two geographically diverse sites (Gunnison, Colorado, USA and Engadin, Switzerland), and (3) evaluate the model performance over different land-cover types. The best augmented model used the Normalized Difference Vegetation Index (NDVI) along with visible (red, green, and blue) and NIR bands, with an F-score of 0.89 (Gunnison) and 0.93 (Engadin) and was found to be 4% and 2% better than when using canopy height- and terrain-derived measures at Gunnison, respectively. The NDVI-based model improves not only upon the original band-only model’s ability to detect snow in forests, but also across various other land-cover types (gaps and canopy edges).
We examined the model’s performance in forested areas using three forest canopy quantification metrics and found that augmented models can better identify snow in canopy edges and open areas but still underpredict snow cover under forest canopies. While the new features improve model performance over band-only options, the models still have challenges identifying the snow under trees in dense forests, with performance varying as a function of the geographic area. The improved high-resolution snow maps in forested environments can support studies involving climate change effects on mountain ecosystems and evaluations of hydrological impacts in snow-dominated river basins. 
  5. Grassland monitoring can be challenging because it is time-consuming and expensive to measure grass condition at large spatial scales. Remote sensing offers a time- and cost-effective method for mapping and monitoring grassland condition at both large spatial extents and fine temporal resolutions. Combinations of remotely sensed optical and radar imagery are particularly promising because together they can measure differences in moisture, structure, and reflectance among land cover types. We combined multi-date radar (PALSAR-2 and Sentinel-1) and optical (Sentinel-2) imagery with field data and visual interpretation of aerial imagery to classify land cover in the Masai Mara National Reserve, Kenya using machine learning (Random Forests). This study area comprises a diverse array of land cover types and changes over time due to seasonal changes in precipitation, seasonal movements of large herds of resident and migratory ungulates, fires, and livestock grazing. We classified twelve land cover types with user’s and producer’s accuracies ranging from 66% to 100% and an overall accuracy of 86%. These methods were able to distinguish among short, medium, and tall grass cover at user’s accuracies of 83%, 82%, and 85%, respectively. By yielding a highly accurate, fine-resolution map that distinguishes among grasses of different heights, this work not only outlines a viable method for future grassland mapping efforts but also will help inform local management decisions and research in the Masai Mara National Reserve.