skip to main content


Title: Clustering and classification of vertical movement profiles for ecological inference of behavior
Abstract

Vertical movements can expose individuals to rapid changes in physical and trophic environments—for aquatic fauna, dive profiles from biotelemetry data can be used to quantify and categorize vertical movements. Inferences on classes of vertical movement profiles typically rely on subjective summaries of parameters or statistical clustering techniques that utilize Euclidean matching of vertical movement profiles with vertical observation points. These approaches are prone to subjectivity, error, and bias. We used machine learning approaches on a large dataset of vertical time series (N = 28,217 dives) for 31 post‐nesting leatherback turtles (Dermochelys coriacea). We applied dynamic time warp (DTW) clustering to group vertical movement (dive) time series by their metrics (depth and duration) into an optimal number of clusters. We then identified environmental covariates associated with each cluster using a generalized additive mixed‐effects model (GAMM). A convolutional neural network (CNN) model, trained on standard dive shape types from the literature, was used to classify dives within each DTW cluster by their shape. Two clusters were identified with the DTW approach—these varied in their spatial and temporal distributions, with dependence on environmental covariates, sea surface temperature, bathymetry, sea surface height anomaly, and time‐lagged surface chlorophyllaconcentrations. CNN classification accuracy of the five standard dive profiles was 95%. Subsequent analyses revealed that the two clusters differed in their composition of standard dive shapes, with each cluster dominated by shapes indicative of distinct behaviors (pelagic foraging and exploration, respectively). The use of these two machine learning approaches allowed for discrete behaviors to be identified from vertical time series data, first by clustering vertical movements by their movement metrics (DTW) and second by classifying dive profiles within each cluster by their shapes (CNN). Statistical inference for the identified clusters found distinct relationships with environmental covariates, supporting hypotheses of vertical niche switching and vertically structured foraging behavior. This approach could be similarly applied to the time series of other animals utilizing the vertical dimension in their movements, including aerial, arboreal, and other aquatic species, to efficiently identify different movement behaviors and inform habitat models.

 
more » « less
Award ID(s):
1915347
NSF-PAR ID:
10443404
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecosphere
Volume:
14
Issue:
1
ISSN:
2150-8925
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Objective Severe infection can lead to organ dysfunction and sepsis. Identifying subphenotypes of infected patients is essential for personalized management. It is unknown how different time series clustering algorithms compare in identifying these subphenotypes. Materials and Methods Patients with suspected infection admitted between 2014 and 2019 to 4 hospitals in Emory healthcare were included, split into separate training and validation cohorts. Dynamic time warping (DTW) was applied to vital signs from the first 8 h of hospitalization, and hierarchical clustering (DTW-HC) and partition around medoids (DTW-PAM) were used to cluster patients into subphenotypes. DTW-HC, DTW-PAM, and a previously published group-based trajectory model (GBTM) were evaluated for agreement in subphenotype clusters, trajectory patterns, and subphenotype associations with clinical outcomes and treatment responses. Results There were 12 473 patients in training and 8256 patients in validation cohorts. DTW-HC, DTW-PAM, and GBTM models resulted in 4 consistent vitals trajectory patterns with significant agreement in clustering (71–80% agreement, P < .001): group A was hyperthermic, tachycardic, tachypneic, and hypotensive. Group B was hyperthermic, tachycardic, tachypneic, and hypertensive. Groups C and D had lower temperatures, heart rates, and respiratory rates, with group C normotensive and group D hypotensive. Group A had higher odds ratio of 30-day inpatient mortality (P < .01) and group D had significant mortality benefit from balanced crystalloids compared to saline (P < .01) in all 3 models. Discussion DTW- and GBTM-based clustering algorithms applied to vital signs in infected patients identified consistent subphenotypes with distinct clinical outcomes and treatment responses. Conclusion Time series clustering with distinct computational approaches demonstrate similar performance and significant agreement in the resulting subphenotypes. 
    more » « less
  2. Abstract

    Conservation of migratory species exhibiting wide‐ranging and multidimensional behaviors is challenged by management efforts that only utilize horizontal movements or produce static spatial–temporal products. For the deep‐diving, critically endangered eastern Pacific leatherback turtle, tools that predict where turtles have high risks of fisheries interactions are urgently needed to prevent further population decline. We incorporated horizontal–vertical movement model results with spatial–temporal kernel density estimates and threat data (gear‐specific fishing) to develop monthly maps of spatial risk. Specifically, we applied multistate hidden Markov models to a biotelemetry data set (n = 28 leatherback tracks, 2004–2007). Tracks with dive information were used to characterize turtle behavior as belonging to 1 of 3 states (transiting, residential with mixed diving, and residential with deep diving). Recent fishing effort data from Global Fishing Watch were integrated with predicted behaviors and monthly space‐use estimates to create maps of relative risk of turtle–fisheries interactions. Drifting (pelagic) longline fishing gear had the highest average monthly fishing effort in the study region, and risk indices showed this gear to also have the greatest potential for high‐risk interactions with turtles in a residential, deep‐diving behavioral state. Monthly relative risk surfaces for all gears and behaviors were added to South Pacific TurtleWatch (SPTW) (https://www.upwell.org/sptw), a dynamic management tool for this leatherback population. These modifications will refine SPTW's capability to provide important predictions of potential high‐risk bycatch areas for turtles undertaking specific behaviors. Our results demonstrate how multidimensional movement data, spatial–temporal density estimates, and threat data can be used to create a unique conservation tool. These methods serve as a framework for incorporating behavior into similar tools for other aquatic, aerial, and terrestrial taxa with multidimensional movement behaviors.

     
    more » « less
  3. Modeling corrosion growth for complex systems such as the oil refinery system is a major challenge since the corrosion process of oil and gas pipelines are inherently stochastic and depends on many factors including exposures to environmental conditions, operating conditions, and electrochemical reactions. Moreover, the number of sensors is usually limited, and sensor data are incomplete and scattering, which hinders the capability of capturing the corrosion growth behaviors. Therefore, this paper proposes Multi-sensor Corrosion Growth Model with Latent Variables to predict the corrosion growth process in oil refinery piping. The proposed model is a combination of the hierarchical clustering algorithm and the vector autoregression (VAR) model. The clustering algorithm aims to find the hidden (i.e., latent) data clusters of the measured time series data, from which the time series from the same cluster will be included in the VAR model to predict the corrosion depth from multiple sensors. The model can capture the relationship between sensor time series data and identify latent variables. A real case study of an oil refinery system, in which in-line inspection (ILI) data were collected, was utilized to validate model. Regarding corrosion growth prediction, the paper compared the prediction accuracy of VAR model with other three forms of power law model, which is widely accepted to expect the time-dependent depth of corrosion such as power function (PF), PF with initiation time of corrosion (PFIT), and PF with initiation time of corrosion and covariates (PFCOV). The results showed that VAR model has the lowest prediction error based on the mean absolute percentage error (MAPE) evaluation for test data. Finally, the proposed model is believed to be useful for dealing with a complex system that has a variety of corrosion growth behaviors, such as the oil refinery system, as well as it can be applied in other real-time applications. 
    more » « less
  4. Animals that display plasticity in behavioral, ecological, and morphological traits are better poised to cope with environmental disturbances. Here, we examined individual plasticity and intraspecific variation in the morphometrics, movement patterns, and dive behavior of an enigmatic apex predator, the leopard seal ( Hydrurga leptonyx ). Satellite/GPS tags and time-depth recorders were deployed on 22 leopard seals off the Western Antarctic Peninsula. Adult female leopard seals were significantly larger (454±59 kg) and longer (302±11 cm) than adult males (302±22 kg, 276±11 cm). As females were 50% larger than their male counterparts, leopard seals are therefore one of the most extreme examples of female-biased sexual size dimorphism in marine mammals. Female leopard seals also spent more time hauled-out on land and ice than males. In the austral spring/summer, three adult female leopard seals hauled-out on ice for 10+ days, which likely represent the first satellite tracks of parturition and lactation for the species. While we found sex-based differences in morphometrics and haul-out durations, other variables, including maximum distance traveled and dive parameters, did not vary by sex. Regardless of sex, some leopard seals remained in near-shore habitats, traveling less than 50 kilometers, while other leopard seals traveled up to 1,700 kilometers away from the tagging location. Overall, leopard seals were short (3.0±0.7 min) and shallow (29±8 m) divers. However, within this general pattern, some individual leopard seals primarily used short, shallow dives, while others switched between short, shallow dives and long, deep dives. We also recorded the single deepest and longest dive made by any leopard seal—1, 256 meters for 25 minutes. Together, our results showcased high plasticity among leopard seals tagged in a single location. These flexible behaviors and traits may offer leopard seals, an ice-associated apex predator, resilience to the rapidly changing Southern Ocean. 
    more » « less
  5. Objective: Slaughterhouse data has recently been used to enhance animal disease surveillance in many countries, however has been largely underused for syndromic surveillance in the United States. We characterize spatiotemporal patterns and system dynamics of whole carcass swine condemnations in the US. We illustrate the value of data mining and machine learning approaches to more cost-effectively identify: emerging trends by condemnation reason, areas and time periods with higher than predicted condemnation rates, and regions or time periods with similar trends. Methods: Swine slaughter and condemnation data from 2005-2016 were obtained for slaughterhouses inspected by the Food Safety and Inspection Service (FSIS). Time series of condemnation rates by condemnation reason, type of pig, state and month were generated. Data time warping (DTW) and hierarchical clustering methods were used to identify states with similar patterns in the rate of condemnation cases by cause and type of pig. Spatiotemporal scan statistics were used to identify states and months with significantly higher number of condemnation cases than expected. Clusters were compared to historic infectious disease outbreaks in the swine industry. Results: Between 2005-2016, 1,109,300 whole swine carcasses were condemned. The top causes for condemnation were abscess/pyemia, septicemia, pneumonia, icterus, and peritonitis, respectively. DTW and cluster analysis revealed clear spatiotemporal patterns in the rate of condemnations, many with a strong seasonal component. Several clusters were detected in timeframes where widespread outbreaks had occurred. Conclusions: Timely evaluation of spatiotemporal patterns in swine condemnations may provide critical information in predicting disease outbreaks. Identification of spatiotemporal hot spots can direct investigation of primary on-farm risk factors contributing to condemnation. Risk mitigation through targeted decision-making and improved management practices can minimize carcass condemnations and animal losses, improving economic efficiency, profitability and sustainability of the US swine industry 
    more » « less