skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Analyzing collective motion with machine learning and topology
We use topological data analysis and machine learning to study a seminal model of collective motion in biology [M. R. D’Orsogna et al., Phys. Rev. Lett. 96, 104302 (2006)]. This model describes agents interacting nonlinearly via attractive-repulsive social forces and gives rise to collective behaviors such as flocking and milling. To classify the emergent collective motion in a large library of numerical simulations and to recover model parameters from the simulation data, we apply machine learning techniques to two different types of input. First, we input time series of order parameters traditionally used in studies of collective motion. Second, we input measures based on topology that summarize the time-varying persistent homology of simulation data over multiple scales. This topological approach does not require prior knowledge of the expected patterns. For both unsupervised and supervised machine learning methods, the topological approach outperforms the one that is based on traditional order parameters.  more » « less
Award ID(s):
1813752
PAR ID:
10588623
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
American Institute of Physics
Date Published:
Journal Name:
Chaos: An Interdisciplinary Journal of Nonlinear Science
Volume:
29
Issue:
12
ISSN:
1054-1500
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract This study introduces a hybrid model that utilizes a model-based optimization method to generate training data and an artificial neural network (ANN)-based learning method to offer real-time exoskeleton support in lifting activities. For the model-based optimization method, the torque of the knee exoskeleton and the optimal lifting motion are predicted utilizing a two-dimensional (2D) human–exoskeleton model. The control points for exoskeleton motor current profiles and human joint angle profiles from cubic B-spline interpolation represent the design variables. Minimizing the square of the normalized human joint torque is considered as the cost function. Subsequently, the lifting optimization problem is tackled using a sequential quadratic programming (SQP) algorithm in sparse nonlinear optimizer (SNOPT). For the learning-based approach, the learning-based control model is trained using the general regression neural network (GRNN). The anthropometric parameters of the human subjects and lifting boundary postures are used as input parameters, while the control points for exoskeleton torque are treated as output parameters. Once trained, the learning-based control model can provide exoskeleton assistive torque in real time for lifting tasks. Two test subjects’ joint angles and ground reaction forces (GRFs) comparisons are presented between the experimental and simulation results. Furthermore, the utilization of exoskeletons significantly reduces activations of the four knee extensor and flexor muscles compared to lifting without the exoskeletons for both subjects. Overall, the learning-based control method can generate assistive torque profiles in real time and faster than the model-based optimal control approach. 
    more » « less
  2. While machine learning approaches are rapidly being applied to hydrologic problems, physics-informed approaches are still relatively rare. Many successful deep-learning applications have focused on point estimates of streamflow trained on stream gauge observations over time. While these approaches show promise for some applications, there is a need for distributed approaches that can produce accurate two-dimensional results of model states, such as ponded water depth. Here, we demonstrate a 2D emulator of the Tilted V catchment benchmark problem with solutions provided by the integrated hydrology model ParFlow. This emulator model can use 2D Convolution Neural Network (CNN), 3D CNN, and U-Net machine learning architectures and produces time-dependent spatial maps of ponded water depth from which hydrographs and other hydrologic quantities of interest may be derived. A comparison of different deep learning architectures and hyperparameters is presented with particular focus on approaches such as 3D CNN (that have a time-dependent learning component) and 2D CNN and U-Net approaches (that use only the current model state to predict the next state in time). In addition to testing model performance, we also use a simplified simulation based inference approach to evaluate the ability to calibrate the emulator to randomly selected simulations and the match between ML calibrated input parameters and underlying physics-based simulation. 
    more » « less
  3. Context. Machine-learning methods for predicting solar flares typically employ physics-based features that have been carefully cho- sen by experts in order to capture the salient features of the photospheric magnetic fields of the Sun. Aims. Though the sophistication and complexity of these models have grown over time, there has been little evolution in the choice of feature sets, or any systematic study of whether the additional model complexity leads to higher predictive skill. Methods. This study compares the relative prediction performance of four different machine-learning based flare prediction models with increasing degrees of complexity. It evaluates three different feature sets as input to each model: a “traditional” physics-based feature set, a novel “shape-based” feature set derived from topological data analysis (TDA) of the solar magnetic field, and a com- bination of these two sets. A systematic hyperparameter tuning framework is employed in order to assure fair comparisons of the models across different feature sets. Finally, principal component analysis is used to study the effects of dimensionality reduction on these feature sets. Results. It is shown that simpler models with fewer free parameters perform better than the more complicated models on the canonical 24-h flare forecasting problem. In other words, more complex machine-learning architectures do not necessarily guarantee better prediction performance. In addition, it is found that shape-based feature sets contain just as much useful information as physics-based feature sets for the purpose of flare prediction, and that the dimension of these feature sets – particularly the shape-based one – can be greatly reduced without impacting predictive accuracy. 
    more » « less
  4. Thenkabail, Prasad S. (Ed.)
    Physically based hydrologic models require significant effort and extensive information for development, calibration, and validation. The study explored the use of the random forest regression (RFR), a supervised machine learning (ML) model, as an alternative to the physically based Soil and Water Assessment Tool (SWAT) for predicting streamflow in the Rio Grande Headwaters near Del Norte, a snowmelt-dominated mountainous watershed of the Upper Rio Grande Basin. Remotely sensed data were used for the random forest machine learning analysis (RFML) and RStudio for data processing and synthesizing. The RFML model outperformed the SWAT model in accuracy and demonstrated its capability in predicting streamflow in this region. We implemented a customized approach to the RFR model to assess the model’s performance for three training periods, across 1991–2010, 1996–2010, and 2001–2010; the results indicated that the model’s accuracy improved with longer training periods, implying that the model trained on a more extended period is better able to capture the parameters’ variability and reproduce streamflow data more accurately. The variable importance (i.e., IncNodePurity) measure of the RFML model revealed that the snow depth and the minimum temperature were consistently the top two predictors across all training periods. The paper also evaluated how well the SWAT model performs in reproducing streamflow data of the watershed with a conventional approach. The SWAT model needed more time and data to set up and calibrate, delivering acceptable performance in annual mean streamflow simulation, with satisfactory index of agreement (d), coefficient of determination (R2), and percent bias (PBIAS) values, but monthly simulation warrants further exploration and model adjustments. The study recommends exploring snowmelt runoff hydrologic processes, dust-driven sublimation effects, and more detailed topographic input parameters to update the SWAT snowmelt routine for better monthly flow estimation. The results provide a critical analysis for enhancing streamflow prediction, which is valuable for further research and water resource management, including snowmelt-driven semi-arid regions. 
    more » « less
  5. Collective movement and organization of cell monolayers are important for wound healing and tissue development. Recent experiments highlighted the importance of liquid crystal order within these layers, suggesting that +1 topological defects have a role in organizing tissue morphogenesis. We study fibroblast organization, motion, and proliferation on a substrate with micron-sized ridges that induce +1 and −1 topological defects using simulation and experiment. We model cells as self-propelled deformable ellipses that interact via a Gay–Berne potential. Unlike earlier work on other cell types, we see that density variation near defects is not explained by collective migration. We propose instead that fibroblasts have different division rates depending on their area and aspect ratio. This model captures key features of our previous experiments: the alignment quality worsens at high cell density and, at the center of the +1 defects, cells can adopt either highly anisotropic or primarily isotropic morphologies. Experiments performed with different ridge heights confirm a prediction of this model: Suppressing migration across ridges promotes higher cell density at the +1 defect. Our work enables a mechanism for tissue patterning using topological defects without relying on cell migration. 
    more » « less