skip to main content


This content will become publicly available on September 1, 2024

Title: A machine learning approach to predict cellular mechanical stresses in response to chemical perturbation
Mechanical stresses generated at the cell-cell level and cell-substrate level have been suggested to be important in a host of physiological and pathological processes. However, the influence various chemical compounds have on the mechan- ical stresses mentioned above is poorly understood, hindering the discovery of novel therapeutics, and representing a barrier in the field. To overcome this barrier, we implemented two approaches: 1) monolayer boundary predictor and 2) discretized window predictor utilizing either stepwise linear regression or quadratic support vector machine machine learning model to predict the dose-dependent response of tractions and intercellular stresses to chemical perturbation. We used experimental traction and intercellular stress data gathered from samples subject to 0.2 or 2 mg/mL drug concentrations along with cell morphological prop- erties extracted from the bright-field images as predictors to train our model. To demonstrate the predictive capability of our ma- chine learning models, we predicted tractions and intercellular stresses in response to 0 and 1 mg/mL drug concentrations which were not utilized in the training sets. Results revealed the discretized window predictor trained just with four samples (292 im- ages) to best predict both intercellular stresses and tractions using the quadratic support vector machine and stepwise linear regression models, respectively, for the unseen sample images.  more » « less
Award ID(s):
2045750
NSF-PAR ID:
10492893
Author(s) / Creator(s):
; ;
Publisher / Repository:
Elsivier
Date Published:
Journal Name:
Biophysical Journal
Volume:
122
Issue:
17
ISSN:
0006-3495
Page Range / eLocation ID:
3413 to 3424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Abstract Measuring soil health indicators (SHIs), particularly soil total nitrogen (TN), is an important and challenging task that affects farmers’ decisions on timing, placement, and quantity of fertilizers applied in the farms. Most existing methods to measure SHIs are in-lab wet chemistry or spectroscopy-based methods, which require significant human input and effort, time-consuming, costly, and are low-throughput in nature. To address this challenge, we develop an artificial intelligence (AI)-driven near real-time unmanned aerial vehicle (UAV)-based multispectral sensing solution (UMS) to estimate soil TN in an agricultural farm. TN is an important macro-nutrient or SHI that directly affects the crop health. Accurate prediction of soil TN can significantly increase crop yield through informed decision making on the timing of seed planting, and fertilizer quantity and timing. The ground-truth data required to train the AI approaches is generated via laser-induced breakdown spectroscopy (LIBS), which can be readily used to characterize soil samples, providing rapid chemical analysis of the samples and their constituents (e.g., nitrogen, potassium, phosphorus, calcium). Although LIBS was previously applied for soil nutrient detection, there is no existing study on the integration of LIBS with UAV multispectral imaging and AI. We train two machine learning (ML) models including multi-layer perceptron regression and support vector regression to predict the soil nitrogen using a suite of data classes including multispectral characteristics of the soil and crops in red (R), near-infrared, and green (G) spectral bands, computed vegetation indices (NDVI), and environmental variables including air temperature and relative humidity (RH). To generate the ground-truth data or the training data for the machine learning models, we determine the N spectrum of the soil samples (collected from a farm) using LIBS and develop a calibration model using the correlation between actual TN of the soil samples and the maximum intensity of N spectrum. In addition, we extract the features from the multispectral images captured while the UAV follows an autonomous flight plan, at different growth stages of the crops. The ML model’s performance is tested on a fixed configuration space for the hyper-parameters using various hyper-parameter optimization techniques at three different wavelengths of the N spectrum. 
    more » « less
  2. Machine learning (ML) methods, such as artificial neural networks (ANN), k-nearest neighbors (kNN), random forests (RF), support vector machines (SVM), and boosted decision trees (DTs), may offer stronger predictive performance than more traditional, parametric methods, such as linear regression, multiple linear regression, and logistic regression (LR), for specific mapping and modeling tasks. However, this increased performance is often accompanied by increased model complexity and decreased interpretability, resulting in critiques of their “black box” nature, which highlights the need for algorithms that can offer both strong predictive performance and interpretability. This is especially true when the global model and predictions for specific data points need to be explainable in order for the model to be of use. Explainable boosting machines (EBM), an augmentation and refinement of generalize additive models (GAMs), has been proposed as an empirical modeling method that offers both interpretable results and strong predictive performance. The trained model can be graphically summarized as a set of functions relating each predictor variable to the dependent variable along with heat maps representing interactions between selected pairs of predictor variables. In this study, we assess EBMs for predicting the likelihood or probability of slope failure occurrence based on digital terrain characteristics in four separate Major Land Resource Areas (MLRAs) in the state of West Virginia, USA and compare the results to those obtained with LR, kNN, RF, and SVM. EBM provided predictive accuracies comparable to RF and SVM and better than LR and kNN. The generated functions and visualizations for each predictor variable and included interactions between pairs of predictor variables, estimation of variable importance based on average mean absolute scores, and provided scores for each predictor variable for new predictions add interpretability, but additional work is needed to quantify how these outputs may be impacted by variable correlation, inclusion of interaction terms, and large feature spaces. Further exploration of EBM is merited for geohazard mapping and modeling in particular and spatial predictive mapping and modeling in general, especially when the value or use of the resulting predictions would be greatly enhanced by improved interpretability globally and availability of prediction explanations at each cell or aggregating unit within the mapped or modeled extent. 
    more » « less
  3. Autonomous chemical process development and optimization methods use algorithms to explore the operating parameter space based on feedback from experimentally determined exit stream compositions. Measuring the compositions of multicomponent streams is challenging, requiring multiple analytical techniques to differentiate between similar chemical components in the mixture and determine their concentration. Herein, we describe a universal analytical methodology based on multitarget regression machine learning (ML) models to rapidly determine chemical mixtures' compositions from Fourier transform infrared (FTIR) absorption spectra. Specifically, we used simulated FTIR spectra for up to 6 components in water and tested seven different ML algorithms to develop the methodology. All algorithms resulted in regression models with mean absolute errors (MAE) between 0–0.27 wt%. We validated the methodology with experimental data obtained on mixtures prepared using a network of programmable pumps in line with an FTIR transmission flow cell. ML models were trained using experimental data and evaluated for mixtures of up to 4-components with similar chemical structures, including alcohols ( i.e. , glycerol, isopropanol, and 1-butanol) and nitriles ( i.e. , acrylonitrile, adiponitrile, and propionitrile). Linear regression models predicted concentrations with coefficients of determination, R 2 , between 0.955 and 0.986, while artificial neural network models showed a slightly lower accuracy, with R 2 between 0.854 and 0.977. These R 2 correspond to MAEs of 0.28–0.52 wt% for mixtures with component concentrations between 4–10 wt%. Thus, we demonstrate that ML models can accurately determine the compositions of multicomponent mixtures of similar species, enhancing spectroscopic chemical quantification for use in autonomous, fast process development and optimization. 
    more » « less
  4. Abstract

    An intelligent sensing framework using Machine Learning (ML) and Deep Learning (DL) architectures to precisely quantify dielectrophoretic force invoked on microparticles in a textile electrode-based DEP sensing device is reported. The prediction accuracy and generalization ability of the framework was validated using experimental results. Images of pearl chain alignment at varying input voltages were used to build deep regression models using modified ML and CNN architectures that can correlate pearl chain alignment patterns of Saccharomyces cerevisiae(yeast) cells and polystyrene microbeads to DEP force. Various ML models such as K-Nearest Neighbor, Support Vector Machine, Random Forest, Neural Networks, and Linear Regression along with DL models such as Convolutional Neural Network (CNN) architectures of AlexNet, ResNet-50, MobileNetV2, and GoogLeNet have been analyzed in order to build an effective regression framework to estimate the force induced on yeast cells and microbeads. The efficiencies of the models were evaluated using Mean Absolute Error, Mean Absolute Relative, Mean Squared Error, R-squared, and Root Mean Square Error (RMSE) as evaluation metrics. ResNet-50 with RMSPROP gave the best performance, with a validation RMSE of 0.0918 on yeast cells while AlexNet with ADAM optimizer gave the best performance, with a validation RMSE of 0.1745 on microbeads. This provides a baseline for further studies in the application of deep learning in DEP aided Lab-on-Chip devices.

     
    more » « less
  5. Abstract

    Boreal forest heights are associated with global carbon stocks and energy budgets. The launch of the Advanced Topographic Laser Altimeter System (ATLAS) onboard the NASA's Ice, Cloud and Land Elevation Satellite (ICESat‐2) enables canopy vertical structure measurement at a global scale. However, with a photon‐counting laser system, ICESat‐2 contains high uncertainties in the estimated canopy heights, requiring appropriate quality control before being applied to canopy height modelling.

    We adopted a multivariate quality control approach (i.e. the Cook's distance) to improve the quality of ICESat‐2 samples. The controlled ICESat‐2 data were then input as the response variable for predicting boreal forest heights based on spatially continuous satellite data and machine learning (ML) regression models. The examined ML regressors include artificial neural networks (ANN), gradient boosting machine (GBM), random forest (RF) and support vector regression (SVR).

    The proposed quality control effectively removes low‐quality ICESat‐2 samples and enhances the correlations between ICESat‐2 and airborne laser scanning (ALS) observations. Moreover, the controlled ICESat‐2 samples help achieve a trade‐off between sample quality and quantity for all ML regressors, generating close canopy heights to ALS‐derived counterparts. Overall, RF and GBM make better canopy height predictions than ANN and SVR. Of all explanatory variables, the normalized difference vegetation index calculated based on the first red edge band of Sentinel‐2 (NDVIredEdge1) is considered the most important.

    The proposed quality control on ICESat‐2 sample selection and canopy height model (CHM) development workflow will greatly benefit forest structure investigations in the Arctic community.

     
    more » « less