skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Cell morphology-based machine learning models for human cell state classification
Herein, we implement and access machine learning architectures to ascertain models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a treatment and then classified using a conventional Annexin V/propidium iodide (PI)-staining assay. The apoptotic cells were defined as Annexin V-positive cells, which include early and late apoptotic cells, necrotic cells, as well as other dying or dead cells. In addition to fluorescent signal, we collected cell size and granularity information from the FSC and SSC parameters. Both parameters are subdivided into area, height, and width, thus providing a total of six numerical features that informed and trained our models. A collection of logistical regression, random forest, k-nearest neighbor, multilayer perceptron, and support vector machine was trained and tested for classification performance in predicting cell states using only the six aforementioned numerical features. Out of 1046 candidate models, a multilayer perceptron was chosen with 0.91 live precision, 0.93 live recall, 0.92 live f value and 0.97 live area under the ROC curve when applied on standardized data. We discuss and highlight differences in classifier performance and compare the results to the standard practice of forward and side scatter gating, typically performed to select cells based on size and/or complexity. We demonstrate that our model, a ready-to-use module for any flow cytometry-based analysis, can provide automated, reliable, and stain-free classification of healthy and apoptotic cells using exclusively size and granularity information.  more » « less
Award ID(s):
2029121
PAR ID:
10233562
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
npj systems biology and applications
Volume:
7
Issue:
23
ISSN:
2056-7189
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Detection and quantification of bacterial endotoxins is important in a range of health-related contexts, including during pharmaceutical manufacturing of therapeutic proteins and vaccines. Here we combine experimental measurements based on nematic liquid crystalline droplets and machine learning methods to show that it is possible to classify bacterial sources ( Escherichia coli , Pseudomonas aeruginosa , Salmonella minnesota ) and quantify concentration of endotoxin derived from all three bacterial species present in aqueous solution. The approach uses flow cytometry to quantify, in a high-throughput manner, changes in the internal ordering of micrometer-sized droplets of nematic 4-cyano-4′-pentylbiphenyl triggered by the endotoxins. The changes in internal ordering alter the intensities of light side-scattered (SSC, large-angle) and forward-scattered (FSC, small-angle) by the liquid crystal droplets. A convolutional neural network (Endonet) is trained using the large data sets generated by flow cytometry and shown to predict endotoxin source and concentration directly from the FSC/SSC scatter plots. By using saliency maps, we reveal how EndoNet captures subtle differences in scatter fields to enable classification of bacterial source and quantification of endotoxin concentration over a range that spans eight orders of magnitude (0.01 pg mL −1 to 1 μg mL −1 ). We attribute changes in scatter fields with bacterial origin of endotoxin, as detected by EndoNet, to the distinct molecular structures of the lipid A domains of the endotoxins derived from the three bacteria. Overall, we conclude that the combination of liquid crystal droplets and EndoNet provides the basis of a promising analytical approach for endotoxins that does not require use of complex biologically-derived reagents ( e.g. , Limulus amoebocyte lysate). 
    more » « less
  2. Abstract Cellular biophysical metrics exhibit systematic alterations during processes, such as metastasis and immune cell activation, which can be used to identify and separate live cell subpopulations for targeting drug screening. Image‐based biophysical cytometry under extensional flows can accurately quantify cell deformability based on cell shape alterations but needs extensive image reconstruction, which limits its inline utilization to activate cell sorting. Impedance cytometry can measure these cell shape alterations based on electric field screening, while its frequency response offers functional information on cell viability and interior structure, which are difficult to discern by imaging. Furthermore, 1‐D temporal impedance signal trains exhibit characteristic shapes that can be rapidly templated in near real‐time to extract single‐cell biophysical metrics to activate sorting. We present a multilayer perceptron neural network signal templating approach that utilizes raw impedance signals from cells under extensional flow, alongside its training with image metrics from corresponding cells to derive net electrical anisotropy metrics that quantify cell deformability over wide anisotropy ranges and with minimal errors from cell size distributions. Deformability and electrical physiology metrics are applied in conjunction on the same cell for multiparametric classification of live pancreatic cancer cells versus cancer associated fibroblasts using the support vector machine model. 
    more » « less
  3. Obtaining useful insights from machine learning models trained on experimental datasets collected across different groups to improve the sustainability of chemical processes can be challenging due to the small size and heterogeneity of the dataset. Here we show that shallow learning models such as decision trees and random forest algorithms can be an effective tool for guiding experimental research in the sustainable chemistry field. This study trained four different machine learning algorithms (linear regression, decision tree, random forest, and multilayer perceptron) using different sized datasets containing up to 520 unique reaction conditions for the nitrogen reduction reaction (NRR) on heterogeneous electrocatalysts. Using the catalyst properties and experimental conditions as the features, we determined the ability of each model to regress the ammonia production rate and the faradaic efficiency. We observed that the shallow learning decision tree and random forest models had equal or better predictive power compared to the deep learning multilayer perceptron models and the simple linear regression models. Moreover, decision tree and random forest models enable the extraction of feature importance, which is a powerful tool in guiding experimental research. Analysis of the models showed the complex interaction between the applied potential and catalysts on the effective rate for the NRR. We also suggest some underexplored catalysts–electrolyte combinations to experimental researchers looking to improve both the rate and efficiency of the NRR reaction. 
    more » « less
  4. Abstract Two common hemoglobinopathies, sickle cell disease (SCD) and β-thalassemia, arise from genetic mutations within the β-globin gene. In this work, we identified a 500-bp motif (Fetal Chromatin Domain, FCD) upstream of human ϒ-globin locus and showed that the removal of this motif using CRISPR technology reactivates the expression of ϒ-globin. Next, we present two different cell morphology-based machine learning approaches that can be used identify human blood cells (KU-812) that harbor CRISPR-mediated FCD genetic modifications. Three candidate models from the first approach, which uses multilayer perceptron algorithm (MLP 20-26, MLP26-18, and MLP 30-26) and flow cytometry-derived cellular data, yielded 0.83 precision, 0.80 recall, 0.82 accuracy, and 0.90 area under the ROC (receiver operating characteristic) curve when predicting the edited cells. In comparison, the candidate model from the second approach, which uses deep learning (T2D5) and DIC microscopy-derived imaging data, performed with less accuracy (0.80) and ROC AUC (0.87). We envision that equivalent machine learning-based models can complement currently available genotyping protocols for specific genetic modifications which result in morphological changes in human cells. 
    more » « less
  5. ABSTRACT EseN is anEdwardsiella ictaluritype III secretion system effector with phosphothreonine lyase activity. In this work, we demonstrate that EseN inactivates p38 and c-Jun-N-terminal kinase (JNK) in infected head-kidney-derived macrophages (HKDMs). We have previously reported inactivation of extracellular-regulated kinase 1/2 (ERK1/2). Also, for the first time, we demonstrated that EseN is involved in the inactivation of 3-phosphoinositide-dependent kinase 1 (PDK1), which has not been previously demonstrated for any of the EseN homologs in other species. We also found that EseN significantly affected mRNA expression ofIL-10, pro-apoptoticbaxa, andp53, but had no significant effect on anti-apoptoticbcl2or pro-apoptotic apoptotic peptidase activating factor 1. EseN is also involved in the inhibition of caspase-8 and caspase-3/7 but does not affect caspase-9 activity. Repression of apoptosis was further confirmed with flow cytometry using Alexa Fluor 647-labeled annexin V and propidium iodide. In addition, we found that theE. ictaluriT3SS is essential for the inhibition of IL-1β maturation, but EseN is not involved in this process. EseN did not affect cell pyroptosis, as indicated by the lack of EseN impact on the release of lactate dehydrogenase from infected HKDM. The transmission electron microscopy data also indicate that HKDM infected with WT or aneseNmutant died by apoptosis, while HKDM infected with the T3SS mutant more likely died by pyroptosis. Collectively, our results indicate thatE. ictaluriEseN is involved in inactivation of ERK1/2, p38, JNK, and PDK1 signaling pathways that lead to modulation of cell death among infected HKDMs. IMPORTANCEThis work has global significance in the catfish industry, which provides food for increasing global populations.E. ictaluriis a leading cause of disease loss, and EseN is an important player inE. ictalurivirulence. TheE. ictaluriT3SS effector EseN plays an essential role in establishing infection, but the specific role EseN plays is not well characterized. EseN belongs to a family of phosphothreonine lyase effectors that specifically target host mitogen activated protein kinase (MAPK) pathways important in regulating host responses to infection. No phosphothreonine lyase equivalents are known in eukaryotes, making this family of effectors an attractive target for indirect narrow-spectrum antibiotics. Targeting of major vault protein and PDK1 kinase by EseN has not been reported in EseN homologs in other pathogens and may indicate unique functions ofE. ictaluriEseN. EseN targeting of PDK1 is particularly interesting in that it is linked to an extraordinarily diverse group of cellular functions. 
    more » « less