There is currently no gene expression assay that can assess if premalignant lesions will develop into invasive breast cancer. This study sought to identify biomarkers for selecting patients with a high potential for developing invasive carcinoma in the breast with normal histology, benign lesions, or premalignant lesions. A set of 26-gene mRNA expression profiles were used to identify invasive ductal carcinomas from histologically normal tissue and benign lesions and to select those with a higher potential for future cancer development (ADHC) in the breast associated with atypical ductal hyperplasia (ADH). The expression-defined model achieved an overall accuracy of 94.05% (AUC = 0.96) in classifying invasive ductal carcinomas from histologically normal tissue and benign lesions (n = 185). This gene signature classified cancer development in ADH tissues with an overall accuracy of 100% (n = 8). The mRNA expression patterns of these 26 genes were validated using RT-PCR analyses of independent tissue samples (n = 77) and blood samples (n = 48). The protein expression of PBX2 and RAD52 assessed with immunohistochemistry were prognostic of breast cancer survival outcomes. This signature provided significant prognostic stratification in The Cancer Genome Atlas breast cancer patients (n = 1100), as well as basal-like and luminal A subtypes, and was associated with distinct immune infiltration and activities. The mRNA and protein expression of the 26 genes was associated with sensitivity or resistance to 18 NCCN-recommended drugs for treating breast cancer. Eleven genes had significant proliferative potential in CRISPR-Cas9/RNAi screening. Based on this gene expression signature, the VEGFR inhibitor ZM-306416 was discovered as a new drug for treating breast cancer.
more »
« less
A simplified MyProstateScore2.0 for high-grade prostate cancer
Background:The limited diagnostic accuracy of prostate-specific antigen screening for prostate cancer (PCa) has prompted innovative solutions, such as the state-of-the-art 18-gene urine test for clinically-significant PCa (MyProstateScore2.0 (MPS2)).Objective:We aim to develop a non-invasive biomarker test, the simplified MPS2 (sMPS2), which achieves similar state-of-the-art accuracy as MPS2 for predicting high-grade PCa but requires substantially fewer genes than the 18-gene MPS2 to improve its accessibility for routine clinical care.Methods:We grounded the development of sMPS2 in the Predictability, Computability, and Stability (PCS) framework for veridical data science. Under this framework, we stress-tested the development of sMPS2 across various data preprocessing and modeling choices and developed a stability-driven PCS ranking procedure for selecting the most predictive and robust genes for use in sMPS2.Results:The final sMPS2 model consisted of 7 genes and achieved a 0.784 AUROC (95% confidence interval, 0.742–0.825) for predicting high-grade PCa on a blinded external validation cohort. This is only 2.3% lower than the 18-gene MPS2, which is similar in magnitude to the 1–2% in uncertainty induced by different data preprocessing choices.Conclusions:The 7-gene sMPS2 provides a unique opportunity to expand the reach and adoption of non-invasive PCa screening.
more »
« less
- PAR ID:
- 10635742
- Publisher / Repository:
- Sage Journals
- Date Published:
- Journal Name:
- Cancer Biomarkers
- Volume:
- 42
- Issue:
- 1
- ISSN:
- 1574-0153
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Building and expanding on principles of statistics, machine learning, and scientific inquiry, we propose the predictability, computability, and stability (PCS) framework for veridical data science. Our framework, composed of both a workflow and documentation, aims to provide responsible, reliable, reproducible, and transparent results across the data science life cycle. The PCS workflow uses predictability as a reality check and considers the importance of computation in data collection/storage and algorithm design. It augments predictability and computability with an overarching stability principle. Stability expands on statistical uncertainty considerations to assess how human judgment calls impact data results through data and model/algorithm perturbations. As part of the PCS workflow, we develop PCS inference procedures, namely PCS perturbation intervals and PCS hypothesis testing, to investigate the stability of data results relative to problem formulation, data cleaning, modeling decisions, and interpretations. We illustrate PCS inference through neuroscience and genomics projects of our own and others. Moreover, we demonstrate its favorable performance over existing methods in terms of receiver operating characteristic (ROC) curves in high-dimensional, sparse linear model simulations, including a wide range of misspecified models. Finally, we propose PCS documentation based on R Markdown or Jupyter Notebook, with publicly available, reproducible codes and narratives to back up human choices made throughout an analysis. The PCS workflow and documentation are demonstrated in a genomics case study available on Zenodo.more » « less
-
Abstract The link between in vitro hERG ion channel inhibition and subsequent in vivo QT interval prolongation, a critical risk factor for the development of arrythmias such as Torsade de Pointes, is so well established that in vitro hERG activity alone is often sufficient to end the development of an otherwise promising drug candidate. It is therefore of tremendous interest to develop advanced methods for identifying hERG-active compounds in the early stages of drug development, as well as for proposing redesigned compounds with reduced hERG liability and preserved primary pharmacology. In this work, we present CardioGenAI, a machine learning-based framework for re-engineering both developmental and commercially available drugs for reduced hERG activity while preserving their pharmacological activity. The framework incorporates novel state-of-the-art discriminative models for predicting hERG channel activity, as well as activity against the voltage-gated NaV1.5 and CaV1.2 channels due to their potential implications in modulating the arrhythmogenic potential induced by hERG channel blockade. We applied the complete framework to pimozide, an FDA-approved antipsychotic agent that demonstrates high affinity to the hERG channel, and generated 100 refined candidates. Remarkably, among the candidates is fluspirilene, a compound which is of the same class of drugs as pimozide (diphenylmethanes) and therefore has similar pharmacological activity, yet exhibits over 700-fold weaker binding to hERG. Furthermore, we demonstrated the framework's ability to optimize hERG, NaV1.5 and CaV1.2 profiles of multiple FDA-approved compounds while maintaining the physicochemical nature of the original drugs. We envision that this method can effectively be applied to developmental compounds exhibiting hERG liabilities to provide a means of rescuing drug development programs that have stalled due to hERG-related safety concerns. Additionally, the discriminative models can also serve independently as effective components of virtual screening pipelines. We have made all of our software open-source athttps://github.com/gregory-kyro/CardioGenAIto facilitate integration of the CardioGenAI framework for molecular hypothesis generation into drug discovery workflows. Scientific contribution This work introduces CardioGenAI, an open-source machine learning-based framework designed to re-engineer drugs for reduced hERG liability while preserving their pharmacological activity. The complete CardioGenAI framework can be applied to developmental compounds exhibiting hERG liabilities to provide a means of rescuing drug discovery programs facing hERG-related challenges. In addition, the framework incorporates novel state-of-the-art discriminative models for predicting hERG, NaV1.5 and CaV1.2 channel activity, which can function independently as effective components of virtual screening pipelines.more » « less
-
Synthetic matrices with dynamic presentation of cell guidance cues are needed for the development of physiologically relevant in vitro tumor models. Towards the goal of mimicking prostate cancer progression and metastasis, we engineered a tunable hyaluronic acid-based hydrogel platform with protease degradable and cell adhesive properties employing bioorthogonal tetrazine ligation with strained alkenes. The synthetic matrix was first fabricated via a slow tetrazine-norbornene reaction, then temporally modified via a diffusion-controlled method using trans-cyclooctene, a fierce dienophile that reacts with tetrazine with an unusually fast rate. The encapsulated DU145 prostate cancer single cells spontaneously formed multicellular tumoroids after 7 days of culture. In situ modification of the synthetic matrix via covalent tagging of cell adhesive RGD peptide induced tumoroid decompaction and the development of cellular protrusions. RGD tagging did not compromise the overall cell viability, nor did it induce cell apoptosis. In response to increased matrix adhesiveness, DU145 cells dynamically loosen cell-cell adhesion and strengthen cell-matrix interactions to promote an invasive phenotype. Characterization of the 3D cultures by immunocytochemistry and gene expression analyses demonstrated that cells invaded into the matrix via a mesenchymal like migration, with upregulation of major mesenchymal markers, and down regulation of epithelial markers. The tumoroids formed cortactin positive invadopodia like structures, indicating active matrix remodeling. Overall, the engineered tumor model can be utilized to identify potential molecular targets and test pharmacological inhibitors, thereby accelerating the design of innovative strategies for cancer therapeutics.more » « less
-
We have identified 38 specifically excised, differentially expressed snoRNA fragments (sdRNAs) in TCGA prostate cancer (PCa) patient samples as compared to normal prostate controls. SnoRNA-derived fragments sdRNA-D19b and -A24 emerged among the most differentially expressed and were selected for further experimentation. We found that the overexpression of either sdRNA significantly increased PC3 (a well-established model of castration-resistant prostate cancer (CRPC)) cell proliferation, and that sdRNA-D19b overexpression also markedly increased the rate of PC3 cell migration. In addition, both sdRNAs provided drug-specific resistances with sdRNA-D19b levels correlating with paclitaxel resistance and sdRNA-24A conferring dasatinib resistance. In silico and in vitro analyses revealed that two established PCa tumor suppressor genes, CD44 and CDK12, represent targets for sdRNA-D19b and sdRNA-A24, respectively. This outlines a biologically coherent mechanism by which sdRNAs downregulate tumor suppressors in AR-PCa to enhance proliferative and metastatic capabilities and to encourage chemotherapeutic resistance. Aggressive proliferation, rampant metastasis, and recalcitrance to chemotherapy are core characteristics of CRPC that synergize to produce a pathology that ranks second in cancer-related deaths for men. This study defines sdRNA-D19b and -A24 as contributors to AR-PCa, potentially providing novel biomarkers and therapeutic targets of use in PCa clinical intervention.more » « less
An official website of the United States government

