Abstract Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas genome-wide screens are powerful tools for unraveling genotype–phenotype relationships, enabling precise manipulation of genes to study and engineer industrially useful traits. Traditional genetic methods, such as random mutagenesis or RNA interference, often lack the specificity and scalability required for large-scale functional genomic screens. CRISPR systems overcome these limitations by offering precision gene targeting and manipulation, allowing for high-throughput investigations into gene function and interactions. Recent work has shown that CRISPR genome editing is widely adaptable to several yeast species, many of which have natural traits suited for industrial biotechnology. In this review, we discuss recent advances in yeast functional genomics, emphasizing advancements made with CRISPR tools. We discuss how the development and optimization of CRISPR genome-wide screens have enabled a host-first approach to metabolic engineering, which takes advantage of the natural traits of nonconventional yeast—fast growth rates, high stress tolerance, and novel metabolism—to create new production hosts. Lastly, we discuss future directions, including automation and biosensor-driven screens, to enhance high-throughput CRISPR-enabled yeast engineering.
more »
« less
Dimensionality reduction methods for extracting functional networks from large‐scale CRISPR screens
Abstract CRISPR‐Cas9 screens facilitate the discovery of gene functional relationships and phenotype‐specific dependencies. The Cancer Dependency Map (DepMap) is the largest compendium of whole‐genome CRISPR screens aimed at identifying cancer‐specific genetic dependencies across human cell lines. A mitochondria‐associated bias has been previously reported to mask signals for genes involved in other functions, and thus, methods for normalizing this dominant signal to improve co‐essentiality networks are of interest. In this study, we explore three unsupervised dimensionality reduction methods—autoencoders, robust, and classical principal component analyses (PCA)—for normalizing the DepMap to improve functional networks extracted from these data. We propose a novel “onion” normalization technique to combine several normalized data layers into a single network. Benchmarking analyses reveal that robust PCA combined with onion normalization outperforms existing methods for normalizing the DepMap. Our work demonstrates the value of removing low‐dimensional signals from the DepMap before constructing functional gene networks and provides generalizable dimensionality reduction‐based normalization tools.
more »
« less
- Award ID(s):
- 1818293
- PAR ID:
- 10465296
- Publisher / Repository:
- DOI PREFIX: 10.15252
- Date Published:
- Journal Name:
- Molecular Systems Biology
- ISSN:
- 1744-4292
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Background:The limited diagnostic accuracy of prostate-specific antigen screening for prostate cancer (PCa) has prompted innovative solutions, such as the state-of-the-art 18-gene urine test for clinically-significant PCa (MyProstateScore2.0 (MPS2)).Objective:We aim to develop a non-invasive biomarker test, the simplified MPS2 (sMPS2), which achieves similar state-of-the-art accuracy as MPS2 for predicting high-grade PCa but requires substantially fewer genes than the 18-gene MPS2 to improve its accessibility for routine clinical care.Methods:We grounded the development of sMPS2 in the Predictability, Computability, and Stability (PCS) framework for veridical data science. Under this framework, we stress-tested the development of sMPS2 across various data preprocessing and modeling choices and developed a stability-driven PCS ranking procedure for selecting the most predictive and robust genes for use in sMPS2.Results:The final sMPS2 model consisted of 7 genes and achieved a 0.784 AUROC (95% confidence interval, 0.742–0.825) for predicting high-grade PCa on a blinded external validation cohort. This is only 2.3% lower than the 18-gene MPS2, which is similar in magnitude to the 1–2% in uncertainty induced by different data preprocessing choices.Conclusions:The 7-gene sMPS2 provides a unique opportunity to expand the reach and adoption of non-invasive PCa screening.more » « less
-
Abstract The European corn borer (Ostrinia nubilalis) is an agricultural pest and burgeoning model for research on speciation, seasonal adaptation and insect resistance management. Although previous work inO. nubilalishas identified genes associated with differences in life cycle, reproduction, and resistance toBttoxins, the general lack of a robust gene‐editing protocol forO. nubilalishas been a barrier to functional validation of candidate genes. Here, we demonstrate an efficient and practical methodology for heritable gene mutagenesis inO. nubilalisusing the CRISPR/Cas9 genome editing system. Precise loss‐of‐function (LOF) mutations were generated at two circadian clock genes,period(per) andpigment‐dispersing factor receptor(pdfr), and a developmental gene,prothoracicotropic hormone(ptth). Precluding the need for a visible genetic marker, gene‐editing efficiency remained high across different single guide RNAs (sgRNA) and germline transmission of mutations to F1offspring approached 100%. When single or dual sgRNAs were injected at a high concentration, gene‐specific phenotypic differences in behaviour and development were identified in F0mutants. Specifically, F0gene mutants demonstrated that PER, but not PDFR, is essential for normal timing of eclosion. PTTH F0mutants were significantly heavier and exhibited a higher incidence of diapause. This work will accelerate future studies of gene function inO. nubilalisand facilitate the development of similar screens in other Lepidopteran and non‐model insects.more » « less
-
Abstract The multifaceted nature of CRISPR screens has propelled advancements in the field of functional genomics. Pooled CRISPR screens involve creating programmed genetic perturbations across multiple genomic sites in a pool of host cells subjected to a challenge, empowering researchers to identify genetic causes of desirable phenotypes. These genome-wide screens have been widely used in mammalian cells to discover biological mechanisms of diseases and drive the development of targeted drugs and therapeutics. Their use in non-model organisms, especially in microbes to improve bioprocessing-relevant phenotypes, has been limited. Further compounding this issue is the lack of bioinformatic algorithms for analyzing microbial screening data with high accuracy. Here, we describe the general approach and underlying principles for conducting pooled CRISPR knockout screens in non-conventional yeasts and performing downstream analysis of the screening data, while also reviewing state-of-the-art algorithms for identification of CRISPR screening outcomes. Application of pooled CRISPR screens to non-model yeasts holds considerable potential to uncover novel metabolic engineering targets and improve industrial bioproduction. One-Sentence SummaryThis mini-review describes experimental and computational approaches for functional genomic screening using CRISPR technologies in non-conventional microbes.more » « less
-
Summary CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens—“thresholded regression”—exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV (“GLM-based errors-in-variables”), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across hundreds of processors on clouds (e.g. Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, yielding several new insights.more » « less
An official website of the United States government
