Abstract The European corn borer (Ostrinia nubilalis) is an agricultural pest and burgeoning model for research on speciation, seasonal adaptation and insect resistance management. Although previous work inO. nubilalishas identified genes associated with differences in life cycle, reproduction, and resistance toBttoxins, the general lack of a robust gene‐editing protocol forO. nubilalishas been a barrier to functional validation of candidate genes. Here, we demonstrate an efficient and practical methodology for heritable gene mutagenesis inO. nubilalisusing the CRISPR/Cas9 genome editing system. Precise loss‐of‐function (LOF) mutations were generated at two circadian clock genes,period(per) andpigment‐dispersing factor receptor(pdfr), and a developmental gene,prothoracicotropic hormone(ptth). Precluding the need for a visible genetic marker, gene‐editing efficiency remained high across different single guide RNAs (sgRNA) and germline transmission of mutations to F1offspring approached 100%. When single or dual sgRNAs were injected at a high concentration, gene‐specific phenotypic differences in behaviour and development were identified in F0mutants. Specifically, F0gene mutants demonstrated that PER, but not PDFR, is essential for normal timing of eclosion. PTTH F0mutants were significantly heavier and exhibited a higher incidence of diapause. This work will accelerate future studies of gene function inO. nubilalisand facilitate the development of similar screens in other Lepidopteran and non‐model insects.
more »
« less
Dimensionality reduction methods for extracting functional networks from large‐scale CRISPR screens
Abstract CRISPR‐Cas9 screens facilitate the discovery of gene functional relationships and phenotype‐specific dependencies. The Cancer Dependency Map (DepMap) is the largest compendium of whole‐genome CRISPR screens aimed at identifying cancer‐specific genetic dependencies across human cell lines. A mitochondria‐associated bias has been previously reported to mask signals for genes involved in other functions, and thus, methods for normalizing this dominant signal to improve co‐essentiality networks are of interest. In this study, we explore three unsupervised dimensionality reduction methods—autoencoders, robust, and classical principal component analyses (PCA)—for normalizing the DepMap to improve functional networks extracted from these data. We propose a novel “onion” normalization technique to combine several normalized data layers into a single network. Benchmarking analyses reveal that robust PCA combined with onion normalization outperforms existing methods for normalizing the DepMap. Our work demonstrates the value of removing low‐dimensional signals from the DepMap before constructing functional gene networks and provides generalizable dimensionality reduction‐based normalization tools.
more »
« less
- Award ID(s):
- 1818293
- PAR ID:
- 10555233
- Publisher / Repository:
- Molecular Systems Biology
- Date Published:
- Journal Name:
- Molecular Systems Biology
- Volume:
- 19
- Issue:
- 11
- ISSN:
- 1744-4292
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas genome-wide screens are powerful tools for unraveling genotype–phenotype relationships, enabling precise manipulation of genes to study and engineer industrially useful traits. Traditional genetic methods, such as random mutagenesis or RNA interference, often lack the specificity and scalability required for large-scale functional genomic screens. CRISPR systems overcome these limitations by offering precision gene targeting and manipulation, allowing for high-throughput investigations into gene function and interactions. Recent work has shown that CRISPR genome editing is widely adaptable to several yeast species, many of which have natural traits suited for industrial biotechnology. In this review, we discuss recent advances in yeast functional genomics, emphasizing advancements made with CRISPR tools. We discuss how the development and optimization of CRISPR genome-wide screens have enabled a host-first approach to metabolic engineering, which takes advantage of the natural traits of nonconventional yeast—fast growth rates, high stress tolerance, and novel metabolism—to create new production hosts. Lastly, we discuss future directions, including automation and biosensor-driven screens, to enhance high-throughput CRISPR-enabled yeast engineering.more » « less
-
Background:The limited diagnostic accuracy of prostate-specific antigen screening for prostate cancer (PCa) has prompted innovative solutions, such as the state-of-the-art 18-gene urine test for clinically-significant PCa (MyProstateScore2.0 (MPS2)).Objective:We aim to develop a non-invasive biomarker test, the simplified MPS2 (sMPS2), which achieves similar state-of-the-art accuracy as MPS2 for predicting high-grade PCa but requires substantially fewer genes than the 18-gene MPS2 to improve its accessibility for routine clinical care.Methods:We grounded the development of sMPS2 in the Predictability, Computability, and Stability (PCS) framework for veridical data science. Under this framework, we stress-tested the development of sMPS2 across various data preprocessing and modeling choices and developed a stability-driven PCS ranking procedure for selecting the most predictive and robust genes for use in sMPS2.Results:The final sMPS2 model consisted of 7 genes and achieved a 0.784 AUROC (95% confidence interval, 0.742–0.825) for predicting high-grade PCa on a blinded external validation cohort. This is only 2.3% lower than the 18-gene MPS2, which is similar in magnitude to the 1–2% in uncertainty induced by different data preprocessing choices.Conclusions:The 7-gene sMPS2 provides a unique opportunity to expand the reach and adoption of non-invasive PCa screening.more » « less
-
Summary CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens—“thresholded regression”—exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV (“GLM-based errors-in-variables”), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across hundreds of processors on clouds (e.g. Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, yielding several new insights.more » « less
-
Abstract Data reduction methods are frequently employed in large genomics and phenomics studies to extract core patterns, reduce dimensionality, and alleviate multiple testing effects. Principal component analysis (PCA), in particular, identifies the components that capture the most variance within omics datasets. While data reduction can simplify complex datasets, it remains unclear how the use of PCA impacts downstream analyses such as quantitative trait loci (QTL) or genome-wide association (GWA) approaches and their biological interpretation. In QTL studies, an alternative to data reduction is the use of post-hoc data summarization approaches, such as hotspot analysis, which involves mapping individual traits and consolidating results based on shared genomic locations. To evaluate how different analytical approaches may alter the biological insights derived from multi-dimensional QTL datasets, we compared individual trait hotspots with PCA-based QTL mapping using transcriptomic and metabolomic data from a structured recombinant inbred line population. Interestingly, these two approaches identified different genomic regions and genetic architectures. These findings suggest that mapping PCA-reduced data does not merely streamline analyses but may generate a fundamentally different view of the underlying genetic architecture compared to individual trait mapping and hotspot analysis. Thus, the use of PCA and other data reduction techniques prior to QTL or GWAS mapping should be carefully considered to ensure alignment with the specific biological question being addressed.more » « less
An official website of the United States government

