Summary CRISPR genome engineering and single-cell RNA sequencing have accelerated biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present considerable statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens—“thresholded regression”—exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV (“GLM-based errors-in-variables”), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across hundreds of processors on clouds (e.g. Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, yielding several new insights.
more »
« less
Reproducibility metrics for CRISPR screens
CRISPR screens are used extensively to systematically interrogate the phenotype-to-genotype problem. In contrast to early CRISPR screens, which defined core cell fitness genes, most current efforts now aim to identify context-specific phenotypes that differentiate a cell line, genetic background or condition of interest, such as a drug treatment. While CRISPR-related technologies have shown great promise and a fast pace of innovation, a better understanding of standards and methods for quality assessment of CRISPR screen results is crucial to guide technology development and application. Specifically, many commonly used metrics for quantifying screen quality do not accurately measure the reproducibility of context-specific hits. We highlight the importance of reporting reproducibility statistics that directly relate to the purpose of the screen and suggest the use of metrics that are sensitive to context-specific signal.
more »
« less
- Award ID(s):
- 1818293
- PAR ID:
- 10342363
- Date Published:
- Journal Name:
- bioRxiv
- ISSN:
- 2692-8205
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The ENCODE Consortium’s efforts to annotate noncoding cis-regulatory elements (CREs) have advanced our understanding of gene regulatory landscapes. Pooled, noncoding CRISPR screens offer a systematic approach to investigate cis-regulatory mechanisms. The ENCODE4 Functional Characterization Centers conducted 108 screens in human cell lines, comprising >540,000 perturbations across 24.85 megabases of the genome. Using 332 functionally confirmed CRE–gene links in K562 cells, we established guidelines for screening endogenous noncoding elements with CRISPR interference (CRISPRi), including accurate detection of CREs that exhibit variable, often low, transcriptional effects. Benchmarking five screen analysis tools, we find that CASA produces the most conservative CRE calls and is robust to artifacts of low-specificity single guide RNAs. We uncover a subtle DNA strand bias for CRISPRi in transcribed regions with implications for screen design and analysis. Together, we provide an accessible data resource, predesigned single guide RNAs for targeting 3,275,697 ENCODE SCREEN candidate CREs with CRISPRi and screening guidelines to accelerate functional characterization of the noncoding genome.more » « less
-
Abstract We present GuideScan2 for memory-efficient, parallelizable construction of high-specificity CRISPR guide RNA (gRNA) databases and user-friendly design and analysis of individual gRNAs and gRNA libraries for targeting coding and non-coding regions in custom genomes. GuideScan2 analysis identifies widespread confounding effects of low-specificity gRNAs in published CRISPR screens and enables construction of a gRNA library that reduces off-target effects in a gene essentiality screen. GuideScan2 also enables the design and experimental validation of allele-specific gRNAs in a hybrid mouse genome. GuideScan2 will facilitate CRISPR experiments across a wide range of applications.more » « less
-
Abstract Clustered regularly interspaced short palindromic repeats (CRISPR)-associated nuclease (Cas) technologies facilitate routine genome engineering of one or a few genes at a time. However, large-scale CRISPR screens with guide RNA libraries remain challenging in plants. Here, we have developed a comprehensive all-in-one CRISPR toolbox for Cas9-based genome editing, cytosine base editing, adenine base editing (ABE), Cas12a-based genome editing and ABE, and CRISPR-Act3.0-based gene activation in both monocot and dicot plants. We evaluated all-in-one T-DNA expression vectors in rice (Oryza sativa, monocot) and tomato (Solanum lycopersicum, dicot) protoplasts, demonstrating their broad and reliable applicability. To showcase the applications of these vectors in CRISPR screens, we constructed guide RNA (gRNA) pools for testing in rice protoplasts, establishing a high-throughput approach to select high-activity gRNAs. Additionally, we demonstrated the efficacy of sgRNA library screening for targeted mutagenesis of ACETOLACTATE SYNTHASE in rice, recovering novel candidate alleles for herbicide resistance. Furthermore, we carried out a CRISPR activation screen in Arabidopsis thaliana, rapidly identifying potent gRNAs for FLOWERING LOCUS T activation that confer an early-flowering phenotype. This toolbox contains 61 versatile all-in-one vectors encompassing nearly all commonly used CRISPR technologies. It will facilitate large-scale genetic screens for loss-of-function or gain-of-function studies, presenting numerous promising applications in plants.more » « less
-
null (Ed.)Genetic screens are used to identify genes involved in specific biological processes. An EMS mutagenesis screen in Drosophila melanogaster identified growth control phenotypes in the developing eye. One mutant line from this screen, H.3.2, was phenotypically characterized using the FLP/FRT system and genetically mapped by complementation analysis and genomic sequencing by undergraduate students participating in the multi-institution Fly-CURE consortium. H.3.2 was found to have a nonsense mutation in short stop (shot), an ortholog of the mammalian spectraplakin dystonin (DST). shot and DST are involved in cytoskeletal organization and play roles during cell growth and proliferation.more » « less
An official website of the United States government

