Search for: All records

Creators/Authors contains: "Shi, Honglue"


  1. Abstract Structured RNA lies at the heart of many central biological processes, from gene expression to catalysis. RNA structure prediction is not yet possible due to a lack of high-quality reference data associated with organismal phenotypes that could inform RNA function. We present GARNET (Gtdb Acquired RNa with Environmental Temperatures), a new database for RNA structural and functional analysis anchored to the Genome Taxonomy Database (GTDB). GARNET links RNA sequences to experimental and predicted optimal growth temperatures of GTDB reference organisms. Using GARNET, we develop sequence- and structure-aware RNA generative models, with overlapping triplet tokenization providing optimal encoding for a GPT-like model. Leveraging hyperthermophilic RNAs in GARNET and these RNA generative models, we identify mutations in ribosomal RNA that confer increased thermostability to the Escherichia coli ribosome. The GTDB-derived data and deep learning models presented here provide a foundation for understanding the connections between RNA sequence, structure, and function.
  2. Abstract CRISPR-Cas12a is an RNA-guided, programmable genome editing enzyme found within bacterial adaptive immune pathways. Unlike CRISPR-Cas9, Cas12a uses only a single catalytic site to both cleave target double-stranded DNA (dsDNA) (cis-activity) and indiscriminately degrade single-stranded DNA (ssDNA) (trans-activity). To investigate how the relative potency of cis- versus trans-DNase activity affects Cas12a-mediated genome editing, we first used structure-guided engineering to generate variants of Lachnospiraceae bacterium Cas12a that selectively disrupt trans-activity. The resulting engineered mutant with the biggest differential between cis- and trans-DNase activity in vitro showed minimal genome editing activity in human cells, motivating a second set of experiments using directed evolution to generate additional mutants with robust genome editing activity. Notably, these engineered and evolved mutants had enhanced ability to induce homology-directed repair (HDR) editing by 2–18-fold compared to wild-type Cas12a when using HDR donors containing mismatches with crRNA at the PAM-distal region. Finally, a site-specific reversion mutation produced improved Cas12a (iCas12a) variants with superior genome editing efficiency at genomic sites that are difficult to edit using wild-type Cas12a. This strategy establishes a pipeline for creating improved genome editing tools by combining structural insights with randomization and selection. The available structures of other CRISPR-Cas enzymes will enable this strategy to be applied to improve the efficacy of other genome-editing proteins. 
  3. The majority of base pairs in double-stranded DNA exist in the canonical Watson-Crick geometry. However, they can also adopt alternate Hoogsteen conformations in various complexes of DNA with proteins and small molecules, which are key for biological function and mechanism. While detection of Hoogsteen base pairs in large DNA complexes and assemblies poses considerable challenges for traditional structural biology techniques, we show here that multidimensional dynamic nuclear polarization–enhanced solid-state NMR can serve as a unique spectroscopic tool for observing and distinguishing Watson-Crick and Hoogsteen base pairs in a broad range of DNA systems based on characteristic NMR chemical shifts and internuclear dipolar couplings. We illustrate this approach using a model 12-mer DNA duplex, free and in complex with the antibiotic echinomycin, which features two central adenine-thymine base pairs with Watson-Crick and Hoogsteen geometry, respectively, and subsequently extend it to the ∼200 kDa Widom 601 DNA nucleosome core particle. 