skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 10:00 PM ET on Friday, February 6 until 10:00 AM ET on Saturday, February 7 due to maintenance. We apologize for the inconvenience.


Search for: All records

Creators/Authors contains: "Baker, David"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract Directional interactions that generate regular coordination geometries are a powerful means of guiding molecular and colloidal self-assembly, but implementing such high-level interactions with proteins remains challenging due to their complex shapes and intricate interface properties. Here we describe a modular approach to protein nanomaterial design inspired by the rich chemical diversity that can be generated from the small number of atomic valencies. We design protein building blocks using deep learning-based generative tools, incorporating regular coordination geometries and tailorable bonding interactions that enable the assembly of diverse closed and open architectures guided by simple geometric principles. Experimental characterization confirms the successful formation of more than 20 multicomponent polyhedral protein cages, two-dimensional arrays and three-dimensional protein lattices, with a high (10%–50%) success rate and electron microscopy data closely matching the corresponding design models. Due to modularity, individual building blocks can assemble with different partners to generate distinct regular assemblies, resulting in an economy of parts and enabling the construction of reconfigurable networks for designer nanomaterials. 
    more » « less
  2. Abstract Naturally generated lipid nanoparticles termed extracellular vesicles (EVs) hold significant promise as engineerable therapeutic delivery vehicles. However, active loading of protein cargo into EVs in a manner that is useful for delivery remains a challenge. Here, we demonstrate that by rationally designing proteins to traffic to the plasma membrane and associate with lipid rafts, we can enhance loading of protein cargo into EVs for a set of structurally diverse transmembrane and peripheral membrane proteins. We then demonstrate the capacity of select lipid tags to mediate increased EV loading and functional delivery of an engineered transcription factor to modulate gene expression in target cells. We envision that this technology could be leveraged to develop new EV-based therapeutics that deliver a wide array of macromolecular cargo. 
    more » « less
  3. Abstract The organization of membrane proteins between and within membrane-bound compartments is critical to cellular function. Yet we lack approaches to regulate this organization in a range of membrane-based materials, such as engineered cells, exosomes, and liposomes. Uncovering and leveraging biophysical drivers of membrane protein organization to design membrane systems could greatly enhance the functionality of these materials. Towards this goal, we use de novo protein design, molecular dynamic simulations, and cell-free systems to explore how membrane-protein hydrophobic mismatch could be used to tune protein cotranslational integration and organization in synthetic lipid membranes. We find that membranes must deform to accommodate membrane-protein hydrophobic mismatch, which reduces the expression and co-translational insertion of membrane proteins into synthetic membranes. We use this principle to sort proteins both between and within membranes, thereby achieving one-pot assembly of vesicles with distinct functions and controlled split-protein assembly, respectively. Our results shed light on protein organization in biological membranes and provide a framework to design self-organizing membrane-based materials with applications such as artificial cells, biosensors, and therapeutic nanoparticles. 
    more » « less
  4. Abstract Navigating uncertainty is a critical challenge in all fields of science, especially when translating knowledge into real-world policies or management decisions. However, the wide variance in concepts and definitions of uncertainty across scientific fields hinders effective communication. As a microcosm of diverse fields within Earth Science, NASA’s Carbon Monitoring System (CMS) provides a useful crucible in which to identify cross-cutting concepts of uncertainty. The CMS convened the Uncertainty Working Group (UWG), a group of specialists across disciplines, to evaluate and synthesize efforts to characterize uncertainty in CMS projects. This paper represents efforts by the UWG to build a heuristic framework designed to evaluate data products and communicate uncertainty to both scientific and non-scientific end users. We consider four pillars of uncertainty: origins, severity, stochasticity versus incomplete knowledge, and spatial and temporal autocorrelation. Using a common vocabulary and a generalized workflow, the framework introduces a graphical heuristic accompanied by a narrative, exemplified through contrasting case studies. Envisioned as a versatile tool, this framework provides clarity in reporting uncertainty, guiding users and tempering expectations. Beyond CMS, it stands as a simple yet powerful means to communicate uncertainty across diverse scientific communities. 
    more » « less
  5. Abstract De novo design of complex protein folds using solely computational means remains a substantial challenge1. Here we use a robust deep learning pipeline to design complex folds and soluble analogues of integral membrane proteins. Unique membrane topologies, such as those from G-protein-coupled receptors2, are not found in the soluble proteome, and we demonstrate that their structural features can be recapitulated in solution. Biophysical analyses demonstrate the high thermal stability of the designs, and experimental structures show remarkable design accuracy. The soluble analogues were functionalized with native structural motifs, as a proof of concept for bringing membrane protein functions to the soluble proteome, potentially enabling new approaches in drug discovery. In summary, we have designed complex protein topologies and enriched them with functionalities from membrane proteins, with high experimental success rates, leading to a de facto expansion of the functional soluble fold space. 
    more » « less
  6. La Niña climate anomalies have historically been associated with substantial reductions in the atmospheric CO2growth rate. However, the 2021 La Niña exhibited a unique near-neutral impact on the CO2growth rate. In this study, we investigate the underlying mechanisms by using an ensemble of net CO2fluxes constrained by CO2observations from the Orbiting Carbon Observatory-2 in conjunction with estimates of gross primary production and fire carbon emissions. Our analysis reveals that the close-to-normal atmospheric CO2growth rate in 2021 was the result of the compensation between increased net carbon uptake over the tropics and reduced net carbon uptake over the Northern Hemisphere mid-latitudes. Specifically, we identify that the extreme drought and warm anomalies in Europe and Asia reduced the net carbon uptake and offset 72% of the increased net carbon uptake over the tropics in 2021. This study contributes to our broader understanding of how regional processes can shape the trajectory of atmospheric CO2concentration under climate change. 
    more » « less
  7. Abstract Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Databank. We noticed that the model quality was higher and the Root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein–protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models. 
    more » « less
  8. Abstract We investigated the biogeography of benthic foraminifera in a highly urbanized tropical seascape, i.e. Hong Kong, in order to assess their utility as bioindicators relative to other marine fauna. Hong Kong is one of the largest coastal cities on the planet and studies of other benthic fauna in the region are available for comparison. We found that: (1) turbid, muddy habitats host a unique foraminiferal fauna; (2) areas with intermediate levels of eutrophication have the highest foraminiferal species diversity; (3) semi-enclosed and heavily polluted environments host a distinct foraminiferal fauna, characterized by low taxonomic diversity and/or high dominance, and that is acclimated to stressful marine conditions. Biodiversity patterns of foraminifera in Hong Kong are generally consistent with those of other soft-sediment macro- and meio-fauna (e.g. polychaetes, molluscs and ostracods); however, foraminifera may be more sensitive than these other groups to eutrophication and associated changes in coastal food webs. The tolerance of some, but not other, species to eutrophic and hypoxic conditions means that foraminiferal faunas can serve as bioindicators across a wide array of environmental conditions, in contrast with corals whose sensitivity to eutrophication results in their absence from eutrophied settings. The well-known autoecology of foraminifera taxa can help to characterize environmental conditions of different habitats and regional environmental gradients. Although the use of fauna as bioindicators may be most robust when data are compared for multiple taxonomic groups, when such broad sampling is not available, benthic foraminifera are particularly well suited for environmental assessments due to their ubiquity, interspecific environmental breadth, and the well-understood environmental preference of individual taxa. 
    more » « less