Sequence control in synthetic copolymers remains a tantalizing objective in polymer science due to the influence of sequence on material properties and self-organization. A greater understanding of sequence development throughout the polymerization process will aid the design of simple, generalizable methods to control sequence and tune supramolecular assembly. In previous simulations of solution-based step-growth copolymerizations, we have shown that weak, non-bonding attractions between monomers of the same type can produce a microphase separation among the lengthening nascent oligomers and thereby alter sequence. This work explores the phenomenon further, examining how effective attractive interactions, mediated by a solvent selective for one of the reacting species, impact the development of sequence and the supramolecular assembly in a simple A–B copolymerization. We find that as the effective attractions between monomers increase, an emergent self-organization of the reactants causes a shift in reaction kinetics and sequence development. When the solvent-mediated interactions are selective enough, the simple mixture of A and B monomers oligomerize and self-assemble into structures characteristic of amphiphilic copolymers. The composition and morphology of these structures and the sequences of their chains are sensitive to the relative balance of affinities between the comonomer species. Our results demonstrate the impact of differing A–B monomer–solvent affinities on sequence development in solution-based copolymerizations and are of consequence to the informed design of synthetic methods for sequence controlled amphiphilic copolymers and their aggregates.
more »
« less
This content will become publicly available on February 17, 2026
Predicting self-assembly of sequence-controlled copolymers with stochastic sequence variation
Sequence-controlled copolymers can self-assemble into a wide assortment of complex architectures, with exciting applications in nanofabrication and personalized medicine. However, polymer synthesis is notoriously imprecise, and stochasticity in both chemical synthesis and self-assembly poses a significant challenge to tight control over these systems. While it is increasingly viable to design “protein-like” sequences, specifying each individual monomer in a chain, the effect of variability within those sequences has not been well studied. In this work, we performed nearly 15[thin space (1/6-em)]000 molecular dynamics simulations of sequence-controlled copolymer aggregates with varying level of sequence stochasticity. We utilized unsupervised learning to characterize the resulting morphologies and found that sequence variation leads to relatively smooth and predictable changes in morphology compared to ensembles of identical chains. Furthermore, structural response to sequence variation was accurately modeled using supervised learning, revealing several interesting trends in how specific families of sequences break down as monomer sequences become more variable. Our work presents a way forward in understanding and controlling the effect of sequence variation in sequence-controlled copolymer systems, which can hopefully be used to design advanced copolymer systems for technological applications in the future.
more »
« less
- PAR ID:
- 10580312
- Publisher / Repository:
- Royal Society of Chemistry
- Date Published:
- Journal Name:
- Soft Matter
- Volume:
- 21
- Issue:
- 11
- ISSN:
- 1744-683X
- Page Range / eLocation ID:
- 2143 to 2151
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The emergence of data-intensive scientific discovery and machine learning has dramatically changed the way in which scientists and engineers approach materials design. Nevertheless, for designing macromolecules or polymers, one limitation is the lack of appropriate methods or standards for converting systems into chemically informed, machine-readable representations. This featurization process is critical to building predictive models that can guide polymer discovery. Although standard molecular featurization techniques have been deployed on homopolymers, such approaches capture neither the multiscale nature nor topological complexity of copolymers, and they have limited application to systems that cannot be characterized by a single repeat unit. Herein, we present, evaluate, and analyze a series of featurization strategies suitable for copolymer systems. These strategies are systematically examined in diverse prediction tasks sourced from four distinct datasets that enable understanding of how featurization can impact copolymer property prediction. Based on this comparative analysis, we suggest directly encoding polymer size in polymer representations when possible, adopting topological descriptors or convolutional neural networks when the precise polymer sequence is known, and using chemically informed unit representations when developing extrapolative models. These results provide guidance and future directions regarding polymer featurization for copolymer design by machine learning.more » « less
-
Small nucleolytic ribozymes are RNAs that cleave their own phosphodiester backbone. While proteinaceous enzymes are regulated by a variety of known mechanisms, methods of regulation for ribozymes remain unclear. Twister is one ribozyme class for which many structural and catalytic properties have been elucidated. However, few studies have analyzed the activity of twister ribozymes in the context of native flanking sequence, even though ribozymes as transcribed in nature do not exist in isolation. Interactions between the ribozyme and its neighboring sequences can induce conformational changes that inhibit self-cleavage, providing a regulatory mechanism that could naturally determine ribozyme activity in vivo and in synthetic applications. To date, eight twister ribozymes have been identified within the staple crop rice (Oryza sativa). Herein, we select several twister ribozymes from rice and show that they are differentially regulated by their flanking sequence using published RNA-seq datasets, structure probing, and co-transcriptional cleavage assays. We found that the Osa 1-2 ribozyme does not interact with its flanking sequences. However, sequences flanking the Osa 1-3 and Osa 1-8 ribozymes form inactive conformations, referred to here as “ribozymogens”, that attenuate ribozyme self-cleavage activity. For the Osa 1-3 ribozyme, we show that activity can be rescued upon addition of a complementary antisense oligonucleotide, suggesting ribozymogens can be controlled via external signals. In all, our data provide a plausible mechanism wherein flanking sequence differentially regulates ribozyme activity in vivo. More broadly, the ability to regulate ribozyme behavior locally has potential applications in control of gene expression and synthetic biology.more » « less
-
Abstract Transforming renewable resources into functional and degradable polymers is driven by the ever‐increasing demand to replace unsustainable polyolefins. However, the utility of many degradable homopolymers remains limited due to their inferior properties compared to commodity polyolefins. Therefore, the synthesis of sequence‐defined copolymers from one‐pot monomer mixtures is not only conceptually appealing in chemistry, but also economically attractive by maximizing materials usage and improving polymers’ performances. Among many polymerization strategies, ring‐opening (co)polymerization of cyclic monomers enables efficient access to degradable polymers with high control on molecular weights and molecular weight distributions. Herein, we highlight recent advances in achieving one‐pot, sequence‐controlled polymerizations of cyclic monomer mixtures using a single catalytic system that combines multiple catalytic cycles. The scopes of cyclic monomers, catalysts, and polymerization mechanisms are presented for this type of sequence‐controlled ring‐opening copolymerization.more » « less
-
Gram-positive bacteria are some of the earliest known life forms, diverging from gram-negative bacteria 2 billion years ago. These organisms utilize sortase enzymes to attach proteins to their peptidoglycan cell wall, a structural feature that distinguishes the two types of bacteria. The transpeptidase activity of sortases make them an important tool in protein engineering applications, e.g., in sortase-mediated ligations or sortagging. However, due to relatively low catalytic efficiency, there are ongoing efforts to create better sortase variants for these uses. Here, we use bioinformatics tools, principal component analysis and ancestral sequence reconstruction, in combination with protein biochemistry, to analyze natural sequence variation in these enzymes. Principal component analysis on the sortase superfamily distinguishes previously described classes and identifies regions of relatively high sequence variation in structurally-conserved loops within each sortase family, including those near the active site. Using ancestral sequence reconstruction, we determined sequences of ancestral Staphylococcus and Streptococcus Class A sortase proteins. Enzyme assays revealed that the ancestral Streptococcus enzyme is relatively active and shares similar sequence variation with other Class A Streptococcus sortases. Taken together, we highlight how natural sequence variation can be utilized to investigate this important protein family, arguing that these and similar techniques may be used to discover or design sortases with increased catalytic efficiency and/or selectivity for sortase-mediated ligation experiments.more » « less
An official website of the United States government
