NSF PAR Search | NSF Public Access Repository

CG-BigSMILES: Line Notation for Coarse-Grained Models of Polymers

https://doi.org/10.1021/acs.macromol.5c00516

Leão, Bruno S; Zou, Weizhong; Rebello, Nathan J; Rubinstein, Michael; Franco, Luís F M; Olsen, Bradley D (September 2025, Macromolecules)

Free, publicly-accessible full text available September 9, 2026
The Block Copolymer Phase Behavior Database

https://doi.org/10.1021/acs.jcim.4c00242

Rebello, Nathan J; Arora, Akash; Mochigase, Hidenobu; Lin, Tzyy-Shyang; Shi, Jiale; Audus, Debra J; Muckley, Eric S; Osmani, Ardiana; Olsen, Bradley D (August 2024, Journal of Chemical Information and Modeling)

Full Text Available
BigSMARTS: A Topologically Aware Query Language and Substructure Search Algorithm for Polymer Chemical Structures

https://doi.org/10.1021/acs.jcim.3c00978

Rebello, Nathan J.; Lin, Tzyy-Shyang; Nazeer, Heeba; Olsen, Bradley D. (November 2023, Journal of Chemical Information and Modeling)

Molecular search is important in chemistry, biology, and informatics for identifying molecular structures within large data sets, improving knowledge discovery and innovation, and making chemical data FAIR (findable, accessible, interoperable, reusable). Search algorithms for polymers are significantly less developed than those for small molecules because polymer search relies on searching by polymer name, which can be challenging because polymer naming is overly broad (e.g., polyethylene), complicated for complex chemical structures, and often does not correspond to official IUPAC conventions. Chemical structure search in polymers is limited to substructures, such as monomers, without awareness of connectivity or topology. This work introduces a novel query language and graph traversal search algorithm for polymers that provides the first search method able to fully capture all of the chemical structures present in polymers. The BigSMARTS query language, an extension of the small-molecule SMARTS language, allows users to write queries that localize monomer and functional group searches to different parts of the polymer, like the middle block of a triblock, the side chain of a graft, and the backbone of a repeat unit. The substructure search algorithm is based on the traversal of graph representations of the generating functions for the stochastic graphs of polymers. Operationally, the algorithm first identifies cycles representing the monomers and then the end groups and finally performs a depth-first search to match entire subgraphs. To validate the algorithm, hundreds of queries were searched against hundreds of target chemistries and topologies from the literature, with approximately 440,000 query–target pairs. This tool provides a detailed algorithm that can be implemented in search engines to provide search results with full matching of the monomer connectivity and polymer topology.
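
The depth-first matching step mentioned in the abstract above can be illustrated with a generic backtracking subgraph search on labeled graphs. This is a minimal sketch and not the BigSMARTS algorithm itself: the function name `find_subgraph_match`, the dictionary-based graph encoding, and the toy three-node cycle used as a target are all assumptions made for illustration, and the cycle and end-group identification steps are omitted.

```python
def find_subgraph_match(q_labels, q_adj, t_labels, t_adj):
    """Return a query-node -> target-node mapping, or None if no match exists.

    Hypothetical encoding: *_labels maps node id -> atom label,
    *_adj maps node id -> set of neighboring node ids (undirected).
    """
    order = list(q_labels)  # fixed order in which query nodes are assigned

    def dfs(i, mapping, used):
        if i == len(order):
            return dict(mapping)  # every query node has been placed
        q = order[i]
        for t in t_labels:
            if t in used or t_labels[t] != q_labels[q]:
                continue
            # Each already-mapped neighbor of q must map to a neighbor of t.
            if all(mapping[n] in t_adj[t] for n in q_adj[q] if n in mapping):
                mapping[q] = t
                used.add(t)
                result = dfs(i + 1, mapping, used)
                if result is not None:
                    return result
                # Backtrack and try the next candidate target node.
                del mapping[q]
                used.discard(t)
        return None

    return dfs(0, {}, set())


# Toy target: a three-node cycle C-C-O (an assumed stand-in for a repeat unit
# drawn as a cycle; not taken from the paper).
target_labels = {0: "C", 1: "C", 2: "O"}
target_adj = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}

# Query: a C-O bond.
query_labels = {"a": "C", "b": "O"}
query_adj = {"a": {"b"}, "b": {"a"}}

match = find_subgraph_match(query_labels, query_adj, target_labels, target_adj)
```

Here `match` maps each query node to a target node with the same label and preserved adjacency. A query containing a label absent from the target, such as a C-N bond, returns `None`.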
Full Text Available
Machine Translation between BigSMILES Line Notation and Chemical Structure Diagrams

https://doi.org/10.1021/acs.macromol.3c01378

Deagen, Michael E; Dalle-Cort, Bérenger; Rebello, Nathan J; Lin, Tzyy-Shyang; Walsh, Dylan J; Olsen, Bradley D (January 2024, Macromolecules)

Full Text Available
Quantifying Pairwise Similarity for Complex Polymers

https://doi.org/10.1021/acs.macromol.3c00761

Shi, Jiale; Rebello, Nathan J.; Walsh, Dylan; Zou, Weizhong; Deagen, Michael E.; Leao, Bruno Salomao; Audus, Debra J.; Olsen, Bradley D. (September 2023, Macromolecules)

Defining the similarity between chemical entities is an essential task in polymer informatics, enabling ranking, clustering, and classification. Despite its importance, the pairwise chemical similarity of polymers remains an open problem. Here, a similarity function for polymers with well-defined backbones is designed based on polymers’ stochastic graph representations generated from canonical BigSMILES, a structurally based line notation for describing macromolecules. The stochastic graph representations are separated into three parts: repeat units, end groups, and polymer topology. The earth mover’s distance is utilized to calculate the similarity of the repeat units and end groups, while the graph edit distance is used to calculate the similarity of the topology. These three values can be linearly or nonlinearly combined to yield an overall pairwise chemical similarity score for polymers that is largely consistent with the chemical intuition of expert users and is adjustable based on the relative importance of different chemical features for a given similarity problem. This method gives a reliable solution to quantitatively calculate the pairwise chemical similarity score for polymers and represents a vital step toward building search engines and quantitative design tools for polymer data.
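
The final combination step described in the abstract above can be sketched as follows, assuming the three component similarities (repeat units, end groups, topology) have already been computed and each lies in [0, 1]. The function names, the default equal weights, and the use of a weighted geometric mean as the nonlinear option are illustrative assumptions, not the authors' published formulas; the earth mover's distance and graph edit distance calculations themselves are not shown.

```python
import math


def _normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative with a positive sum")
    return [w / total for w in weights]


def combine_linear(s_repeat, s_end, s_topology, weights=(1.0, 1.0, 1.0)):
    """Weighted arithmetic mean of the three component similarities."""
    w = _normalize(weights)
    return w[0] * s_repeat + w[1] * s_end + w[2] * s_topology


def combine_geometric(s_repeat, s_end, s_topology, weights=(1.0, 1.0, 1.0)):
    """One possible nonlinear combination: a weighted geometric mean.

    A low score in any component with nonzero weight pulls the overall
    score down more strongly than in the linear case.
    """
    w = _normalize(weights)
    scores = (s_repeat, s_end, s_topology)
    return math.prod(s ** wi for s, wi in zip(scores, w))


# Example: weight the repeat-unit similarity twice as heavily as the others.
overall = combine_linear(0.8, 0.5, 1.0, weights=(2.0, 1.0, 1.0))
```

Changing the weights is one way to express the adjustability the abstract mentions, for example raising the topology weight when chain architecture matters most for a given comparison.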
Full Text Available
Random Forest Predictor for Diblock Copolymer Phase Behavior

https://doi.org/10.1021/acsmacrolett.1c00521

Arora, Akash; Lin, Tzyy-Shyang; Rebello, Nathan J.; Av-Ron, Sarah H.; Mochigase, Hidenobu; Olsen, Bradley D. (November 2021, ACS Macro Letters)

Full Text Available
Extending BigSMILES to non-covalent bonds in supramolecular polymer assemblies

https://doi.org/10.1039/d2sc02257e

Zou, Weizhong; Martell Monterroza, Alexis; Yao, Yunxin; Millik, S. Cem; Cencer, Morgan M.; Rebello, Nathan J.; Beech, Haley K.; Morris, Melody A.; Lin, Tzyy-Shyang; Castano, Cleotilde S.; et al (September 2022, Chemical Science)

As a machine-recognizable representation of polymer connectivity, BigSMILES line notation extends SMILES from deterministic to stochastic structures. The same framework that allows BigSMILES to accommodate stochastic covalent connectivity can be extended to non-covalent bonds, enhancing its value for polymers, supramolecular materials, and colloidal chemistry. Non-covalent bonds are captured through the inclusion of annotations to pseudo atoms serving as complementary binding pairs, minimal key/value pairs to elaborate other relevant attributes, and indexes to specify the pairing among potential donors and acceptors or bond delocalization. Incorporating these annotations into BigSMILES line notation enables the representation of four common classes of non-covalent bonds in polymer science: electrostatic interactions, hydrogen bonding, metal–ligand complexation, and π–π stacking. The principal advantage of non-covalent BigSMILES is the ability to accommodate a broad variety of non-covalent chemistry with a simple user-orientated, semi-flexible annotation formalism. This goal is achieved by encoding a universal but non-exhaustive representation of non-covalent or stochastic bonding patterns through syntax for (de)protonated and delocalized state of bonding as well as nested bonds for correlated bonding and multi-component mixture. By allowing user-defined descriptors in the annotation expression, further applications in data-driven research can be envisioned to represent chemical structures in many other fields, including polymer nanocomposite and surface chemistry.
Full Text Available
PolyDAT: A Generic Data Schema for Polymer Characterization

https://doi.org/10.1021/acs.jcim.1c00028

Lin, Tzyy-Shyang; Rebello, Nathan J.; Beech, Haley K.; Wang, Zi; El-Zaatari, Bassil; Lundberg, David J.; Johnson, Jeremiah A.; Kalow, Julia A.; Craig, Stephen L.; Olsen, Bradley D. (March 2021, Journal of Chemical Information and Modeling)
Full Text Available
Influence of Counterion Structure on Conductivity of Polymerized Ionic Liquids

https://doi.org/10.1021/acsmacrolett.9b00070

Keith, Jordan R.; Rebello, Nathan J.; Cowen, Benjamin J.; Ganesan, Venkat (March 2019, ACS Macro Letters)

Full Text Available
Calculating Pairwise Similarity of Polymer Ensembles via Earth Mover’s Distance

https://doi.org/10.1021/acspolymersau.3c00029

Shi, Jiale; Walsh, Dylan; Zou, Weizhong; Rebello, Nathan J.; Deagen, Michael E.; Fransen, Katharina A.; Gao, Xian; Olsen, Bradley D.; Audus, Debra J. (January 2024, ACS Polymers Au)
