Published October 21, 2020 | Version v1
Journal article Open

Targeted sequence design within the coarse-grained polymer genome

Description

The chemical design of polymers with target structural and/or functional properties represents a grand challenge in materials science. While data-driven design approaches are promising, success with polymers has been limited, largely due to limitations in data availability. Here, we demonstrate the targeted sequence design of single-chain structure in polymers by combining coarse-grained modeling, machine learning, and model optimization. Nearly 2000 unique coarse-grained polymers are simulated to construct and analyze machine learning models. We find that deep neural networks inexpensively and reliably predict structural properties with limited sequence information as input. By coupling trained ML models with sequential model-based optimization, polymer sequences are proposed to exhibit globular, swollen, or rod-like behaviors, which are verified by explicit simulations. This work highlights the promising integration of coarse-grained modeling with data-driven design and represents a necessary and crucial step toward more complex polymer design efforts.

Data availability

All datasets used in this work are available as Supplementary Materials along with example scripts demonstrating the construction of the ML models. Additional data related to this paper may be requested from the authors.

Files

sciadv.abc6216.pdf

Files (3.4 MB)

Name Size Download all
md5:2a78de814935c204cf1f840b75e1e827
1.2 MB Preview Download
Article
md5:5c18d1bed4270df7a3b5e95a4a53416c
2.2 MB Preview Download

Additional details

Identifiers

DOI
10.1126/sciadv.abc6216
Other
oai:uchicago.tind.io:11046

Funding

National Science Foundation
1828629
U.S. Department of Energy
Solvay Inc.

UChicago Information

Division(s)
Pritzker School of Molecular Engineering