Published November 17, 2023 | Version v1
Journal article Open

Generative BigSMILES: An extension for polymer informatics, computer simulations & ML/AI

  • 1. University of Chicago
  • 2. Massachusetts Institute of Technology

Description

The BigSMILES notation, a concise tool for polymer ensemble representation, is augmented here by introducing an enhanced version called generative BigSMILES. G-BigSMILES is designed for generative workflows, and is complemented by tailored software tools for ease of use. This extension integrates additional data, including reactivity ratios (or connection probabilities among repeat units), molecular weight distributions, and ensemble size. An algorithm, interpretable as a generative graph is devised that utilizes these data, enabling molecule generation from defined polymer ensembles. Consequently, the G-BigSMILES notation allows for efficient specification of complex molecular ensembles via a streamlined line notation, thereby providing a foundational tool for automated polymeric materials design. In addition, the graph interpretation of the G-BigSMILES notation sets the stage for robust machine learning methods capable of encapsulating intricate polymeric ensembles. The combination of G-BigSMILES with advanced machine learning techniques will facilitate straightforward property determination and in silico polymeric material synthesis automation. This integration has the potential to significantly accelerate materials design processes and advance the field of polymer science.

Data availability

The code for G-BigSMILES can be found at https://github.com/InnocentBug/bigSMILESgen. The version of the code employed for this study is version 0.1.1. All data used throughout the article can be reproduced with the SI.ipynb at the same location.

Files

Generative-BigSMILES.pdf

Files (2.6 MB)

Name Size Download all
Supplementary information
md5:cd92ffff505ffe10b6a00c482984d180
799.6 kB Preview Download
Article
md5:7b26442ab46dd8f4cc80cf3a11f68834
1.8 MB Preview Download

Additional details

Identifiers

DOI
10.1039/D3DD00147D
Other
oai:uchicago.tind.io:10875

Funding

Schmidt Futures
Eric and Wendy Schmidt AI in Science Postdoctoral Fellowship
National Science Foundation
Convergence Accelerator

UChicago Information

Division(s)
Pritzker School of Molecular Engineering