Published November 17, 2023
| Version v1
Journal article
Open
Generative BigSMILES: An extension for polymer informatics, computer simulations & ML/AI
- 1. University of Chicago
- 2. Massachusetts Institute of Technology
Description
The BigSMILES notation, a concise tool for polymer ensemble representation, is augmented here by introducing an enhanced version called generative BigSMILES. G-BigSMILES is designed for generative workflows, and is complemented by tailored software tools for ease of use. This extension integrates additional data, including reactivity ratios (or connection probabilities among repeat units), molecular weight distributions, and ensemble size. An algorithm, interpretable as a generative graph is devised that utilizes these data, enabling molecule generation from defined polymer ensembles. Consequently, the G-BigSMILES notation allows for efficient specification of complex molecular ensembles via a streamlined line notation, thereby providing a foundational tool for automated polymeric materials design. In addition, the graph interpretation of the G-BigSMILES notation sets the stage for robust machine learning methods capable of encapsulating intricate polymeric ensembles. The combination of G-BigSMILES with advanced machine learning techniques will facilitate straightforward property determination and in silico polymeric material synthesis automation. This integration has the potential to significantly accelerate materials design processes and advance the field of polymer science.
Data availability
The code for G-BigSMILES can be found at https://github.com/InnocentBug/bigSMILESgen. The version of the code employed for this study is version 0.1.1. All data used throughout the article can be reproduced with the SI.ipynb at the same location.Files
Generative-BigSMILES.pdf
Files
(2.6 MB)
| Name | Size | Download all |
|---|---|---|
|
Supplementary information md5:cd92ffff505ffe10b6a00c482984d180 |
799.6 kB | Preview Download |
|
Article md5:7b26442ab46dd8f4cc80cf3a11f68834 |
1.8 MB | Preview Download |
Additional details
Identifiers
- DOI
- 10.1039/D3DD00147D
- Other
- oai:uchicago.tind.io:10875
Funding
- Schmidt Futures
- Eric and Wendy Schmidt AI in Science Postdoctoral Fellowship
- National Science Foundation
- Convergence Accelerator