Machine learning can be used to predict the properties of polymers and explore vast chemical spaces. However, the limited number of available experimental datasets hinders the enhancement of the predictive performance of a model. This study proposes a machine learning approach that leverages transfer learning and ensemble modeling to efficiently predict the glass transition temperature (Tg) of fluorinated polymers and guide the design of high Tg copolymers. Initially, the quantum machine 9 (QM9) dataset is employed for model pretraining, thus providing robust molecular representations for the subsequent fine-tuning of a specialized copolymer dataset. Ensemble modeling is used to further enhance prediction robustness and reliability, effectively addressing the problems owing to the limited and unevenly distributed nature of the copolymer dataset. Finally, a fine-tuned ensemble model is used to navigate a vast chemical space comprising 61 monomers and identify promising candidates for high Tg fluorinated polymers. The model predicts 247 entries capable of achieving a Tg over 390 K, of which 14 are experimentally validated. This study demonstrates the potential of machine learning in material design and discovery, highlighting the effectiveness of transfer learning and ensemble modeling strategies for overcoming the challenges posed by small datasets in complex copolymer systems.
Keywords: ensemble modeling; fluorinated polymer; glass transition temperature; machine learning; transfer learning.
© 2024 The Author(s). Macromolecular Rapid Communications published by Wiley‐VCH GmbH.