New photocatalysts are traditionally identified through trial-and-error methods. Machine learning has shown considerable promise for improving the efficiency of photocatalyst discovery from a large potential pool. Here, we describe a multi-step, target-driven consensus method using a stacking meta-learning algorithm that robustly predicts bandgaps and H2 evolution activities of photocatalysts. Trained on small datasets, these models can rapidly screen a large space (>10 million materials) to identify promising, non-toxic compounds as candidate water splitting photocatalysts. Two effective compounds and two controls possessing optimal bandgap values (∼2 eV) but not photoactivity as predicted by the models were synthesized. Their experimentally measured bandgaps and H2 evolution activities were consistent with the predictions. Conspicuously, the two compounds with strong photoactivities under UV and visible light are promising visible-light-driven water splitting photocatalysts. This study demonstrates the power of machine learning and the potential of big data to accelerate discovery of next-generation photocatalysts.
Keywords: catalysis; chemistry; computational chemistry.
© 2021 The Authors.