Generative Design of Inorganic Compounds Using Deep Diffusion Language Models.

Rongzhi Dong, Nihang Fu, Edirisuriya M D Siriwardane, Jianjun Hu
Author Information
  1. Rongzhi Dong: Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States. ORCID
  2. Nihang Fu: Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States.
  3. Edirisuriya M D Siriwardane: Department of Physics, University of Colombo, Colombo 00300, Sri Lanka. ORCID
  4. Jianjun Hu: Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States. ORCID

Abstract

Due to the vast chemical space, discovering materials with a specific function is challenging. Chemical formulas are obligated to conform to a set of exacting criteria, such as charge neutrality, balanced electronegativity, synthesizability, and mechanical stability. In response to this formidable task, we introduce a deep-learning-based generative model for material composition and structure design by learning and exploiting explicit and implicit chemical knowledge. Our pipeline first uses deep diffusion language models as the generator of compositions and then applies a template-based crystal structure prediction algorithm to predict their corresponding structures, which is then followed by structure relaxation using a universal graph neural network-based potential. Density functional theory (DFT) calculations of the formation energies and energy-above-the-hull analysis are used to validate new structures generated through our pipeline. Based on the DFT calculation results, six new materials, including TiHfO, TaNbP, YMoN, TaReO, HfTiO, and HfMnO, with formation energy less than zero have been found. Remarkably, among these, four materials, namely, TiHfO, TaNbP, YMoN, and TaReO, exhibit an e-above-hull energy of less than 0.3 eV. These findings have proved the effectiveness of our approach.

Word Cloud

Created with Highcharts 10.0.0materialsstructurechemicalpipelinestructuresDFTformationnewTiHfOTaNbPYMoNTaReOenergylessDuevastspacediscoveringspecificfunctionchallengingChemicalformulasobligatedconformsetexactingcriteriachargeneutralitybalancedelectronegativitysynthesizabilitymechanicalstabilityresponseformidabletaskintroducedeep-learning-basedgenerativemodelmaterialcompositiondesignlearningexploitingexplicitimplicitknowledgefirstusesdeepdiffusionlanguagemodelsgeneratorcompositionsappliestemplate-basedcrystalpredictionalgorithmpredictcorrespondingfollowedrelaxationusinguniversalgraphneuralnetwork-basedpotentialDensityfunctionaltheorycalculationsenergiesenergy-above-the-hullanalysisusedvalidategeneratedBasedcalculationresultssixincludingHfTiOHfMnOzerofoundRemarkablyamongfournamelyexhibite-above-hull03eVfindingsprovedeffectivenessapproachGenerativeDesignInorganicCompoundsUsingDeepDiffusionLanguageModels

Similar Articles

Cited By