搜索

x

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

机器学习模型预测稀土化合物的热力学稳定性

秦成龙 赵亮 蒋刚

引用本文:
Citation:

机器学习模型预测稀土化合物的热力学稳定性

秦成龙, 赵亮, 蒋刚
cstr: 32037.14.aps.74.20250362

Machine learning model predicted thermodynamic stability of rare earth compounds

QIN Chenglong, ZHAO Liang, JIANG Gang
cstr: 32037.14.aps.74.20250362
Article Text (iFLYTEK Translation)
PDF
HTML
导出引用
  • 热力学稳定性在先进材料设计中占据核心地位, 其决定了材料在服役条件下的结构完整性与性能持续性. 本研究利用由280569个密度泛函理论(DFT)计算得到的能量数据集, 采用随机森林(RF)和神经网络(NN)两种机器学习(ML)模型来预测稀土化合物的热力学相稳定性. 研究使用一系列不包含结构信息的综合特征描述符, 使其适用于由任意数量元素构成的材料. 经5折交叉验证测试, 两种模型在分类和回归任务中均展现出卓越性能. 它们不仅能够精准地将化合物划分为稳定或不稳定类别, 还能精确预测化合物的形成能. 此外, 利用训练完成的模型, 对稀土化合物La-Al和Ce-H的二元相图进行预测. 考虑到单一模型在预测某些化合物时可能存在局限性, 为提升模型的鲁棒性, 采用了一种集成学习策略. 通过协同组合RF和NN模型的预测结果, 集成学习方法在准确预测稀土化合物相图方面表现出色, 成功捕捉到了多个数据库中没有的亚稳相.
    This study aims to predict the thermodynamic stability of rare-earth compounds by using machine learning (ML) models, providing crucial data support for designing advanced materials and facilitating the discovery of new rare-earth compounds.In terms of methods, this study is based on a dataset consisting of 280569 compounds. The formation energies of these compounds are calculated by density functional theory (DFT). A system consisting of 145 feature descriptors is constructed, covering stoichiometric properties, statistical properties of elements, electronic structure properties, and properties of ionic compounds, comprehensively describing the characteristics of rare-earth compounds. Two ML models, i.e. random forest (RF) and neural network (NN), are selected to perform classification and regression tasks respectively. The 5-fold cross-validation is used to improve the reliability of the models. The min-max scaling technique is used for preprocessing data, and an ensemble learning architecture is constructed to address the limitations of single model.In the classification task, the RF and NN algorithms perform remarkably well. With 5-fold cross-validation, the accuracy reaches approximately 0.97, and the F1 score is around 0.98, enabling the precise classification of compounds into stable or unstable categories. In the regression task, the mean absolute errors (MAEs) of the formation energy predictions by the RF and NN models are 0.055 eV/atom and 0.071 eV/atom, respectively. This indicates that the model predictions are highly accurate and can replace complete DFT calculations to a certain extent. In the predictive analysis of system outside the test set, six representative components are selected from the material project database, covering binary, ternary, and quaternary systems. The prediction errors of all compositions are controlled within 0.5 eV/atom, with an error percentage of lower than 25%, indicating that the model has strong ability of extrapolation and prediction. When predicting the binary phase diagrams of rare-earth compounds La-Al and Ce-H by using the trained models, the convex hull phase diagrams constructed through the ensemble learning architecture, which combines the prediction results of the RF and NN models, are highly consistent with those constructed from the open quantum materials database. The models successfully capture several metastable phases that are not present in multiple databases. Moreover, the convex hull distances of the predicted phases are mostly less than 0.1 eV/atom, with the maximum not exceeding 0.2 eV/atom.In conclusion, this study successfully uses ML models to predict the thermodynamic stability of rare-earth compounds. The constructed models demonstrate strong capabilities in classification and regression tasks. The ensemble learning architecture effectively improves the model performance, providing a promising tool for discovering materials in the field of rare-earth science, contributing to the research and development of new rare-earth compounds, and designing advanced materials.
      通信作者: 赵亮, zhaol@scu.edu.cn ; 蒋刚, gjiang@scu.edu.cn
    • 基金项目: 国家自然科学基金(批准号: 12304274)和中央高校基本科研业务费专项资金(批准号: 2024SCU12104)资助的课题.
      Corresponding author: ZHAO Liang, zhaol@scu.edu.cn ; JIANG Gang, gjiang@scu.edu.cn
    • Funds: Project supported by the National Natural Science Foundation of China (Grant No. 12304274) and the Fundamental Research Funds for the Central Universities of China (Grant No. 2024SCU12104).
    [1]

    Dutta T, Kim K H, Uchimiya M, Kwon E E, Jeon B H, Deep A, Yun S T 2016 Environ. Res. 150 182Google Scholar

    [2]

    Ramos S J, Dinali G S, Oliveira C, Martins G C, Moreira C G, Siqueira J O, Guilherme L R G 2016 Curr. Pollut. Rep. 2 28Google Scholar

    [3]

    杜志勇, 沈丽萍, 王清 2025 现代肿瘤医学 33 1Google Scholar

    Du Z Y, Shen L P, Wang Q 2025 J. Mod. Oncol. 33 1Google Scholar

    [4]

    Meng S Y, Li G, Wang P, He M, Sun X H, Li Z X 2023 Mater. Chem. Front. 7 806Google Scholar

    [5]

    Zheng B Z, Fan J Y, Chen B, Qin X, Wang J, Wang F, Deng R R, Liu X G 2022 Chem. Rev. 122 5519Google Scholar

    [6]

    陈娇, 赵超宇, 刘冬 2024 热加工工艺 53 11

    Chen J, Zhao C Y, Liu D 2024 Hot Work. Technol. 53 11

    [7]

    刘贵立 2006 物理学报 55 6570Google Scholar

    Liu G L 2006 Acta Phys. Sin. 55 6570Google Scholar

    [8]

    张国英, 张辉, 魏丹, 罗志成, 李昱材 2009 物理学报 58 444Google Scholar

    Zhang G Y, Zhang H, Wei D, Luo Z C, Li Y C 2009 Acta Phys. Sin. 58 444Google Scholar

    [9]

    Agrawal A, Choudhary A 2016 APL Mater. 4 053208Google Scholar

    [10]

    Pham T L, Nguyen N D, Nguyen V D, Kino H, Miyake T, Dam H C 2018 J. Chem. Phys. 148 204106Google Scholar

    [11]

    Pilania G, Liu X Y, Wang Z 2019 J. Mater. Sci. 54 8361Google Scholar

    [12]

    Singh P, Del Rose T, Vazquez G, Arroyave R, Mudryk Y 2022 Acta Mater. 229 117759Google Scholar

    [13]

    张桥, 谭薇, 宁勇祺, 聂国政, 蔡孟秋, 王俊年, 朱慧平, 赵宇清 2024 物理学报 73 230201Google Scholar

    Zhang Q, Tan W, Ning Y Q, Nie G Z, Cai M Q, Wang J N, Zhu H P, Zhao Y Q 2024 Acta Phys. Sin. 73 230201Google Scholar

    [14]

    Lotfi S, Zhang Z, Viswanathan G, Fortenberry K, Mansouri Tehrani A, Brgoch J 2020 Matter 3 261Google Scholar

    [15]

    Schmidt J, Shi J, Borlido P, Chen L, Botti S, Marques M A L 2017 Chem. Mater. 29 5090Google Scholar

    [16]

    Talapatra A, Uberuaga B P, Stanek C R, Pilania G 2021 Chem. Mater. 33 845Google Scholar

    [17]

    Li W, Jacobs R, Morgan D 2018 Comput. Mater. Sci. 150 454Google Scholar

    [18]

    Odabaşı Ç, Yıldırım R 2020 Sol. Energy Mater. Sol. Cells 205 110284Google Scholar

    [19]

    Batra R, Chen C, Evans T G, Walton K S, Ramprasad R 2020 Nat. Mach. Intell. 2 704Google Scholar

    [20]

    Qin C L, Liu J D, Yu Y S, Xu Z H, Du J G, Jiang G, Zhao L 2024 Ceram. Int. 50 1220Google Scholar

    [21]

    Kirklin S, Saal J E, Meredig B, Thompson A, Doak J W, Aykol M, Rühl S, Wolverton C 2015 npj Comput. Mater. 1 15010Google Scholar

    [22]

    Zagorac D, Muller H, Ruehl S, Zagorac J, Rehme S 2019 J. Appl. Crystallogr. 52 918Google Scholar

    [23]

    Ward L, Agrawal A, Choudhary A, Wolverton C 2016 npj Comput Mater 2 16028Google Scholar

    [24]

    Ward L, Dunn A, Faghaninia A, Zimmermann N E R, Bajaj S, Wang Q, Montoya J, Chen J, Bystrom K, Dylla M, Chard K, Asta M, Persson K A, Snyder G J, Foster I, Jain A 2018 Comput. Mater. Sci. 152 60Google Scholar

    [25]

    Yang C, Ren C, Jia Y F, Wang G, Li M J, Lu W C 2022 Acta Mater. 222 117431Google Scholar

    [26]

    Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E 2011 J. Mach. Learn. Res. 12 2825

    [27]

    Bartel C J, Trewartha A, Wang Q, Dunn A, Jain A, Ceder G 2020 npj Comput. Mater. 6 97Google Scholar

    [28]

    Jain A, Ong S P, Hautier G, Chen W, Richards W D, Dacek S, Cholia S, Gunter D, Skinner D, Ceder G, Persson K A 2013 APL Mater. 1 011002Google Scholar

    [29]

    Jha D, Ward L, Paul A, Liao W K, Choudhary A, Wolverton C, Agrawal A 2018 Sci. Rep. 8 17593Google Scholar

  • 图 1  (a)数据集元素流行分布; (b)数据集稀土元素统计分布柱状图; (c)带有ICSD标签的数据集稀土元素流行分布; (d) 带有ICSD标签的数据集稀土元素统计分布柱状图

    Fig. 1.  (a) Popular distribution of elements in the dataset; (b) statistical distribution histograms of rare earth elements in the dataset; (c) a histogram of the statistical distribution of rare earth elements in a dataset labeled with ICSD; (d) statistical distribution histograms of rare earth elements in datasets with ICSD labels.

    图 2  (a)数据集的形成能分布; (b)数据集材料到凸包的距离统计图; (c)带有ICSD标签的数据集的形成能分布; (d)带有ICSD标签的数据集材料到凸包的距离统计图

    Fig. 2.  (a) Statistical chart of the formation energy distribution of the dataset; (b) statistical graph of the distance from the dataset material to the convex hull; (c) statistical graph of formation energy distribution for datasets with ICSD labels; (d) statistical graph of distance from material to convex hull in dataset with ICSD label.

    图 3  (a) RF和(b) NN模型预测的形成能散点图

    Fig. 3.  (a) RF and (b) NN model predicted formation energy scatter plots.

    图 4  形成能小于0 eV/atom的子集 (a) RF和(b) NN模型预测的形成能散点图

    Fig. 4.  Subset with formation energy less than 0 eV/atom: (a) RF and (b) NN model predicted formation energy scatter plots.

    图 5  化合物稳定性的分类结果 (a) RF和(d) NN模型的混淆矩阵; (b) RF和(e) NN模型的受试者工作特征(ROC)曲线; (c) RF和(f) NN模型的精确率-召回率(P-R)曲线

    Fig. 5.  Classification results of compound stability: (a) RF and (d) NN model confusion matrices; (b) RF and (e) NN model receiver operating characteristic (ROC) curves; (c) RF and (f) NN model precision-recall (P-R) curves.

    图 6  集成学习架构预测出的 (a) La-Al和(b) Ce-H二元体系的凸包相图; 黑色实线代表凸包边界, 绿色点代表稳定的组分(凸包能量距离等于0 eV/atom), 红色点代表亚稳定的组分(凸包能量距离小于0.2 eV/atom)

    Fig. 6.  Ensemble learning architecture-predicted convex hull phase diagrams of (a) La-Al and (b) Ce-H binary systems; the black solid line represents the boundaries of the convex hull, the green dots represent the stabilized components (the distance to the convex hull equal to 0 eV/atom), and the red dots represent the sub-stabilized components (the distance to the convex hull less than 0.2 eV/atom).

    表 1  使用ML模型预测以及DFT计算得到的组分形成能

    Table 1.  Formation energies of the compositions calculated using ML model and DFT.

    组分 ML/
    (eV·atom–1)
    DFT/
    (eV·atom–1)
    误差
    百分比/%
    EuH2 –0.58 –0.687 15.6
    Tb2O3 –3.52 –3.982 11.6
    CeSi –0.58 –0.749 22.6
    NdVO3 –3.14 –3.221 2.5
    PrH3O3 –1.97 –2.199 10.4
    LaP3H3O10 –2.22 –1.942 14.3
    下载: 导出CSV

    表 2  预测组分的形成能(Ef)和和凸包能量距离(Ehull)

    Table 2.  Formation enthalpy (Ef) and distance to the convex hull (Ehull) of predicted compositions.

    组分Ef /(eV·atom–1)Ehull/(eV·atom–1)
    Ce2H3–0.5310.0038
    Ce3H8–0.5250.0082
    CeH5–0.1430.1882
    La5Al9–0.4110.0736
    La7Al10–0.4140.0419
    La4Al5–0.4070.0316
    La2Al5–0.3750.0945
    La9Al4–0.2440.008
    下载: 导出CSV
  • [1]

    Dutta T, Kim K H, Uchimiya M, Kwon E E, Jeon B H, Deep A, Yun S T 2016 Environ. Res. 150 182Google Scholar

    [2]

    Ramos S J, Dinali G S, Oliveira C, Martins G C, Moreira C G, Siqueira J O, Guilherme L R G 2016 Curr. Pollut. Rep. 2 28Google Scholar

    [3]

    杜志勇, 沈丽萍, 王清 2025 现代肿瘤医学 33 1Google Scholar

    Du Z Y, Shen L P, Wang Q 2025 J. Mod. Oncol. 33 1Google Scholar

    [4]

    Meng S Y, Li G, Wang P, He M, Sun X H, Li Z X 2023 Mater. Chem. Front. 7 806Google Scholar

    [5]

    Zheng B Z, Fan J Y, Chen B, Qin X, Wang J, Wang F, Deng R R, Liu X G 2022 Chem. Rev. 122 5519Google Scholar

    [6]

    陈娇, 赵超宇, 刘冬 2024 热加工工艺 53 11

    Chen J, Zhao C Y, Liu D 2024 Hot Work. Technol. 53 11

    [7]

    刘贵立 2006 物理学报 55 6570Google Scholar

    Liu G L 2006 Acta Phys. Sin. 55 6570Google Scholar

    [8]

    张国英, 张辉, 魏丹, 罗志成, 李昱材 2009 物理学报 58 444Google Scholar

    Zhang G Y, Zhang H, Wei D, Luo Z C, Li Y C 2009 Acta Phys. Sin. 58 444Google Scholar

    [9]

    Agrawal A, Choudhary A 2016 APL Mater. 4 053208Google Scholar

    [10]

    Pham T L, Nguyen N D, Nguyen V D, Kino H, Miyake T, Dam H C 2018 J. Chem. Phys. 148 204106Google Scholar

    [11]

    Pilania G, Liu X Y, Wang Z 2019 J. Mater. Sci. 54 8361Google Scholar

    [12]

    Singh P, Del Rose T, Vazquez G, Arroyave R, Mudryk Y 2022 Acta Mater. 229 117759Google Scholar

    [13]

    张桥, 谭薇, 宁勇祺, 聂国政, 蔡孟秋, 王俊年, 朱慧平, 赵宇清 2024 物理学报 73 230201Google Scholar

    Zhang Q, Tan W, Ning Y Q, Nie G Z, Cai M Q, Wang J N, Zhu H P, Zhao Y Q 2024 Acta Phys. Sin. 73 230201Google Scholar

    [14]

    Lotfi S, Zhang Z, Viswanathan G, Fortenberry K, Mansouri Tehrani A, Brgoch J 2020 Matter 3 261Google Scholar

    [15]

    Schmidt J, Shi J, Borlido P, Chen L, Botti S, Marques M A L 2017 Chem. Mater. 29 5090Google Scholar

    [16]

    Talapatra A, Uberuaga B P, Stanek C R, Pilania G 2021 Chem. Mater. 33 845Google Scholar

    [17]

    Li W, Jacobs R, Morgan D 2018 Comput. Mater. Sci. 150 454Google Scholar

    [18]

    Odabaşı Ç, Yıldırım R 2020 Sol. Energy Mater. Sol. Cells 205 110284Google Scholar

    [19]

    Batra R, Chen C, Evans T G, Walton K S, Ramprasad R 2020 Nat. Mach. Intell. 2 704Google Scholar

    [20]

    Qin C L, Liu J D, Yu Y S, Xu Z H, Du J G, Jiang G, Zhao L 2024 Ceram. Int. 50 1220Google Scholar

    [21]

    Kirklin S, Saal J E, Meredig B, Thompson A, Doak J W, Aykol M, Rühl S, Wolverton C 2015 npj Comput. Mater. 1 15010Google Scholar

    [22]

    Zagorac D, Muller H, Ruehl S, Zagorac J, Rehme S 2019 J. Appl. Crystallogr. 52 918Google Scholar

    [23]

    Ward L, Agrawal A, Choudhary A, Wolverton C 2016 npj Comput Mater 2 16028Google Scholar

    [24]

    Ward L, Dunn A, Faghaninia A, Zimmermann N E R, Bajaj S, Wang Q, Montoya J, Chen J, Bystrom K, Dylla M, Chard K, Asta M, Persson K A, Snyder G J, Foster I, Jain A 2018 Comput. Mater. Sci. 152 60Google Scholar

    [25]

    Yang C, Ren C, Jia Y F, Wang G, Li M J, Lu W C 2022 Acta Mater. 222 117431Google Scholar

    [26]

    Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E 2011 J. Mach. Learn. Res. 12 2825

    [27]

    Bartel C J, Trewartha A, Wang Q, Dunn A, Jain A, Ceder G 2020 npj Comput. Mater. 6 97Google Scholar

    [28]

    Jain A, Ong S P, Hautier G, Chen W, Richards W D, Dacek S, Cholia S, Gunter D, Skinner D, Ceder G, Persson K A 2013 APL Mater. 1 011002Google Scholar

    [29]

    Jha D, Ward L, Paul A, Liao W K, Choudhary A, Wolverton C, Agrawal A 2018 Sci. Rep. 8 17593Google Scholar

  • [1] 吴阳海, 杜海龙, 薛雷, 李佳鲜, 薛淼, 郑国尧. 基于机器学习的托卡马克偏滤器靶板热负荷预测研究. 物理学报, 2025, 74(13): 135205. doi: 10.7498/aps.74.20250381
    [2] 刘兆圣, 张桥, 宁勇祺, 符秀交, 邹代峰, 王俊年, 赵宇清. 基于机器学习与第一性原理计算的高居里温度Janus预测. 物理学报, 2025, 74(22): . doi: 10.7498/aps.74.20251026
    [3] 王越, 叶函函, 熊伟, 王先华, 施海亮, 李超, 程晨, 吴时超. 一种光谱特征增强驱动的机器学习地基红外高光谱云检测方法. 物理学报, 2025, 74(20): 200202. doi: 10.7498/aps.74.20250982
    [4] 郭焱, 吕恒, 丁春玲, 袁晨智, 金锐博. 分数阶涡旋光衍射过程的机器学习识别. 物理学报, 2025, 74(1): 014203. doi: 10.7498/aps.74.20241458
    [5] 张童, 王加豪, 田帅, 孙旭冉, 李日. 基于机器学习的铸件凝固过程动态收缩行为. 物理学报, 2025, 74(2): 028103. doi: 10.7498/aps.74.20241581
    [6] 梁晨, 卢少瑜, 黄栋, 陈鑫, 冯岩. 基于机器学习从单颗粒动力学中诊断尘埃等离子体全局性质信息. 物理学报, 2025, 74(20): 205202. doi: 10.7498/aps.74.20251129
    [7] 王鹏, 麦麦提尼亚孜·麦麦提阿卜杜拉. 机器学习的量子动力学. 物理学报, 2025, 74(6): 060701. doi: 10.7498/aps.74.20240999
    [8] 宋睿, 刘雪梅, 王海滨, 吕皓, 宋晓艳. 机器学习辅助的WC-Co硬质合金硬度预测. 物理学报, 2024, 73(12): 126201. doi: 10.7498/aps.73.20240284
    [9] 张桥, 谭薇, 宁勇祺, 聂国政, 蔡孟秋, 王俊年, 朱慧平, 赵宇清. 基于机器学习和第一性原理计算的Janus材料预测. 物理学报, 2024, 73(23): 230201. doi: 10.7498/aps.73.20241278
    [10] 张旭, 丁进敏, 侯晨阳, 赵一鸣, 刘鸿维, 梁生. 基于机器学习的激光匀光整形方法. 物理学报, 2024, 73(16): 164205. doi: 10.7498/aps.73.20240747
    [11] 张嘉晖. 蛋白质计算中的机器学习. 物理学报, 2024, 73(6): 069301. doi: 10.7498/aps.73.20231618
    [12] 欧阳鑫健, 张岩星, 王之龙, 张锋, 陈韦嘉, 庄园, 揭晓, 刘来君, 王大威. 面向铁电相变的机器学习: 基于图卷积神经网络的分子动力学模拟. 物理学报, 2024, 73(8): 086301. doi: 10.7498/aps.73.20240156
    [13] 刘烨, 牛赫然, 李兵兵, 马欣华, 崔树旺. 机器学习在宇宙线粒子鉴别中的应用. 物理学报, 2023, 72(14): 140202. doi: 10.7498/aps.72.20230334
    [14] 管星悦, 黄恒焱, 彭华祺, 刘彦航, 李文飞, 王炜. 生物分子模拟中的机器学习方法. 物理学报, 2023, 72(24): 248708. doi: 10.7498/aps.72.20231624
    [15] 郭唯琛, 艾保全, 贺亮. 机器学习回归不确定性揭示自驱动活性粒子的群集相变. 物理学报, 2023, 72(20): 200701. doi: 10.7498/aps.72.20230896
    [16] 张嘉伟, 姚鸿博, 张远征, 蒋伟博, 吴永辉, 张亚菊, 敖天勇, 郑海务. 通过机器学习实现基于摩擦纳米发电机的自驱动智能传感及其应用. 物理学报, 2022, 71(7): 078702. doi: 10.7498/aps.71.20211632
    [17] 万新阳, 章烨辉, 陆帅华, 吴艺蕾, 周跫桦, 王金兰. 机器学习加速搜寻新型双钙钛矿氧化物光催化剂. 物理学报, 2022, 71(17): 177101. doi: 10.7498/aps.71.20220601
    [18] 林键, 叶梦, 朱家纬, 李晓鹏. 机器学习辅助绝热量子算法设计. 物理学报, 2021, 70(14): 140306. doi: 10.7498/aps.70.20210831
    [19] 陈江芷, 杨晨温, 任捷. 基于波动与扩散物理系统的机器学习. 物理学报, 2021, 70(14): 144204. doi: 10.7498/aps.70.20210879
    [20] 杨自欣, 高章然, 孙晓帆, 蔡宏灵, 张凤鸣, 吴小山. 铅基钙钛矿铁电晶体高临界转变温度的机器学习研究. 物理学报, 2019, 68(21): 210502. doi: 10.7498/aps.68.20190942
计量
  • 文章访问数:  1707
  • PDF下载量:  61
  • 被引次数: 0
出版历程
  • 收稿日期:  2025-03-20
  • 修回日期:  2025-04-19
  • 上网日期:  2025-04-29
  • 刊出日期:  2025-07-05

/

返回文章
返回