机器学习的量子动力学

王鹏; 麦麦提尼亚孜·麦麦提阿卜杜拉

doi:10.7498/aps.74.20240999

摘要

基于第一性原理思想, 采用量子动力学方法对机器学习的迭代运动过程进行建模. 在机器学习的参数空间定义广义目标函数, 利用Schrödinger方程和势能等效得到机器学习过程的量子动力学方程, 通过Wick转动进一步建立了量子动力学与热动力学的关系, 这为利用物理理论和数学理论对机器学习的迭代过程进行研究提供了可能. 本文工作将机器学习的迭代过程转化为含时偏微分方程来进行精确数学表述, 该方程表明机器学习过程可能存在多尺度的退火过程和同一尺度下的时间演化过程. 利用量子动力学方程证明了机器学习在时间演化时的收敛性, 解释了机器学习中的扩散模型是量子动力学方程在经典近似和低阶泰勒近似下的映射模型, 导出了人工智能中常用的Softmax和Sigmoid函数. 这些结果表明量子动力学方法在研究机器学习理论中是有效的.

关键词:

Abstract

In order to solve the current lack of rigorous theoretical models in the machine learning process, in this paper the iterative motion process of machine learning is modeled by using quantum dynamic method based on the principles of first-principles thinking. This approach treats the iterative evolution of algorithms as a physical motion process, defines a generalized objective function in the parameter space of machine learning algorithms, and regards the iterative process of machine learning as the process of seeking the optimal value of this generalized objective function. In physical terms, this process corresponds to the system reaching its ground energy state. Since the dynamic equation of a quantum system is the Schrödinger equation, we can obtain the quantum dynamic equation that describes the iterative process of machine learning by treating the generalized objective function as the potential energy term in the Schrödinger equation. Therefore, machine learning is the process of seeking the ground energy state of the quantum system constrained by a generalized objective function. The quantum dynamic equation for machine learning transforms the iterative process into a time-dependent partial differential equation for precise mathematical representation, enabling the use of physical and mathematical theories to study the iterative process of machine learning. This provides theoretical support for implementing the iterative process of machine learning by using quantum computers. In order to further explain the iterative process of machine learning on classical computers by using quantum dynamic equation, the Wick rotation is used to transform the quantum dynamic equation into a thermodynamic equation, demonstrating the convergence of the time evolution process in machine learning. The system will be transformed into the ground energy state as time approaches infinity. Taylor expansion is used to approximate the generalized objective function, which has no analytical expression in the parameter space. Under the zero-order Taylor approximation of the generalized objective function, the quantum dynamic equation and thermodynamic equation for machine learning degrade into the free-particle equation and diffusion equation, respectively. This result indicates that the most basic dynamic processes during the iteration of machine learning on quantum computers and classical computers are wave packet dispersion and wave packet diffusion, respectively, thereby explaining, from a dynamic perspective, the basic principles of diffusion models that have been successfully utilized in the generative neural networks in recent years. Diffusion models indirectly realize the thermal diffusion process in the parameter space by adding Gaussian noise to and removing Gaussian noise from the image, thereby optimizing the generalized objective function in the parameter space. The diffusion process is the dynamic process in the zero-order approximation of the generalized objective function. Meanwhile, we also use the thermodynamic equation of machine learning to derive the Softmax function and Sigmoid function, which are commonly used in artificial intelligence. These results show that the quantum dynamic method is an effective theoretical approach to studying the iterative process of machine learning, which provides a rigorous mathematical and physical model for studying the iterative process of machine learning on both quantum computers and classical computers.

Keywords:

作者及机构信息

西南民族大学计算机科学与工程学院, 成都　610225

通信作者: 王鹏, wp002005@163.com

Authors and contacts

School of Computer Science and Engineering, Southwest Minzu University, Chengdu 610225, China

Corresponding author: WANG Peng, wp002005@163.com

文章全文

参考文献

[1]	Metropolis N, Rosenbluth A W, Rosenbluth M N, Teller A H, Teller E 1953 J. Chem. Phys. 21 1087 Google Scholar
[2]	Kirkpatrick S, Gelatt C D, Vecchi M P 1983 Science 220 671 Google Scholar
[3]	Finnila A B, Gomez M A, Sebenik C, Stenson C, Doll J D 1994 Chem. Phys. Lett. 219 343 Google Scholar
[4]	Wang F, Wang P 2024 Quantum Inf. Process. 23 66 Google Scholar
[5]	王鹏, 辛罡 2023 自动化学报 49 2396 Google Scholar Wang P, Xin G 2023 Acta Autom. Sin. 49 2396 Google Scholar
[6]	王鹏, 黄焱, 任超, 郭又铭 2013 电子学报 41 2468 Google Scholar Wang P, Huang Y, Ren C, Guo Y 2013 Acta Electron. Sin. 41 2468 Google Scholar
[7]	王鹏, 王方 2022 电子科技大学学报(自然科学版) 51 2 Google Scholar Wang P, Wang F 2022 J. Univ. Electron. Sci. Technol. (Nat. Sci. Ed.) 51 2 Google Scholar
[8]	Johnson M W, Amin M H S, Gildert S 2011 Nature 473 194 Google Scholar
[9]	Sohl-Dickstein J, Weiss E, Maheswaranathan N, Ganguli S 2015 Proceedings of the 32 ^nd International Conference on Machine Learning Lille, France, July 7–9, 2015 p2256
[10]	Song Y, Sohl-Dickstein J, Kingma D P, Kumar A, Ermon S, Poole B 2020 arXiv: 2011.13456 [cs.LG]
[11]	Xin G, Wang P, Jiao Y 2021 Expert. Syst. Appl. 185 115615 Google Scholar
[12]	Jin J, Wang P 2021 Swarm Evol. Comput. 65 100916 Google Scholar
[13]	Wick G C 1954 Phys. Rev. 96 1124 Google Scholar
[14]	Dhariwal P, Nichol A 2021 Advances in Neural Information Processing Systems (NeurIPS 2021) December 7–10, 2021 (Virtual-only Conference) p8780
[15]	Ho J, Jain A, Abbeel P 2020 Advances in Neural Information Processing Systems (NeurIPS 2020) December 6–12, 2020 (Virtual-only Conference) p6840
[16]	Nichol A Q, Dhariwal P 2021 Proceedings of the 38th International Conference on Machine Learning July 18–24, 2021 (Virtual-only Conference) p8162
[17]	Lim S, Yoon E, Byun T, Kang T, Kim S, Lee K, Choi S 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p37799
[18]	Anderson J B 1975 J. Chem. Phys. 63 1499 Google Scholar
[19]	Kosztin I, Faber B, Schulten K 1996 Am. J. Phys. 64 633 Google Scholar
[20]	Haghighi M K, Lüchow A 2017 J. Phys. Chem. A 121 6165 Google Scholar
[21]	Jeong J, Shin J 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p67374
[22]	Morawietz T, Artrith N 2021 J. Comput. Aid. Mol. Des. 35 557 Google Scholar

施引文献

图 1 优化问题的量子动力学框架

Fig. 1. Quantum dynamical framework for optimization problems

下载: 全尺寸图片幻灯片

图 2 波包色散过程

Fig. 2. Process of wave packet dispersion

下载: 全尺寸图片幻灯片

图 3 波包色散到经典扩散的转化

Fig. 3. Transition from wave packet dispersion to classical diffusion

下载: 全尺寸图片幻灯片

图 4 Sigmoid函数随时间的演化

Fig. 4. Evolution of the Sigmoid function over time

下载: 全尺寸图片幻灯片

图 5 扩散模型的量子动力学诠释

Fig. 5. Quantum dynamical interpretation of diffusion models

下载: 全尺寸图片幻灯片

图 6 参数空间的采样映射

Fig. 6. Sampling mapping of parameter space

下载: 全尺寸图片幻灯片

图 7 基于扩散模型的推理结构

Fig. 7. Inference structure based on diffusion models

下载: 全尺寸图片幻灯片

[1]	Metropolis N, Rosenbluth A W, Rosenbluth M N, Teller A H, Teller E 1953 J. Chem. Phys. 21 1087 Google Scholar
[2]	Kirkpatrick S, Gelatt C D, Vecchi M P 1983 Science 220 671 Google Scholar
[3]	Finnila A B, Gomez M A, Sebenik C, Stenson C, Doll J D 1994 Chem. Phys. Lett. 219 343 Google Scholar
[4]	Wang F, Wang P 2024 Quantum Inf. Process. 23 66 Google Scholar
[5]	王鹏, 辛罡 2023 自动化学报 49 2396 Google Scholar Wang P, Xin G 2023 Acta Autom. Sin. 49 2396 Google Scholar
[6]	王鹏, 黄焱, 任超, 郭又铭 2013 电子学报 41 2468 Google Scholar Wang P, Huang Y, Ren C, Guo Y 2013 Acta Electron. Sin. 41 2468 Google Scholar
[7]	王鹏, 王方 2022 电子科技大学学报(自然科学版) 51 2 Google Scholar Wang P, Wang F 2022 J. Univ. Electron. Sci. Technol. (Nat. Sci. Ed.) 51 2 Google Scholar
[8]	Johnson M W, Amin M H S, Gildert S 2011 Nature 473 194 Google Scholar
[9]	Sohl-Dickstein J, Weiss E, Maheswaranathan N, Ganguli S 2015 Proceedings of the 32 ^nd International Conference on Machine Learning Lille, France, July 7–9, 2015 p2256
[10]	Song Y, Sohl-Dickstein J, Kingma D P, Kumar A, Ermon S, Poole B 2020 arXiv: 2011.13456 [cs.LG]
[11]	Xin G, Wang P, Jiao Y 2021 Expert. Syst. Appl. 185 115615 Google Scholar
[12]	Jin J, Wang P 2021 Swarm Evol. Comput. 65 100916 Google Scholar
[13]	Wick G C 1954 Phys. Rev. 96 1124 Google Scholar
[14]	Dhariwal P, Nichol A 2021 Advances in Neural Information Processing Systems (NeurIPS 2021) December 7–10, 2021 (Virtual-only Conference) p8780
[15]	Ho J, Jain A, Abbeel P 2020 Advances in Neural Information Processing Systems (NeurIPS 2020) December 6–12, 2020 (Virtual-only Conference) p6840
[16]	Nichol A Q, Dhariwal P 2021 Proceedings of the 38th International Conference on Machine Learning July 18–24, 2021 (Virtual-only Conference) p8162
[17]	Lim S, Yoon E, Byun T, Kang T, Kim S, Lee K, Choi S 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p37799
[18]	Anderson J B 1975 J. Chem. Phys. 63 1499 Google Scholar
[19]	Kosztin I, Faber B, Schulten K 1996 Am. J. Phys. 64 633 Google Scholar
[20]	Haghighi M K, Lüchow A 2017 J. Phys. Chem. A 121 6165 Google Scholar
[21]	Jeong J, Shin J 2023 Advances in Neural Information Processing Systems (NeurIPS 2023) New Orleans, USA, December 10–16, 2023 p67374
[22]	Morawietz T, Artrith N 2021 J. Comput. Aid. Mol. Des. 35 557 Google Scholar

[1]	张童, 王加豪, 田帅, 孙旭冉, 李日. 基于机器学习的铸件凝固过程动态收缩行为. 物理学报, 2025, 74(2): 028103. doi: 10.7498/aps.74.20241581
[2]	王扬, 徐映红, 赵烨丹, 张立溥. (1+1)维非线性薛定谔方程PT对称势函数的数值反演. 物理学报, 2025, 74(13): 134203. doi: 10.7498/aps.74.20250129
[3]	梁晨, 卢少瑜, 黄栋, 陈鑫, 冯岩. 基于机器学习从单颗粒动力学中诊断尘埃等离子体全局性质信息. 物理学报, 2025, 74(20): 205202. doi: 10.7498/aps.74.20251129
[4]	秦成龙, 赵亮, 蒋刚. 机器学习模型预测稀土化合物的热力学稳定性. 物理学报, 2025, 74(13): 130201. doi: 10.7498/aps.74.20250362
[5]	张嘉晖. 蛋白质计算中的机器学习. 物理学报, 2024, 73(6): 069301. doi: 10.7498/aps.73.20231618
[6]	谢国大, 潘攀, 任信钢, 冯乃星, 方明, 李迎松, 黄志祥. 高阶SF-SFDTD方法在含时薛定谔方程求解中的应用研究. 物理学报, 2024, 73(3): 030201. doi: 10.7498/aps.73.20230771
[7]	欧阳鑫健, 张岩星, 王之龙, 张锋, 陈韦嘉, 庄园, 揭晓, 刘来君, 王大威. 面向铁电相变的机器学习: 基于图卷积神经网络的分子动力学模拟. 物理学报, 2024, 73(8): 086301. doi: 10.7498/aps.73.20240156
[8]	张逸凡, 任卫, 王伟丽, 丁书剑, 李楠, 常亮, 周倩. 机器学习结合固溶强化模型预测高熵合金硬度. 物理学报, 2023, 72(18): 180701. doi: 10.7498/aps.72.20230646
[9]	林键, 叶梦, 朱家纬, 李晓鹏. 机器学习辅助绝热量子算法设计. 物理学报, 2021, 70(14): 140306. doi: 10.7498/aps.70.20210831
[10]	陈江芷, 杨晨温, 任捷. 基于波动与扩散物理系统的机器学习. 物理学报, 2021, 70(14): 144204. doi: 10.7498/aps.70.20210879
[11]	申钰田, 孟胜. 光解水的原子尺度机理和量子动力学. 物理学报, 2019, 68(1): 018202. doi: 10.7498/aps.68.20181312
[12]	范桁. 量子计算与量子模拟. 物理学报, 2018, 67(12): 120301. doi: 10.7498/aps.67.20180710
[13]	胡耀垓, 赵正予, 张援农. 电离层钡云释放早期动力学行为的数值模拟. 物理学报, 2012, 61(8): 089401. doi: 10.7498/aps.61.089401
[14]	刘晓静, 张佰军, 李海波, 刘兵, 张春丽, 郭义庆, 张丙新. 应用量子理论方法研究中子双缝衍射. 物理学报, 2010, 59(6): 4117-4122. doi: 10.7498/aps.59.4117
[15]	刘奎, 丁宏林, 张贤高, 余林蔚, 黄信凡, 陈坤基. 量子点浮置栅量子线沟道三栅结构单电子场效应管存储特性的数值模拟. 物理学报, 2008, 57(11): 7052-7056. doi: 10.7498/aps.57.7052
[16]	厉江帆, 单树民, 杨建坤, 姜宗福. 失谐量子频率转换系统薛定谔方程的显式解析解. 物理学报, 2007, 56(10): 5597-5601. doi: 10.7498/aps.56.5597
[17]	辛国锋, 陈国鹰, 花吉珍, 赵润, 康志龙, 冯荣珠, 安振峰. 941nm大功率应变单量子阱激光器的波长设计. 物理学报, 2004, 53(5): 1293-1298. doi: 10.7498/aps.53.1293
[18]	张解放, 徐昌智, 何宝钢. 变量分离法与变系数非线性薛定谔方程的求解探索. 物理学报, 2004, 53(11): 3652-3656. doi: 10.7498/aps.53.3652
[19]	李培咸, 郝跃, 范隆, 张进城, 张金凤, 张晓菊. 基于量子微扰的AlGaN/GaN异质结波函数半解析求解. 物理学报, 2003, 52(12): 2985-2988. doi: 10.7498/aps.52.2985
[20]	刘剑波, 蔡喜平. 一维定态薛定谔方程的宏观模拟解法. 物理学报, 2001, 50(5): 820-824. doi: 10.7498/aps.50.820

计量

文章访问数: 6081
PDF下载量: 304
被引次数: 0

姓名
邮箱
手机号码
标题
留言内容
验证码

搜索

留言板

机器学习的量子动力学