基于图形处理器加速数值求解三维含时薛定谔方程

唐富明; 刘凯; 杨溢; 屠倩; 王凤; 王哲; 廖青

doi:10.7498/aps.69.20200700

摘要

量子力学领域中对强激光场与原子分子相互作用的理论研究非常依赖于数值求解含时薛定谔方程. 本文在强场电离的背景下并行求解氢原子的三维含时薛定谔方程. 基于球极坐标系, 采用分裂算符-傅里叶变换方法将含时薛定谔方程进行了离散化. 由此可得到长度规范下的光电子连续态波函数. 图形处理器(GPU)可以依托多线程结构充分发挥细粒度并行的优势, 实现整体算法的并行加速. 计算表明, 相对于中央处理器(CPU), GPU并行计算有着最高约60倍的加速比. 由此可见, 基于GPU加速数值求解三维含时薛定谔方程能够显著缩短计算耗费的时间. 这一工作对利用GPU快速求解三维含时薛定谔方程有着重要的指导意义.

关键词:

Abstract

In the field of quantum mechanics, the theoretical study of the interaction between intense laser field and atoms and molecules depends very much on the numerical solution of the time-dependent Schrödinger equation. However, solving the three-dimensional time-dependent Schrödinger equation is not a simple task, and the analytical solution cannot be obtained, so it can only be solved numerically with the help of computer. In order to shorten the computing time and obtain the results quickly, it is necessary to use parallel methods to speed up computing. In this paper, under the background of strong field ionization, the three-dimensional time-dependent Schrödinger equation of hydrogen atom is solved in parallel, and the suprathreshold ionization of hydrogen atom under the action of linearly polarized infrared laser electric field is taken for example. Based on the spherical polar coordinate system, the time-dependent Schrödinger equation is discretized by the splitting operator-Fourier transform method, and the photoelectron continuous state wave function under the length gauge can be obtained. In Graphics processing unit (GPU) accelerated applications, the sequential portion of the workload runs on central processing unit (CPU) (which is optimized for single-threaded performance), while the compute-intensive part of the application runs in parallel on thousands of GPU cores. The GPU can make full use of the advantage of fine-grained parallelism based on multi-thread structure to realize parallel acceleration of the whole algorithm. Two accelerated computing modes of CPU parallel and GPU parallel are adopted, and their parallel acceleration performance is discussed. Compared with the results from the existing physical laws, the calculation error is also within an acceptable range, and the result is also consistent with the result from the existing physical laws of suprathreshold ionization, which also verifies the correctness of the program. In order to obtain a relatively accurate acceleration ratio, many different experiments are carried out. Computational experiments show that under the condition of ensuring accuracy, the GPU parallel computing speeds by up to about 60 times maximally based on the computational performance of CPU. It can be seen that the accelerated numerical solution of three-dimensional time-dependent Schrödinger equation based on GPU can significantly shorten the computational time. This work has important guiding significance for rapidly solving the three-dimensional time-dependent Schrödinger equation by using GPU.

Keywords:

three-dimensional time-dependent Schrödinger equation /
strong-field ionization /
parallel computing

作者及机构信息

武汉工程大学, 光学信息与模式识别湖北省重点实验室, 武汉　430205

通信作者: 廖青, liaoqing@wit.edu.cn

基金项目: 国家自然科学基金(批准号: 11674257, 11604248, 11874019, 11947096)和湖北省高等学校优秀中青年科技创新团队计划(批准号: T201806)资助的课题

Authors and contacts

Hubei Key Laboratory of Optical Information and Pattern Recognition, Wuhan Institute of Technology, Wuhan 430205, China

Corresponding author: Liao Qing, liaoqing@wit.edu.cn

Funds: Project supported by the National Natural Science Foundation of China (Grant Nos. 11674257, 11604248, 11874019, 11947096) and the Program for Distinguished Middle-aged and Young Innovative Research Team in Higher Education of Hubei Province, China (Grant No. T201806)

文章全文

参考文献

[1]	Corkum P 1993 Phys. Rev. Lett. 71 1994 Google Scholar
[2]	Liu K L, Luo S Q, Li M, Li Y, Feng Y D, Du B J, Zhou Y M, Lu P X, Barth I 2019 Phys. Rev. Lett. 122 053202 Google Scholar
[3]	Lewenstein M, Balcou P, Ivanov M Y, L’Huillier A, Corkum P B 1994 Phys. Rev. A 49 2117 Google Scholar
[4]	Zhang X F, Zhu X S, Wang D, Li L, Liu X, Liao Q, Lan P F, Lu P X 2019 Phys. Rev. A 99 013414 Google Scholar
[5]	Gaarde M B, Tate J L, Schafer K J 2008 J. Phys. B 41 132001 Google Scholar
[6]	Liu K, Qin M Y, Li Q G, Liao Q 2018 Opt. Quantum Electron. 50 364 Google Scholar
[7]	Liao Q, Li Y, Qin M Y, Lu P X 2017 Phys. Rev. A 96 063408 Google Scholar
[8]	Liu K, Wang F, Wang Z, Qin M Y, Liao Q 2019 J. Opt. Soc. Am. B 36 2624 Google Scholar
[9]	Muller H G 1999 Laser Phys. 9 138
[10]	Bauer D, Koval P 2006 Comput. Phys. Commun. 174 396 Google Scholar
[11]	Madsen L B, Nikolopoulos L A A, Kjeldsen T K, Fernández J 2007 Phys. Rev. A 76 063407 Google Scholar
[12]	Keldysh L V 1964 Sov. Phys. JETP 20 1307
[13]	Faisal F H M 1973 J. Phys. B: At. Mol. Opt. Phys. 6 L89 Google Scholar
[14]	Reiss H R 1980 Phys. Rev. A 22 1786 Google Scholar
[15]	Gallagher T F 1988 Phys. Rev. Lett. 61 2304 Google Scholar
[16]	Corkum P B, Burnett N H, Brunel F 1989 Phys. Rev. Lett. 62 1259 Google Scholar
[17]	肖相如, 王慕雪, 黎敏, 耿基伟, 刘运全, 彭良友 2016 物理学报 65 220203 Google Scholar Xiao X R, Wang M X, Li M, Geng J W, Liu Y Q, Peng L Y 2016 Acta Phys. Sin. 65 220203 Google Scholar
[18]	Gainullin I 2017 Comput. Phys. Commun. 72 210
[19]	Liu Q, Liu F, Hou C 2020 Procedia Computer Sci. 171 312 Google Scholar
[20]	Penfold T J 2017 Phys. Chem. Chem. Phys. 19 19601 Google Scholar
[21]	Broin C Ó2015 Ph. D. Dissertation (Dublin: Dublin City University)
[22]	Broin C Ó, Nikolopoulos L A A 2014 Comput. Phys. Commun. 185 1791 Google Scholar
[23]	Feit M D, Fleck J A, Steiger A 1982 J. Comput. Phys. 47 412 Google Scholar
[24]	Kjeldsen T K 2007 Ph. D. Dissertation (Arhus: University of Arhus)

施引文献

图 1 数据传输流程图

Fig. 1. The flowchart of data transmission.

下载: 全尺寸图片幻灯片

图 2 加速比随着角量子数的变化

Fig. 2. Speedup ratio as a function of angular quantum number.

下载: 全尺寸图片幻灯片

图 3 加速比随着径向网格点的变化

Fig. 3. Speedup ratio as a function of radial grid point.

下载: 全尺寸图片幻灯片

图 4 加速比随着矩阵大小的变化

Fig. 4. Speedup ratio as a function of the size of matrix.

下载: 全尺寸图片幻灯片

图 5 加速比随着矩阵大小的变化

Fig. 5. Speedup ratio as a function of the size of matrix.

下载: 全尺寸图片幻灯片

图 6 氢原子的光电子末态动量分布　(a) CPU计算结果; (b) GPU计算结果

Fig. 6. Photoelectron final-state momentum distributions of hydrogen atom: (a) Calculation results of CPU; (b) calculation results of GPU.

下载: 全尺寸图片幻灯片

表 1 TDSE算法步骤

Table 1. TDSE algorithm steps.

算法　　　$\varPhi (t + \Delta t) = {{\rm{e}}^{ - {\rm{i}}H(t)\Delta t}}\varPhi (t)$
Input: ${f_l}({r_i}, t)$
Output: ${f_l}({r_i}, t)$
1. for n do
2.　for l do
3.　　${f_l}({r_i}, t) = {\rm{ifft} }\left( { {\rm{diag} }\Big( { { {\rm{e} }^{ - {\rm{i} }\tfrac{ {\Delta t} }{2}\tfrac{ { {k^2} } }{2} } } } \Big) \cdot {\rm{fft} }\left( { {f_l}({r_i}, t)} \right)} \right)$
4.　end for
5.　for i and l do
6.　　${f_l}({r_i}, t) = { {\rm{e} }^{ - {\rm{i} }\tfrac{ {\Delta t} }{2}\left[ {\tfrac{ {l(l + 1)} }{ {2 r_i^2} }\, - \, \frac{1}{ { {r_i} } } } \right]} } \cdot {f_l}({r_i}, t)$
7.　end for
8.　for i and j do
9.　　$\varPhi ({r_i}, {x_j}, t) = \sum\limits_{l = 0}^L {{f_l}({r_i}, t){P_l}({x_j})} $
10. end for
11.　for i and j do
12.　$\left\| {\varPhi ({r_i}, {x_j}, t)} \right\rangle = { {\rm{e} }^{ {\rm{i} }\Delta tE(n){r_i}{x_j} } } \cdot \left\| {\varPhi ({r_i}, {x_j}, t)} \right\rangle$
13.　end for
14.　for i and j do
15.　${f_l}({r_i}, t) = \sum\limits_{j = 1}^{L + 1} {{w_j}{P_l}({x_j})} \varPhi ({r_i}, {x_j}, t)$
16.　end for
17. for i and l do
18.　${f_l}({r_i}, t) = { {\rm{e} }^{ - {\rm{i} }\tfrac{ {\Delta t} }{2}\left[ {\tfrac{ {l(l + 1)} }{ {2 r_i^2} }\, - \, \frac{1}{ { {r_i} } } } \right]} } \cdot {f_l}({r_i}, t)$
19. end for
20. for l do
21.　${f_l}({r_i}, t) = {\rm{ifft} }\left( { {\rm{diag} }\Big( { { {\rm{e} }^{ - {\rm{i} }\tfrac{ {\Delta t} }{2}\tfrac{ { {k^2} } }{2} } } } \Big) \cdot {\rm{fft} }\left( { {f_l}({r_i}, t)} \right)} \right)$
22. end for
23. end for

下载: 导出CSV

表 2 不同角量子数下CPU与GPU的计算时间比较

Table 2. Computation time of CPU and GPU under different angular quantum numbers.

角量子数L	计算时间/s
角量子数L	CPU	GPU
4	2164.309	159.368
9	4120.602	164.418
19	7922.537	205.440
39	17682.308	378.104
79	36774.347	757.198

下载: 导出CSV

表 3 不同径向网格点下CPU与GPU的计算时间比较

Table 3. Computation time of CPU and GPU under different radial grid points.

径向网格点数R	计算时间/s
径向网格点数R	CPU	GPU
2¹²	1118.348	148.302
2¹³	1871.128	154.614
2¹⁴	3846.120	160.763
2¹⁵	7922.537	205.440
2¹⁶	16862.467	354.554

下载: 导出CSV

表 4 不同矩阵大小下CPU与GPU的计算时间比较

Table 4. Computation time of CPU and GPU under different matrix sizes.

矩阵大小	计算时间/s
矩阵大小	CPU	GPU
5 × 2¹²	199.158	149.895
10 × 2¹³	965.276	166.039
20 × 2¹⁴	3846.120	160.763
40 × 2¹⁵	17682.308	378.104
80 × 2¹⁶	74761.695	1524.669

下载: 导出CSV

表 5 不同矩阵大小下CPU与GPU的计算时间比较

Table 5. Computation time of CPU and GPU under different matrix sizes.

矩阵大小	计算时间/s
矩阵大小	CPU	GPU
5 × 2¹²	437.584	315.448
10 × 2¹³	2075.667	463.183
20 × 2¹⁴	9252.539	629.088
40 × 2¹⁵	40617.723	814.985
80 × 2¹⁶	182135.643	3024.669

下载: 导出CSV

[1]	Corkum P 1993 Phys. Rev. Lett. 71 1994 Google Scholar
[2]	Liu K L, Luo S Q, Li M, Li Y, Feng Y D, Du B J, Zhou Y M, Lu P X, Barth I 2019 Phys. Rev. Lett. 122 053202 Google Scholar
[3]	Lewenstein M, Balcou P, Ivanov M Y, L’Huillier A, Corkum P B 1994 Phys. Rev. A 49 2117 Google Scholar
[4]	Zhang X F, Zhu X S, Wang D, Li L, Liu X, Liao Q, Lan P F, Lu P X 2019 Phys. Rev. A 99 013414 Google Scholar
[5]	Gaarde M B, Tate J L, Schafer K J 2008 J. Phys. B 41 132001 Google Scholar
[6]	Liu K, Qin M Y, Li Q G, Liao Q 2018 Opt. Quantum Electron. 50 364 Google Scholar
[7]	Liao Q, Li Y, Qin M Y, Lu P X 2017 Phys. Rev. A 96 063408 Google Scholar
[8]	Liu K, Wang F, Wang Z, Qin M Y, Liao Q 2019 J. Opt. Soc. Am. B 36 2624 Google Scholar
[9]	Muller H G 1999 Laser Phys. 9 138
[10]	Bauer D, Koval P 2006 Comput. Phys. Commun. 174 396 Google Scholar
[11]	Madsen L B, Nikolopoulos L A A, Kjeldsen T K, Fernández J 2007 Phys. Rev. A 76 063407 Google Scholar
[12]	Keldysh L V 1964 Sov. Phys. JETP 20 1307
[13]	Faisal F H M 1973 J. Phys. B: At. Mol. Opt. Phys. 6 L89 Google Scholar
[14]	Reiss H R 1980 Phys. Rev. A 22 1786 Google Scholar
[15]	Gallagher T F 1988 Phys. Rev. Lett. 61 2304 Google Scholar
[16]	Corkum P B, Burnett N H, Brunel F 1989 Phys. Rev. Lett. 62 1259 Google Scholar
[17]	肖相如, 王慕雪, 黎敏, 耿基伟, 刘运全, 彭良友 2016 物理学报 65 220203 Google Scholar Xiao X R, Wang M X, Li M, Geng J W, Liu Y Q, Peng L Y 2016 Acta Phys. Sin. 65 220203 Google Scholar
[18]	Gainullin I 2017 Comput. Phys. Commun. 72 210
[19]	Liu Q, Liu F, Hou C 2020 Procedia Computer Sci. 171 312 Google Scholar
[20]	Penfold T J 2017 Phys. Chem. Chem. Phys. 19 19601 Google Scholar
[21]	Broin C Ó2015 Ph. D. Dissertation (Dublin: Dublin City University)
[22]	Broin C Ó, Nikolopoulos L A A 2014 Comput. Phys. Commun. 185 1791 Google Scholar
[23]	Feit M D, Fleck J A, Steiger A 1982 J. Comput. Phys. 47 412 Google Scholar
[24]	Kjeldsen T K 2007 Ph. D. Dissertation (Arhus: University of Arhus)

[1]	谢国大, 潘攀, 任信钢, 冯乃星, 方明, 李迎松, 黄志祥. 高阶SF-SFDTD方法在含时薛定谔方程求解中的应用研究. 物理学报, 2024, 73(3): 030201. doi: 10.7498/aps.73.20230771
[2]	车佳殷, 陈超, 李卫艳, 李维, 陈彦军. 强场原子电离响应时间的研究进展. 物理学报, 2023, 72(19): 193301. doi: 10.7498/aps.72.20230983
[3]	邵绪强, 梅鹏, 陈文新. 基于稳定性SPH-SWE数值模型的真实感流体动画实时模拟. 物理学报, 2021, 70(23): 234701. doi: 10.7498/aps.70.20211251
[4]	张仁强, 蒋翔宇, 俞炯弛, 曾充, 宫明, 徐顺. 格点量子色动力学蒸馏算法中关联函数的计算优化. 物理学报, 2021, 70(16): 161201. doi: 10.7498/aps.70.20210030
[5]	张斌, 赵健, 赵增秀. 基于多组态含时Hartree-Fock方法研究电子关联对于H2分子强场电离的影响. 物理学报, 2018, 67(10): 103301. doi: 10.7498/aps.67.20172701
[6]	肖俊, 李登宇, 王雅丽, 史祎诗. 并行化叠层成像算法研究. 物理学报, 2016, 65(15): 154203. doi: 10.7498/aps.65.154203
[7]	张义招, 包芸. 三维湍流Rayleigh-Bénard热对流的高效并行直接求解方法. 物理学报, 2015, 64(15): 154702. doi: 10.7498/aps.64.154702
[8]	刘玉柱, Gerber Thomas, Knopp Gregor. 利用强场多光子电离技术实现对多原子分子离子振动量子态的光学操控. 物理学报, 2014, 63(24): 244208. doi: 10.7498/aps.63.244208
[9]	林晨森, 陈硕, 李启良, 杨志刚. 耗散粒子动力学GPU并行计算研究. 物理学报, 2014, 63(10): 104702. doi: 10.7498/aps.63.104702
[10]	黄培培, 刘大刚, 刘腊群, 王辉辉, 夏梦局, 陈颖. 单路脉冲功率真空装置的三维数值模拟研究. 物理学报, 2013, 62(19): 192901. doi: 10.7498/aps.62.192901
[11]	丁锐, 金亚秋. 随机Gauss粗糙面上三维导体目标散射差场的随机泛函解析计算方法. 物理学报, 2011, 60(12): 124102. doi: 10.7498/aps.60.124102
[12]	周庆, 何校栋, 胡月. 用简单物理模型构建通用对称加密系统. 物理学报, 2011, 60(9): 094701. doi: 10.7498/aps.60.094701
[13]	廖臣, 刘大刚, 刘盛纲. 三维电磁粒子模拟并行计算的研究. 物理学报, 2009, 58(10): 6709-6718. doi: 10.7498/aps.58.6709
[14]	刘运全, 张杰, 武慧春, 盛政明. 超短电子脉冲在聚焦强激光场中的三维有质动力散射. 物理学报, 2006, 55(3): 1176-1180. doi: 10.7498/aps.55.1176
[15]	谭荣, 李高翔. 低频强场作用下三维光子晶体中二能级原子的自发辐射性质. 物理学报, 2005, 54(5): 2059-2065. doi: 10.7498/aps.54.2059
[16]	刘剑波, 蔡喜平. 一维定态薛定谔方程的宏观模拟解法. 物理学报, 2001, 50(5): 820-824. doi: 10.7498/aps.50.820
[17]	李学信, 徐至展, 汤燕. 一维含时薛定谔方程的数值求解及其在强场高次谐波中的应用. 物理学报, 1997, 46(2): 267-271. doi: 10.7498/aps.46.267
[18]	陈荣清, 徐至展, 孙岚, 姚关华, 张文琦, 李萍. 激光方脉冲作用下强场自电离过程. 物理学报, 1991, 40(10): 1584-1589. doi: 10.7498/aps.40.1584
[19]	仝晓民, 李家明. 强激光场阈上电离理论. 物理学报, 1991, 40(2): 190-197. doi: 10.7498/aps.40.190
[20]	姚关华, 徐至展. 强激光场诱导自电离中的相干效应. 物理学报, 1988, 37(11): 1760-1766. doi: 10.7498/aps.37.1760

计量

文章访问数: 9388
PDF下载量: 166
被引次数: 0

姓名
邮箱
手机号码
标题
留言内容
验证码

搜索

留言板