搜索

x
中国物理学会期刊

光谱诊断中神经网络快速分析模型及外推方法

CSTR: 32037.14.aps.74.20241739

Rapid analysis model and extrapolation method of neural network in spectral diagnostic

CSTR: 32037.14.aps.74.20241739
PDF
HTML
导出引用
  • 对于旨在实现高参数和长脉冲运行的磁约束聚变装置而言, 基于离子温度实时测量的等离子体反馈控制至关重要, 而电荷交换复合光谱是等离子体离子温度的基本测量手段. 本文提出了一种基于神经网络的电荷交换复合光谱诊断数据快速分析方法, 并对其跨参数区间的外推能力进行研究. 该研究使用中国环流器二号A装置HL-2A的12.2×104个光谱数据及离线解谱获得的离子温度标签值构成数据集. 模型基于卷积神经网络, 相对于标签值实现了拟合优度 R^2\sim 0.92 的效果, 在推理阶段单光谱耗时小于1 ms, 相比传统方法加速了100—1000倍. 在外推能力方面, 提出基于低温度实验数据生成高温度的合成光谱数据的方法, 并通过在只包含离子温度2 keV以下的训练集中添加大约5%的合成数据, 大幅增加了模型在外推参数区间(2—4 keV)分析的准确性, 将模型在3—4 keV区间测试的误差降低了约60%. 该研究证明了在磁约束核聚变领域利用合成数据提升人工智能算法性能的可行性.

     

    Real-time measurement and feedback control of key plasma parameters are critical for future fusion reactor operation, with ion temperature being a vital control target as part of the triple product for fusion ignition. However, plasma diagnostics often require complex data analysis. A widely used method of obtaining ion temperature T_\mathrmi from charge exchange recombination spectroscopy (CXRS) is iterative spectral fitting, which is time-consuming and requires expert intervention during data analysis. Therefore, the traditional method cannot meet the demand for real-time T_\mathrmi measurement. Neural network (NN), which can learn the underlying relationships between the measured spectra and T_\mathrmi , is a promising approach to cope with this problem. In fact, NN approach has been widely adopted in the field of magnetically confined plasma. Previous study in JET has achieved a satisfactory accuracy for inferring T_\mathrmi from CXRS spectra compared with the traditional fitting results. Recently, the study of disruption prediction has achieved great progress with the help of deep NNs. However, these researches are conducted on steadily-operating devices, where for NN models, the data distribution of training set is similar to that of test set. This is not the case for newly-built tokamaks like HL-3, nor for future fusion reactors such as ITER. For new devices, there will be a period for the plasma parameters to rise from low to high ranges. In this case, investigating the extrapolation capability of NN models based on low parameter training data is of paramount importance.
    A convolutional neural network (CNN)-based model is proposed to accelerate the analysis of spectral data of CXRS, with a focus on investigating the model’s extrapolation capability in a much higher T_\mathrmi range. The dataset contains about 122000 spectral data, as well as their corresponding T_\mathrmi inferred from offline iterative process. The results demonstrate that the CNN-based model achieves excellent analysis of T_\mathrmi as indicated by a coefficient of determination (R²) of 0.92, and reduces the inference time for analyzing a single spectrum to less than 1 ms, reaching 100–1000 times faster than traditional spectral fitting methods. However, the performance of the data-driven neural network model is limited by challenges such as insufficient data and imbalanced data distribution, which further deteriorates the extrapolation capability. Generally, data with higher T_\mathrmi account for a small portion of the total dataset. In our study, only about 5% of the spectra correspond to T_\mathrmi > 2\mathrm \;\mathrmk\mathrme\mathrmV (ranging from 2 to 4 keV). However, they reflect the temperature of central plasma, which is more important for assessing the performance of plasma. To overcome this limitation, this study synthesizes high-temperature data based on experimental data from discharges with T_\mathrmi in low-temperature range. By incorporating 5% synthetic data into the training set only consisting of data with T_\mathrmi < 2\;\mathrm \mathrmk\mathrme\mathrmV , the model’s extrapolation capability is extended to cover the whole range of T_\mathrmi < 4\;\mathrmk\mathrme\mathrmV . The mean relative error (MRE) of the model in the range of 3\;\mathrm \mathrmk\mathrme\mathrmV < T_\mathrmi < 4\;\mathrmk\mathrme\mathrmV is reduced from 35% to below 15%, corresponding to a reduction of approximately 60% relative to the MRE before adding synthetic data. This approach demonstrates the feasibility of using synthetic data to enhance the performance of artificial intelligence algorithms in the field of magnetic confinement fusion. These findings provide valuable insights for the development of real-time ion temperature measurement and feedback control for future high-parameter fusion devices. Furthermore, the study lays a foundation for research in areas that require high-performance cross-device characteristic, such as machine learning-based disruption prediction and tearing mode control.

     

    目录

    /

    返回文章
    返回