搜索

x
中国物理学会期刊

机器学习在宇宙线粒子鉴别中的应用

CSTR: 32037.14.aps.72.20230334

Application of machine learning in cosmic ray particle identification

CSTR: 32037.14.aps.72.20230334
PDF
HTML
导出引用
  • 基于热中子探测器实验模拟数据, 使用决策树(decision tree, DT)、随机森林(random forest, RF)和BP神经网络(back-propagation neural network, BPNN)构建了宇宙线粒子鉴别机器学习模型, 对每种粒子分别使用不同的机器学习算法基于模拟数据进行模型训练, 并针对算法进行超参数调整, 将每种算法的AUC值和Q品质因子作为粒子成分鉴别的评价指标. 实验结果表明, 不同机器学习模型对粒子预测精度影响很大. 在测试检验中, 经过交叉网格搜索方法调参后的决策树鉴别模型对中成分(碳氮氧和镁铝硅)比较敏感, 鉴别模型AUC值均在0.95以上, Q品质因子均大于6; 经交叉网格搜索方法调参后的随机森林鉴别模型对于宇宙线粒子鉴别的效果最好, 所有粒子鉴别模型的AUC值均大于0.92且Q品质因子均在4以上; BP神经网络算法只对质子和铁核比较敏感. 本研究对宇宙线粒子鉴别和筛选提供了新的方法和选择, 可为热中子探测器后续开展宇宙线能谱测量提供新思路.

     

    Machine learning algorithms can learn the rules and patterns of big data through computers, excavate potential information hidden behind the data, and be widely used to solve classification, regression, clustering, and other problems. Firstly, this paper uses CORSIKA software to simulate the process of cosmic ray cascade shower in the atmosphere, generating information such as the initial energy, zenith angle, azimuth angle of cosmic ray particles. Then, this paper uses the Geant4 toolkit to conduct thermal neutron detector response simulation, generating 4000 particles in each of proton, helium, CNO, MgAlSi and iron. Based on the experimental simulation data of thermal neutron detector, this paper constructs machine learning models for identifying cosmic ray particles by using decision tree (DT), random forest (RF) and BP neural network (BP NN) respectively. For each particle, all the machine learning algorithms are used for model training based on the simulation data. The cross grid search method is used to adjust the hyper parameters of each machine learning algorithm. The AUC value and Q quality factor value of each algorithm are used as evaluation indexes for particle composition identification. The AUC value is a general indicator for evaluating algorithm performance in machine learning and the Q quality factor value is an evaluation index commonly used in the field of high energy physics. The Experimental results show that different machine learning models have great influence on particle prediction accuracy, and the random forest cosmic ray particle identification model has sufficient accuracy and generalization capability. In the test, the decision tree algorithm adjusted by cross grid search method is sensitive to the medium components (CNO and MgAlSi). The AUC values of the algorithm are all above 0.95 and the Q quality factor values are all above 6. The random forest algorithm adjusted by the cross grid search method has the best effect on the identification of cosmic ray particles. The AUC values of the algorithm are all more than 0.92 and the Q quality factor values are all more than 4. The BP neural network algorithm is only sensitive to proton and iron. This study provides a new method and selection for identifying and screening the cosmic ray particles and it also provides a new idea for the following measurement of cosmic ray energy spectrum by thermal neutron detector.

     

    目录

    /

    返回文章
    返回