杨光 钞苏亚 聂敏 刘原华 张美玲



Construction method of hybrid quantum long-short term memory neural network for image classification

Yang Guang, Chao Su-Ya, Nie Min, Liu Yuan-Hua, Zhang Mei-Ling
  • Long short-term memory (LSTM) neural networks introduce memory cells to address long-term dependencies and the vanishing and exploding gradient problems, and are widely used in time-series analysis and prediction. Combining quantum computing with LSTM networks can help improve computational efficiency and reduce the number of model parameters, and thus significantly improve on the performance of the classical LSTM network. This paper proposes a hybrid quantum LSTM (HQLSTM) network model for image classification. It replaces the neurons in the classical LSTM network with variational quantum circuits to realize the memory function of the quantum network, and introduces the Choquet discrete integral operator to enhance the degree of aggregation between data. The memory cell of the HQLSTM network consists of several variational quantum circuits (VQCs) that perform different functions. Each VQC has three parts: an encoding layer, which uses angle encoding to reduce the complexity of the network model design; a variational layer, which is designed with the quantum natural gradient optimization algorithm so that the gradient descent direction does not target specific parameters, thereby optimizing the parameter update process and improving the generalization and convergence speed of the network model; and a measurement layer, which measures with the Pauli Z gate and feeds the expectation value of the measurement result to the next layer to extract useful information from the quantum circuit. Image classification experiments on the MNIST, FASHION-MNIST and CIFAR datasets show that the HQLSTM model achieves higher classification accuracy and lower loss values than the classical LSTM and quantum LSTM models. In addition, the space complexity of the HQLSTM and quantum LSTM networks is significantly lower than that of the classical LSTM network.
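The three-part VQC described in the abstract (angle encoding, a trainable variational layer, and Pauli Z measurement) can be illustrated with a small state-vector simulation. This is only a sketch under stated assumptions, not the circuit used in the paper: the two-qubit size, the RY encoding gates, the single CNOT entangler, and the function names `ry` and `vqc_expectations` are all hypothetical choices made for illustration.

```python
import numpy as np

I2 = np.eye(2)
Z = np.array([[1.0, 0.0], [0.0, -1.0]])
# CNOT with qubit 0 (left tensor factor) as control and qubit 1 as target.
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=float)


def ry(angle):
    """Single-qubit rotation about the Y axis (real-valued matrix)."""
    c, s = np.cos(angle / 2), np.sin(angle / 2)
    return np.array([[c, -s], [s, c]])


def vqc_expectations(x, theta):
    """Return <Z> on each of two qubits for inputs x and parameters theta.

    Assumed layout (illustrative only):
      1. encoding layer: RY(x[i]) on qubit i, starting from |00>
      2. variational layer: CNOT, then RY(theta[i]) on qubit i
      3. measurement layer: expectation value of Pauli Z on each qubit
    """
    state = np.zeros(4)
    state[0] = 1.0                                        # |00>
    state = np.kron(ry(x[0]), ry(x[1])) @ state           # angle encoding
    state = CNOT @ state                                  # entangle
    state = np.kron(ry(theta[0]), ry(theta[1])) @ state   # trainable rotations
    z0 = state @ np.kron(Z, I2) @ state                   # <Z> on qubit 0
    z1 = state @ np.kron(I2, Z) @ state                   # <Z> on qubit 1
    return np.array([z0, z1])
```

The returned expectation values lie in [-1, 1] and play the role of the classical outputs that the abstract says are passed to the next layer. In a trained model the entries of `theta` would be updated by an optimizer; the abstract names the quantum natural gradient for this, which is not implemented in this sketch.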
      Corresponding author: Chao Su-Ya, 1920464642@qq.com
    • Funds: Project supported by the National Natural Science Foundation of China (Grant Nos. 61971348, 61201194) and the Natural Science Basic Research Program of Shaanxi Province, China (Grant No. 2021JM-464).
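The abstract also mentions a Choquet discrete integral operator for aggregating data. As a minimal sketch of the standard discrete Choquet integral (not the paper's specific operator or its learned fuzzy measure), the function below sorts the inputs in ascending order and weights each increment by the measure of the set of inputs that are at least that large. The function name `choquet_integral`, the dictionary-based measure, and the example measure values are assumptions made for illustration; the inputs are assumed to be non-negative.

```python
def choquet_integral(values, measure):
    """Discrete Choquet integral of non-negative `values`.

    `measure` maps frozensets of input indices to fuzzy-measure values
    (monotone, with the full index set mapped to 1). Only the nested
    "upper" sets that arise from the sort order are looked up.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    total = 0.0
    previous = 0.0
    for rank, idx in enumerate(order):
        # Indices whose value is at least values[idx].
        upper_set = frozenset(order[rank:])
        total += (values[idx] - previous) * measure[upper_set]
        previous = values[idx]
    return total


# Hypothetical two-input fuzzy measure (illustrative numbers only).
example_measure = {
    frozenset({0}): 0.3,
    frozenset({1}): 0.5,
    frozenset({0, 1}): 1.0,
}
example_result = choquet_integral([0.2, 0.6], example_measure)
```

When the measure is additive, this reduces to an ordinary weighted sum; a non-additive measure lets the aggregation reflect interaction between inputs.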

  • Fig. 1.  The structure of the LSTM network model.

    Fig. 2.  Diagram of the two-dimensional Choquet discrete integral operator.

    Fig. 3.  The structure of the HQLSTM network model.

    Fig. 4.  The VQC structure in the HQLSTM cell.

    Fig. 5.  Part of the VQC structure used to compute the Fubini-Study metric tensor.

    Fig. 6.  Dataset image samples: (a) MNIST dataset; (b) FASHION-MNIST dataset; (c) CIFAR dataset.

    Fig. 7.  MNIST dataset: (a) comparison of classification accuracy; (b) comparison of loss function values.

    Fig. 8.  Comparison of loss values of different optimization algorithms.

    Fig. 9.  FASHION-MNIST dataset: (a) comparison of classification accuracy; (b) comparison of loss function values.

    Fig. 10.  Comparison of loss values of different optimization algorithms.

    Fig. 11.  CIFAR color dataset: (a) comparison of classification accuracy; (b) comparison of loss function values.

    Table 1.  LSTM network model parameters.

    Table 2.  QLSTM and HQLSTM network model parameters.

    Table 3.  Comparison of image classification accuracy of different network models.

    Ref. [35]    MNIST    10    97.894
