搜索

x
中国物理学会期刊

面向物理神经网络的BCM规则:机制、实现与展望

BCM Rules for Physical Neural Networks: Mechanism, Realization, and Prospects

PDF
导出引用
  • 随着人工智能的发展,利用物理系统模拟实现计算功能的物理神经网络(PNNs)受到了广泛关注。然而,PNNs中非理想特性与噪声累积,使其难以直接采用依赖高精度浮点数进行反向传播的传统训练方法。Bienenstock-Cooper-Munro(BCM)规则作为一种高阶突触模型,通过输入依赖的权重更新规则为神经网络训练提供了一种统计层面的学习与稳态机制。尤其是其内在突触可塑性与稳态可塑性机制的协同作用,对维持PNNs系统学习过程的稳定性至关重要。本文综述了面向PNNs的BCM规则的机制、物理实现与应用前景等方面研究进展。详细介绍了BCM规则的理论模型与动力学特性,深入探讨了以忆阻器为主的新型器件模拟BCM规则的进展,并总结了基于BCM规则学习机制和稳态机制在物理系统中的训练、提高鲁棒性与减缓灾难性遗忘等方面潜力。最后,从PNNs的实际应用出发,展望了以功能为导向的BCM规则面临的机遇与挑战。

     

    Physical neural networks (PNNs) leverage intrinsic device dynamics for energy-efficient computing by mapping neural network operations onto physical parameters. However, practical PNN deployment faces critical challenges: cumulative non-idealities during signal propagation and the mismatch between conventional training algorithms and physical hardware. Biological systems overcome similar limitations through synaptic plasticity combining local learning with homeostatic regulation. The Bienenstock-Cooper-Munro (BCM) rule, a prominent model of synaptic plasticity, addresses these challenges by integrating Hebbian learning with a history-dependent sliding threshold that provides intrinsic stability.
    This review systematically examines BCM rules from three perspectives: theoretical foundations, hardware implementation, and PNN applications. The BCM rule exhibits three essential features: spike-rate dependent plasticity (SRDP), threshold sliding, and non-monotonic enhanced depression effect (EDE). At the device level, first-order memristors naturally emulate SRDP through flux-controlled conductance modulation. Threshold sliding requires second-order dynamics, demonstrated in WOx, HfOx, and STO-based devices where additional state variables modulate competition between potentiation and decay. EDE realization follows two approaches: external coding strategies (e.g., triplet-STDP) that induce non-monotonic behavior through precisely timed spikes, and intrinsic dynamics strategies leveraging competing physical processes in two-terminal devices to achieve EDE under natural spike-rate coding without external circuitry. At the system level, BCM enables unsupervised learning in single-layer, multi-layer, and convolutional networks, approaching backpropagation performance in certain tasks while offering biological plausibility and energy efficiency. The sliding threshold mechanism provides homeostatic regulation that prevents weight divergence, mitigates catastrophic forgetting, and enables self-repair in fault-tolerant systems.
    Despite progress, challenges remain: theoretical gaps between frequency-based BCM and observed spike-timing plasticity, reliance on external coding schemes compromising device simplicity, and parameter sensitivity in complex networks. Future directions include deepening theoretical understanding of BCM dynamics, developing intrinsic device strategies fully capturing BCM features under natural coding, and fostering interdisciplinary collaboration among materials, circuits, and algorithms to enable large-scale robust PNN implementations.

     

    目录

    /

    返回文章
    返回