๐ค AI Summary
To address the exploration difficulty and excessive noise robustness arising from binary spiking mechanisms in Spiking Neural Networks (SNNs) for deep reinforcement learning, this paper proposes the Temporally Correlated Noise Spiking Action Network (TCN-SAN). The method introduces parameterized, time-domain noise into both the membrane potential accumulation and spike transmission processes of Leaky Integrate-and-Fire (LIF) neuronsโmarking the first such integration in SNN-based RL. A dynamically decaying noise schedule is further designed to jointly regularize policy stability, thereby balancing exploration efficiency and convergence stability. Evaluated on multiple continuous-control benchmarks in OpenAI Gym, TCN-SAN significantly outperforms existing SNN-based and mainstream deep RL exploration methods, achieving an average performance gain of 12.6% and accelerating policy convergence by 37%.
๐ Abstract
As a general method for exploration in deep reinforcement learning (RL), NoisyNet can produce problem-specific exploration strategies. Spiking neural networks (SNNs), due to their binary firing mechanism, have strong robustness to noise, making it difficult to realize efficient exploration with local disturbances. To solve this exploration problem, we propose a noisy spiking actor network (NoisySAN) that introduces time-correlated noise during charging and transmission. Moreover, a noise reduction method is proposed to find a stable policy for the agent. Extensive experimental results demonstrate that our method outperforms the state-of-the-art performance on a wide range of continuous control tasks from OpenAI gym.