Fast Reinforcement Learning for Anti-jamming Communications

02/13/2020
by   Pei-Gen Ye, et al.
0

This letter presents a fast reinforcement learning algorithm for anti-jamming communications which chooses previous action with probability τ and applies ϵ-greedy with probability (1-τ). A dynamic threshold based on the average value of previous several actions is designed and probability τ is formulated as a Gaussian-like function to guide the wireless devices. As a concrete example, the proposed algorithm is implemented in a wireless communication system against multiple jammers. Experimental results demonstrate that the proposed algorithm exceeds Q-learing, deep Q-networks (DQN), double DQN (DDQN), and prioritized experience reply based DDQN (PDDQN), in terms of signal-to-interference-plus-noise ratio and convergence rate.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset
Success!
Error Icon An error occurred

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro