Abstract: Reinforcement learning has been widely used to train sequential power control policies in simulation environments for the Internet of Vehicles. However, disturbance is usually inevitably ...