DQN agent not moving after performing technique?

digitado ⋅ March 10, 2026

The agent learned and performed a difficult technique, but it stops moving afterwards, even though there are more points to be had.

What could explain this behavior?

I'm using Stable-Baselines3 DQN with this configuration:

    from stable_baselines3 import DQN

    model = DQN(
        policy="CnnPolicy",
        env=train_env,
        learning_rate=1e-4,
        buffer_size=500_000,
        optimize_memory_usage=True,
        replay_buffer_kwargs={"handle_timeout_termination": False},
        learning_starts=10_000,  # Warm up with random actions first
        batch_size=32,
        gamma=0.99,
        target_update_interval=1_000,
        train_freq=4,
        gradient_steps=1,
        exploration_fraction=0.3,
        exploration_initial_eps=1.0,
        exploration_final_eps=0.01,
        tensorboard_log=TENSORBOARD_DIR,
        verbose=1,
    )
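For context on the exploration settings above, here is a minimal sketch of the linear epsilon schedule they imply: epsilon falls from `exploration_initial_eps` to `exploration_final_eps` over the first `exploration_fraction` of training and then stays there. The helper `linear_epsilon` and the total of 1,000,000 timesteps are assumptions for illustration only; the post does not state the training length, and this is a plain-Python approximation rather than the library's own code.

```python
def linear_epsilon(step, total_timesteps, initial_eps=1.0, final_eps=0.01, fraction=0.3):
    """Hypothetical helper: anneal epsilon linearly over the first
    `fraction` of training, then hold it at `final_eps`."""
    progress = step / total_timesteps
    if progress >= fraction:
        return final_eps
    return initial_eps + progress * (final_eps - initial_eps) / fraction


if __name__ == "__main__":
    TOTAL_TIMESTEPS = 1_000_000  # assumed value, not taken from the post
    for step in (0, 150_000, 300_000, 900_000):
        eps = linear_epsilon(step, TOTAL_TIMESTEPS)
        print(f"step={step:>9,}  epsilon={eps:.3f}")
```

With these settings the agent acts almost entirely greedily for the last 70% of training, so it may be worth checking whether the learned Q-values after the technique favor a no-op action, since very little random exploration remains to push it out of that state.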

submitted by /u/Handy_Cap