Struggling to get PPO to work for pickup & delivery task — stuck, need for guidance
Hi everyone, we’re a group of students working on a reinforcement learning project and we’re honestly pretty stuck. We’ve been trying to solve this for weeks and feel like we’re missing some fundamental understanding. Below are the main problems we’re facing: Problem Setup We train an agent to pick up items and deliver them to a target on a small grid. We use PPO as we thought it is most advanced We split the reward 50/50 between pickup […]