PPO vs SAC on real robot
I’m working on an RL project using two algorithms: SAC and PPO. The project consists of a robot made up of 3 arms (controlled by 3 servos) attached to a plate on which a ball will roll. Through infrared sensors placed under the plate, I can detect the ball’s position (observations) and move the plate (pitch and roll actions) to bring the ball to the center and stabilize it. The reward is defined based on the distance from […]