Help with reinforcement learning Pick & Place

Currently I am trying to get into reinforcement learning, about two months ago I managed to make a curriculum that teaches my ur10e robot to reach a target within about 6cm.

Ever since then I have attempted to teach it to pick and place, ie. have it start at home position, move towards block, grasp block and move it above treshold or to target.

In those two months I haven’t really made any progress and all my attempts of improvements have given me 0 results.

I am wondering if someone with more success could review my code for anything I could change because I have been stumped on this and have no clue what to try next.

Or give me a working example similar to my own, or tips on changes, any advice honestly.

Whats the issue? If I limit my learning to stage 0( reach a point 20cm above block) it succeeds to 100% success ratio in about 1000-2000 episodes but when I load the save and inspect the results it maybe reaches it about 30% of the time (success being 6cm to the target, failures are a bit farther at up to 13cm away) , honestly don’t know why.

If I then implement stage 1 then, it falls apart, after 1000 episodes reaches 20% success, after which will fall to 3% and stay 3-10%.

Stage 2 wasn’t even tested much because I struggle with stage 0 and stage 1 as is.

ur10e robot arm, 2f85 gripper, Stable baselines 3, gymnasium-robotics, mujoco, SAC+HER curriculum, 1000-2000 episodes with 1000 timesteps each

I have already tried increasing it to something like 10k+ episodes but it just gets stuck at 2k episodes and falls to 0%

https://github.com/OverlordDestro/ur10e_HER_SAC_SB3_GYM

submitted by /u/Lord_Destro
[link] [comments]

Liked Liked