RL Topic for a Project

I’m scoping out a topic on robotic clothes folding and need a sanity check on my proposed stack. I’m thinking of combining a VLA (Vision-Language-Action) foundation model for semantic reasoning, SERL (sample-efficient robotic RL) for fine-tuning the physical manipulation, and DAgger / HIL for human-in-the-loop corrections when the robot hits out-of-distribution states. Is this actually feasible? Are there any landmines I might run into?
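To make the HIL part concrete, here is a rough sketch of the intervention loop I have in mind, in the spirit of DAgger-style data aggregation. Everything in it is a toy assumption: the 1-D state, the `is_ood` threshold, and the `policy`, `human`, and `step` functions are hypothetical stand-ins, not the API of SERL or of any VLA library.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

State = float
Action = float


@dataclass
class InterventionBuffer:
    """Holds (state, corrective action) pairs recorded during human takeovers."""
    pairs: List[Tuple[State, Action]] = field(default_factory=list)

    def add(self, state: State, action: Action) -> None:
        self.pairs.append((state, action))


def rollout_with_interventions(
    policy: Callable[[State], Action],
    human: Callable[[State], Action],
    is_ood: Callable[[State], bool],
    step: Callable[[State, Action], State],
    start: State,
    horizon: int,
    buffer: InterventionBuffer,
) -> int:
    """Run one episode; the human takes over whenever the state is flagged OOD.

    Returns the number of interventions. The corrective pairs land in `buffer`
    so they can later be mixed into the fine-tuning data.
    """
    state = start
    interventions = 0
    for _ in range(horizon):
        if is_ood(state):
            action = human(state)      # human correction replaces the policy
            buffer.add(state, action)  # aggregate the labelled state
            interventions += 1
        else:
            action = policy(state)
        state = step(state, action)
    return interventions


# Toy setup (all hypothetical): the policy drifts right, the human resets to 0.
buffer = InterventionBuffer()
interventions = rollout_with_interventions(
    policy=lambda s: 1.0,
    human=lambda s: -s,
    is_ood=lambda s: abs(s) > 2.0,
    step=lambda s, a: s + a,
    start=0.0,
    horizon=10,
    buffer=buffer,
)
print(interventions, buffer.pairs)
```

On the real robot the OOD check would presumably be a human watching and grabbing the teleop device rather than a threshold, and the buffered pairs would feed into the fine-tuning data alongside the RL experience.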

submitted by /u/Ok_Abbreviations2264