Open-source synthetic manufacturing environment for uncertainty-aware RL / planning

Hi everyone — I’m working on an open-source environment for studying sequential decision-making in manufacturing systems.

The current demo is a synthetic process-window benchmark: an agent/planner selects process settings, observes noisy quality outcomes, tracks uncertainty, and recommends the next experiment. The motivation is similar to sparse-data physical systems, where each real experiment is expensive and the goal is not just prediction, but deciding what to try next.

Repo:
https://github.com/programmablemanufacturing/programmable-manufacturing-lab

I’d appreciate feedback from the RL community on:

  • what baseline planners would be useful to include first;
  • whether this should be framed closer to contextual bandits, model-based RL, Bayesian optimization, or POMDP-style planning;
  • what metrics would make sense beyond reward, such as regret, sample efficiency, uncertainty calibration, or build-to-confidence.

The goal is to create a small public benchmark that others can critique, extend, or use for educational experiments.

submitted by /u/Consistent_Scene3887
[link] [comments]

Liked Liked