Is my GRPO LLM training on my ETL-Doctor-Pipeline-Env working?
|
I am training Qwen3-0.6B on an RL environment made specifically for llms which I made myself. Feeling lost and confused. Here is the HF space link: https://huggingface.co/spaces/Atharva1232/etl_pipeline_doctor and here’s the github: https://github.com/Its-Atharva-Gupta/EPL-Pipeline-Doctor-Env I did use claude code for making the environment, since this is for a hackathon and the time limit is really short. Is my training going well or do I refactor something? submitted by /u/Full_Promotion4522 |
Like
0
Liked
Liked