Página de exemplo
Política de privacidade

Is my GRPO LLM training on my ETL-Doctor-Pipeline-Env working?

Is my GRPO LLM training on my ETL-Doctor-Pipeline-Env working?

digitado ⋅ 22 de April de 2026

Is my GRPO LLM training on my ETL-Doctor-Pipeline-Env working?

https://preview.redd.it/hg6sw1ps6qwg1.png?width=897&format=png&auto=webp&s=ffbc86307eb7f8ab88a7fbb132cd69c20fe62c33

I am training Qwen3-0.6B on an RL environment made specifically for llms which I made myself. Feeling lost and confused. Here is the HF space link: https://huggingface.co/spaces/Atharva1232/etl_pipeline_doctor and here’s the github: https://github.com/Its-Atharva-Gupta/EPL-Pipeline-Doctor-Env I did use claude code for making the environment, since this is for a hackathon and the time limit is really short. Is my training going well or do I refactor something?

submitted by /u/Full_Promotion4522
[link] [comments]

Like 0

Liked Liked

« Our favorite gear at Sea Otter Classic wasn’t the bikes—it was the accessories » Surrogate Functionals for Machine-Learned Orbital-Free Density Functional Theory

Search

Posts recentes

Post Title
Post Title
DOJ claims xAI’s unpermitted gas turbines are a matter of ‘national, economic, and energy security’
Plaud says its software business topped $100M in ARR after shipping over 2M AI notetakers
Robinhood’s note on 10% layoffs shows blaming AI isn’t cutting it

Comentários

No comments to show.

Arquivos

Categorias

technocracy

Digitado © 2026