I’m a bit shocked that this finally worked

digitado ⋅ May 23, 2026

So, I’ve been working on RL models as a hobby for the last few years. Did I finally get my trader to be profitable? It’s been a multi-year optimization journey: model architecture, reward shaping, endless training loops, and some unintentional curriculum learning …

Happy to answer any questions about my journey so far, the architecture, or the endless optimizations and tricks it took to get to something that’s actually profitable …

To give some color, this is PPO + Mamba

https://preview.redd.it/4t81o5fmax2h1.png?width=6352&format=png&auto=webp&s=27a61650e7b4ca704ef5b8cd1251f6f084a2d0f6

submitted by /u/KingSignificant5097