Crypto Futures Trading – Validation dataset

digitado ⋅ February 28, 2026

So I’ve been training a PPO (+Mamba) agent to trade a specific crypto future. These are the results of a 64-env run on unseen data, with every environment starting at the same timestamp:

– Plots for each env/agent
– Overall P&L: +$3,643.39

Is it somewhat expected that there would be such variance in evaluation runs and that running multiple parallel environments improves the overall P&L?
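
One way to frame the question is to separate the summed P&L from the per-env statistics. The sketch below assumes the final P&L of each env is available as a plain list. The helper name `summarize_pnl` and the sample numbers are hypothetical and are not taken from the run above. It only illustrates that a summed P&L grows with the number of envs, while the per-env mean and its spread are the figures that stay comparable between runs of different sizes.

```python
import math
import statistics


def summarize_pnl(per_env_pnl):
    """Summarize final P&L across parallel evaluation envs.

    per_env_pnl: list of floats, one final P&L per env (hypothetical input).
    """
    n = len(per_env_pnl)
    total = sum(per_env_pnl)
    mean = total / n
    # Sample standard deviation across envs; undefined for a single env.
    std = statistics.stdev(per_env_pnl) if n > 1 else 0.0
    # Standard error of the mean shrinks as 1/sqrt(n) when envs are independent.
    sem = std / math.sqrt(n)
    return {"n_envs": n, "total": total, "mean": mean, "std": std, "sem": sem}


# Hypothetical per-env P&L values, NOT the results from the 64-env run.
example = [120.0, -35.5, 80.25, 10.0]
summary = summarize_pnl(example)
doubled = summarize_pnl(example * 2)

print(summary)
print(doubled)
```

Duplicating the list doubles `total` but leaves `mean` unchanged, which is the sense in which adding envs raises a summed P&L without saying anything about per-env performance. Note that the 1/sqrt(n) standard error assumes independent envs; envs that all start on the same timestamp see the same price path, so their outcomes are likely correlated and the real uncertainty may be larger.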

submitted by /u/KingSignificant5097