Crypto Futures Trading – Validation dataset
So I’ve been training an PPO (+mamba) agent to trade a specific crypto future, these are the results of a 64 env run on unseen data, each environment is starting on the same timestamp:
– Plots for each env/agent
– Overall P&L: +3643.39$
Is it somewhat expected that there would be such variance in evaluation runs and that running multiple parallel environments improves the overall P&L?
submitted by /u/KingSignificant5097
[link] [comments]
Like
0
Liked
Liked