GRPO for load balancing
Genuine doubt to network/communications/curious engineers, how do you employ (if so) reinforcement learning for the optimal control of backend routing.
I recently got really curious and used Go for the minimal implementation of GRPO for this problem
Here is the code
https://github.com/karimluna/tiny-grpo
What do you think?, my framing is about the scalability with a minimal implementation of this algorithm so share with my your ideas or implementations!
submitted by /u/Volta-5
[link] [comments]
Like
0
Liked
Liked