What are graders and why do you need them?

When I worked with basic neural networks trained in reinforcement learning environments, there were only rewards. Now that I’m starting to work on environments in which an LLM will act, I’ve run into something called graders, and I couldn’t find any good resources for learning about them. I don’t get what the difference is between a reward function and a grader.

submitted by /u/Full_Promotion4522