RubricBench Exposes a Big Flaw in AI Grading
RubricBench measures how far AI-generated grading rubrics drift from human standards—and shows why automated evaluation can misfire.