[R] Prompt Repetition Shows Null Result on Agentic Engineering Tasks (n=20, blind scored)
We tested prompt repetition on engineering tasks with Claude Haiku 4.5 agents. Blind scored, pre-registeredrubrics. Both groups scored 100%. Nothing to improve.
The surprise: in our experiments, treatment agents finished in fewer turns and used 13% fewer output tokens.
submitted by /u/antidrugue
[link] [comments]
Like
0
Liked
Liked