[R] Prompt Repetition Shows Null Result on Agentic Engineering Tasks (n=20, blind scored)

We tested prompt repetition on engineering tasks with Claude Haiku 4.5 agents. Blind scored, pre-registeredrubrics. Both groups scored 100%. Nothing to improve.

The surprise: in our experiments, treatment agents finished in fewer turns and used 13% fewer output tokens.

submitted by /u/antidrugue
[link] [comments]

Liked Liked