Página de exemplo
Política de privacidade

[R] Prompt Repetition Shows Null Result on Agentic Engineering Tasks (n=20, blind scored)

digitado ⋅ 24 de February de 2026

We tested prompt repetition on engineering tasks with Claude Haiku 4.5 agents. Blind scored, pre-registeredrubrics. Both groups scored 100%. Nothing to improve.

The surprise: in our experiments, treatment agents finished in fewer turns and used 13% fewer output tokens.

submitted by /u/antidrugue
[link] [comments]

Like 0

Liked Liked

« ActionEngine: From Reactive to Programmatic GUI Agents via State Machine Memory » Best Piano Learning Apps in 2026: An In-Depth Comparison of Music Education Technology

Search

Posts recentes

The trap Anthropic built for itself
How to Build Interactive Geospatial Dashboards Using Folium with Heatmaps, Choropleths, Time Animation, Marker Clustering, and Advanced Interactive Plugins
A Coding Implementation to Build a Hierarchical Planner AI Agent Using Open-Source LLMs with Tool Execution and Structured Multi-Agent Reasoning
Google DeepMind Introduces Unified Latents (UL): A Machine Learning Framework that Jointly Regularizes Latents Using a Diffusion Prior and Decoder
3 Questions: How AI could optimize the power grid

Comentários

No comments to show.

Arquivos

Categorias

technocracy

Digitado © 2025