Quick Summary
Research from arXiv (2605.30621) shows that an agent's ability to update its "harness" (prompts, skills, memory) does not equate to its ability to benefit from it. Mid-tier models typically benefit the most from self-evolution, whereas powerful models (such as Claude 4.6 Opus) do not show significantly greater improvement compared to smaller models.
Why It Matters
This suggests that resources should be invested in the agent's task-solving capabilities rather than solely focusing on the evolutionary component (evolver).