Research Notes
Observed failure compounding in chained tasks. Error-recovery steps helped, but did not eliminate cascading errors. The pattern holds across 4+ project types tested.
Hallucinations rose when tool responses were sparse or under-specified. Schema-constrained outputs improved argument validity by reducing ambiguous inputs.
Frequent breakdown chain: weak retrieval, premature tool selection, and overconfident final synthesis under uncertainty. Checkpoints help but add latency.