This is super useful — thank you.
When those planning/orchestration loops happened, what was the hardest part in practice:
- detecting that the agent was stuck,
- understanding why it never reached an “end” state,
- or stopping it safely without breaking useful runs?
Also curious: would a lightweight guardrail that flagged loop patterns or missing stop conditions early have actually helped?