This is super useful, thank you.
When those planning/orchestration loops happened, what was the hardest part in practice:
- detecting that the agent was stuck,
- understanding why it never reached an "end" state,
- or stopping it safely without breaking useful runs?
Also curious: would a lightweight guardrail that detected loop patterns or missing stop conditions early have actually helped?