Experiences with Costly Agent Retries and Loops in Production: Seeking Real Examples

·Apr 04, 2026 02:51 AM·

For teams running agents in production: Have you seen retries or loops burn money before anyone noticed? I’m specifically interested in:

- repeated retries on large contexts
- agents failing to reach an end state
- cases where traces/logs existed but still didn’t make the failure obvious

One real example would help a lot.

Arize AX Support

Swainy l.
