Transient error Internal Server Error encountered while exporting span batch, retrying in 8s.Hello, I am seeing this error after integrating arize-otel for litellm. We have our agents running within ECS Fargate on AWS. Any help with this would be really appreciated
This error means Phoenix (the OpenTelemetry collector endpoint) is returning a 500 Internal Server Error when your app tries to export spans. Common causes include sending spans with incorrect or unsupported semantic conventions, or the Phoenix server being overloaded or misconfigured. Double-check that your spans use valid OpenInference conventions and that your Phoenix server is healthy and reachable. If you are manually instrumenting, ensure your span kind and attributes are valid鈥攊nvalid values can trigger this error, as discussed in Phoenix issue #2950.If the error persists, check Phoenix server logs for more details about the failure. Also, ensure your ECS Fargate tasks can reach the Phoenix endpoint and that there are no network or authentication issues. Would you like more detail on debugging or validating your span data?Sources: - Phoenix issue #2950
馃挕聽Hint: Mention RunLLM in the thread for followups.
Hey Vibhaj R. this channel is for Phoenix. Looks like you found the right support channel Arize AX Support
thanks
actually the question is about arize cloud, is there a different channel for that?
