The OpenAIInstrumentor does not explicitly mention capturing first-token latency in a stream response in the provided data sources. It is designed to instrument calls to OpenAI's API for observability, but specific metrics like first-token latency are not detailed. Would you like more information on how to potentially capture this metric manually?

Sources:
- https://github.com/Arize-ai/phoenix/issues/1977
💡 Hint: Mention RunLLM in the thread for followups.
Hey Xiaohan W., we don't currently capture time to first token, but this definitely makes sense as a metric you'd want to keep track of. Feel free to open an enhancement request on GitHub!
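In the meantime, one way to measure time to first token manually is to wrap the streaming iterator and record when the first chunk arrives. Below is a minimal sketch; the `time_to_first_token` helper and the `fake_stream` stand-in are illustrative names, not part of Phoenix or the OpenAI SDK. In practice you would pass it the iterator returned by `client.chat.completions.create(..., stream=True)`.

```python
import time

def time_to_first_token(stream):
    """Measure latency (seconds) until the first chunk arrives from a stream.

    `stream` is any iterable yielding response chunks, e.g. the iterator
    returned by an OpenAI streaming call. Returns (first_token_latency,
    list_of_chunks); latency is None if the stream was empty.
    """
    start = time.monotonic()
    first_latency = None
    chunks = []
    for chunk in stream:
        if first_latency is None:
            # First chunk observed: record elapsed time since the request started.
            first_latency = time.monotonic() - start
        chunks.append(chunk)
    return first_latency, chunks

# Simulated stream standing in for a real OpenAI streaming response.
def fake_stream():
    time.sleep(0.05)  # models the network + generation delay before the first token
    yield "Hello"
    yield " world"

latency, chunks = time_to_first_token(fake_stream())
```

The measured latency could then be attached to a span as a custom attribute alongside whatever the instrumentor already records.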