Roger Y. Thanks for the reply. Indeed! But in theory phoenix could instrument cache creation request too. With this way, you can link generation & cache creation request.
Hi Kenan, what kind of work flow do you have in mind?
Very typical use case actually. I need to reuse a ~100k tokens across many parallel calls and trying to reduce latency & cost.