Hi, does phoenix allow running experiment with multi-step QAs?
馃挕聽Hint: Mention RunLLM in the thread for followups.
Hi Xander S., the simplest example is case
User: Hi
Agent: Hi
User: What is my name?
Agent: I dont know
User: My name is Ihor
Agent: Hi Ihor
User: What is my name
Agent: Your name is IhorHere is sequence of 4 request to Agent. I want to evaluate only the last one. When running this task it will create 4 spans, I believe. How to assign all for to one experiment datasample task run?
