I am having some issues with evaluations.
I have my own server that stores traces in a database
When I get a request to view traces, I filter the spans and start a new Phoenix session, feeding the TraceDataset into it:

session = px.launch_app(trace=trace_ds)

I created a custom eval (copied from the Bring Your Own Evaluator page). While I can run the evals, the problems start with displaying the results.
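As a minimal sketch of that filtering step (pandas assumed; the column name context.trace_id follows the default Phoenix span schema, and the example rows and trace IDs are hypothetical, not the author's data):

```python
import pandas as pd

# Hypothetical span rows pulled from the database; the real query and
# any extra columns are specific to the author's own schema.
spans = pd.DataFrame(
    {
        "context.trace_id": ["t1", "t1", "t2"],
        "name": ["llm", "retriever", "llm"],
    }
)

# Keep only the spans belonging to the requested traces before wrapping
# the frame in a TraceDataset.
requested = {"t1"}
filtered = spans[spans["context.trace_id"].isin(requested)]
print(len(filtered))  # 2
```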
Using this code does not work at all:

px.Client().log_evaluations(
    SpanEvaluations(eval_name="bro", dataframe=relevance_classifications)
)

I have been able to show the eval results with this:
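One thing worth checking when log_evaluations seems to have no effect: SpanEvaluations expects its dataframe to be indexed by span IDs that match spans in the running session. A minimal sketch of that shape (pandas assumed; the span IDs, labels, and scores here are made up for illustration):

```python
import pandas as pd

# Hypothetical evaluation results -- in practice the index holds the real
# span IDs the evaluator ran over, and the columns come from the evaluator.
relevance_classifications = pd.DataFrame(
    {
        "label": ["relevant", "irrelevant"],
        "score": [1, 0],
        "explanation": ["matches the query", "off-topic"],
    },
    index=pd.Index(["span_id_1", "span_id_2"], name="span_id"),
)

print(relevance_classifications.index.name)  # span_id
```

If the index does not carry span IDs known to the session, the evaluations have nothing to attach to in the UI.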
dataset = px.active_session().get_trace_dataset()
dataset.append_evaluations(
    SpanEvaluations(eval_name="bro", dataframe=relevance_classifications)
)
px.close_app()
session = px.launch_app(trace=dataset)
session.view()

But with this, when I rerun evaluations, it fails at px.active_session().get_trace_dataset().
Thank you so much for trying out Phoenix, Nabeegh A.! Is there any chance you can show us a little bit of what your data looks like so we can reproduce what’s going on?
Dustin N. My data looks exactly like the default phoenix schema, with some added fields. This is how I construct a dataset:
def construct_traces_dataset(traces: list[dict]):
    def process_trace(trace: dict):
        trace["start_time"] = str(trace["start_time"])
        trace["end_time"] = str(trace["end_time"])
        for event in trace["events"]:
            event["timestamp"] = str(event["timestamp"])
        return trace

    return TraceDataset(
        json_lines_to_df([json.dumps(process_trace(trace)) for trace in traces])
    )

Also, I'd like to point out that with Phoenix version 4.0.0 and above, we support persistence, so you won't have to keep restarting your app.
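A small thing to double-check in process_trace: str() on a datetime produces a space-separated timestamp rather than ISO 8601 with a "T" separator. If whatever parses the JSON lines expects ISO 8601, isoformat() may be the safer choice. A quick stdlib illustration:

```python
from datetime import datetime, timezone

ts = datetime(2024, 1, 1, 12, 30, 0, tzinfo=timezone.utc)

# str() uses a space between the date and time parts...
assert str(ts) == "2024-01-01 12:30:00+00:00"

# ...while isoformat() emits the ISO 8601 "T" separator.
assert ts.isoformat() == "2024-01-01T12:30:00+00:00"
```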
Can you provide a small code snippet that fails, so we can reproduce it?
