I like that some of the Colab notebooks use classification_report from sklearn , It's helpful to see precision and recall for the span evaluations I'm using. Is there a way I can log this data onto the Phoenix UI so that I can track the results there, perhaps at a project level?