I'm currently trying to compute recall, precision, and F1 evals for a binary classification of our LLM outputs on a dataset, but I'm not sure what makes the most sense in Phoenix. For example, for recall I attempted to build a custom eval metric comparing the output and the expected label, but the issue is that I can't return None for examples whose ground truth label is negative (since only positives matter for recall) without causing an error. Let me know what y'all think, ty!
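For context, here's a pure-Python sketch of what I'm effectively after (the labels are made up for illustration). The point is that recall is an aggregate over the whole dataset, so if the metrics are computed once over all examples, no per-example None is needed for negative ground truth:

```python
# Hypothetical per-example binary labels: 1 = positive, 0 = negative.
expected = [1, 0, 1, 1, 0]  # ground-truth labels from the dataset
outputs = [1, 0, 0, 1, 1]   # binary judgments derived from the LLM outputs

# Count true positives, false positives, and false negatives once,
# across the whole dataset -- negatives just never contribute to recall.
tp = sum(e == 1 and o == 1 for e, o in zip(expected, outputs))
fp = sum(e == 0 and o == 1 for e, o in zip(expected, outputs))
fn = sum(e == 1 and o == 0 for e, o in zip(expected, outputs))

precision = tp / (tp + fp) if tp + fp else 0.0
recall = tp / (tp + fn) if tp + fn else 0.0
f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0

print(precision, recall, f1)  # each 2/3 for this toy data
```

What I can't figure out is the cleanest way to map this onto Phoenix, where each eval seems to want a score per example.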