Howdy! Is there an effective way to add labels/tags to my dataset cases used in experiments? I want to add a tag, such as "adversarial", so I can run analytics on which types of cases perform well vs. still need work.
Yes, Phoenix supports adding tags to dataset examples for cohort analysis. You can add a tag column (e.g., "adversarial") to your pandas DataFrame and specify it in the schema using the tag_column_names parameter. This allows you to filter, group, and analyze performance by tag in the Phoenix UI and during experiments. For example, add a column like adversarial to your DataFrame, then set tag_column_names=["adversarial"] in your schema. Tags are then available for analytics and cohort filtering in the app聽as described here. Would you like a code example?Sources:
- Phoenix: How to define your schema
馃挕聽Hint: Mention RunLLM in the thread for followups.
One follow up question - what's the best way to report correctness/success of the experiment by these labels? Right now, I found docs that point me to this:
import phoenix as px
client = px.Client()
# Get the current dataset version
dataset = client.get_dataset(id="...", version_id="...")
df = dataset.as_dataframe()
df.head()
experiment = client.get_experiment(experiment_id="...")It seemed to work for me! Thanks!
