got it, so it sounds like for the task itself it's best to start with a ground-truth dataset and run experiments over it, rather than working backwards by trying to create a custom eval first.
would the custom eval for the task then be more for monitoring? and to craft the llm-as-a-judge eval itself, would we leverage the experiment/dataset workflow like i described? see the sketch below for what i mean.
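to make sure i'm describing the same thing, here's a rough sketch of that workflow: score the judge against the ground-truth set, and only promote it to a monitoring eval once its agreement rate looks good. all the names, the dataset shape, and `judge_with_llm` are made up placeholders, not any particular library's API:

```python
# hypothetical ground-truth dataset: inputs paired with expected answers
ground_truth = [
    {"input": "What is our refund window?", "expected": "30 days"},
    {"input": "Do we ship internationally?", "expected": "yes, to 40+ countries"},
]

def judge_with_llm(question: str, answer: str, expected: str) -> bool:
    """Placeholder for a real judge-model call; here a crude token-overlap check."""
    expected_tokens = set(expected.lower().split())
    answer_tokens = set(answer.lower().split())
    return len(expected_tokens & answer_tokens) / len(expected_tokens) > 0.5

def run_experiment(task_fn) -> float:
    """Run the task over the ground-truth set and measure how often the
    llm judge agrees with a simple exact-match check -- that agreement
    rate is what tells us the judge is trustworthy enough for monitoring."""
    agreements = 0
    for row in ground_truth:
        answer = task_fn(row["input"])
        exact = row["expected"].lower() in answer.lower()
        judged = judge_with_llm(row["input"], answer, row["expected"])
        agreements += int(exact == judged)
    return agreements / len(ground_truth)

if __name__ == "__main__":
    # stand-in for the real task under evaluation
    fake_task = lambda q: "we offer a 30 days refund window"
    print(f"judge/ground-truth agreement: {run_experiment(fake_task):.0%}")
```

is that roughly the idea -- validate the judge on the labeled set first, then reuse it as the monitoring eval once agreement is high enough?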