When running experiments, is it possible to only run a subset of the examples for faster iteration?
💡 Hint: Mention RunLLM in the thread for followups.
RunLLM I would still like to track the experiment with the UI/server. Can I prefilter the dataset and pass it to the experiment method, or do I need to reupload it? My use-case is something like grading the examples as either "easy" or "hard", and then being able to run either of them alone or both together.
We just released a new feature as of Phoenix 12.7.0 that is purpose built for this use-case called "Dataset Splits". https://arize.com/docs/phoenix/datasets-and-experiments/how-to-experiments/splits The documentation is still a work in progress, however you can select one or more examples in the UI and then assign them to "splits" like "easy" or "hard" and then just run that sets of examples.
More ergonomic improvements are forthcoming
Anthony P. oh nice, that looks perfect! 🙂
my recommendation would be to enter some filter on the data using the search bar (on metadata perhaps) and then select-all of the results and assign them to a split
I will file an issue to support shift-select on the checkboxes, we should definitely have that
Anthony P. can splits be constructed from python?
that feature did not make it in for initial release, but it is planned
