Exploring Sahil's Tool for NL to Test Generation in Phoenix

Jan 02, 2025 01:48 PM

Interesting tool by Sahil from Gumroad: https://github.com/anti-work/shortest. It creates tests from NL, so could the same approach create evals from NL? I recently spoke with John and Dat from the Arize team and would love to see this in Phoenix. Why this tool? Arize is more of a dev-first tool, so adding workflows like this could go a long way toward making EDD accessible to PMs and SMEs. It could also be a good wrapper around the Ragas testset generator, whose output can be fed into a DSPy pipeline!
