Barath C.

Commented on Exploring UI Options for Evaluating LLM Chatbot Pe...·Posted inDiscussions

Hey Jason , yes saving evaluations to potentially a database and able to visualise it with an UI. The data I am evaluating is the output of my own implementation I am not using LlamaIndex or LangChain for now.

Commented on Exploring UI Options for Evaluating LLM Chatbot Pe...·Posted inDiscussions

Barath C.

Hey Xander S. thanks for reaching out. I wan't to run these evals as part of some CI/CD pipelines. It would be nicer if there was an UI to visualise the results.

Posted in Discussions·

Barath C.

Exploring UI Options for Evaluating LLM Chatbot Performance

Hey guys I am new to this community. I am building an LLM Chatbot based on some custom knowledge base. I want to use Phoenix Eval to evaluate the Q&A metric. I have tried it it works well. But, is there a UI based version for running evals ? like MLFlow or LangSmith ?

4Comments