Exploring LLM Quality Metrics with Arize Platform Insights

·Mar 07, 2024 08:38 PM

Arize team. I started playing around with Arize for the last two days, and I must admit that the Platform is slick. Some of the most challenging questions we usually get are about measuring LLM/RAG quality (relevance, hallucination, toxicity, bias), for which I never had a good answer. Arize gives a good point of view on what to look for in the LLM quality dimensions.