Introducing Session-Level Evaluations for Agent Performance Analysis

·Jun 30, 2025 08:31 PM

🚀 New drop for everyone building agents: Session-Level Evaluations You can now see how your agent does across a full convo, not just one turn. What you can measure: 🌀 Coherence (is it consistent?) 🧩 Context retention (is it remembering past turns?) 🎯 Goal achievement (does the user get what they came for?) 🛤️ Multi-step progression (can it handle complex tasks smoothly?) Perfect for those of you building multi-turn workflows where step-by-step checks aren’t enough. Full Guide: https://arize.com/docs/ax/cookbooks/evaluation/session-level-evaluations Drop your questions or what you’re excited to test with this! 👇✨

🙌1