Hello, What鈥檚 the best way to automatically track prompt changes (committed via Git) and evaluate their outputs against predefined ground truths? The idea is to have a workflow that gets triggered whenever a developer updates a prompt in Git. This workflow should run evaluations comparing the new prompt outputs against a set of ground truth responses to ensure consistency or detect regressions.