Evaluating LLM as SQL Judge: Insights from Defog Library Discussion

·May 09, 2024 06:52 PM

Really an awesome discussion with Manas from community. He used the defog library as ground truth to evaluate LLM as a judge for Generated SQL https://github.com/defog-ai/sql-eval Solid results, lots of debate on if we should add the schema to the prompt to improve, definitely an area of investment in research.

🙌1

Discussions

Evaluating LLM as SQL Judge: Insights from Defog Library Discussion

Jason

·May 09, 2024 06:52 PM

Really an awesome discussion with Manas from community. He used the defog library as ground truth to evaluate LLM as a judge for Generated SQL https://github.com/defog-ai/sql-eval Solid results, lots of debate on if we should add the schema to the prompt to improve, definitely an area of investment in research.

🙌1