Quick Hot Take on Claude 3:
We will release some public Evals on it shortly. It's probably the closest to GPT-4 yet model wise that we have tested (we have not tested Gemini Ultra so can't say where that is in the spectrum) but in some of our hardest Evals Claude-3 still does not beat GPT-4.