Thanks Jason. Please keep me posted. The functionlog_evaluations_sync is frustrating at the moment as it fails frequently and silently and makes logging evals at scale impossible due to frequent and silent ❓ errors.
We already batch eval calls (passing multiple spans in the same DF). We also have retries with exponential backoffs to avoid DDOSing your API. None of this seems to help 😢.
In my experience, the error rate is ~35 to 50% which is very high.