Hi friends, we are ingesting about 10k traces per day in our self-hosted Phoenix instance. We've noticed the homepage load of Phoenix is getting painfully slow with the default "Last 30 days" timeframe. In particular, the following query has an avg execution time of 30s, based on our PostgreSQL query insight:
SELECT
    traces.project_rowid,
    COALESCE(SUM(spans.llm_token_count_prompt), $1::INTEGER) AS prompt,
    COALESCE(SUM(spans.llm_token_count_completion), $2::INTEGER) AS completion,
    COALESCE(SUM(spans.llm_token_count_prompt), $1::INTEGER) + COALESCE(SUM(spans.llm_token_count_completion), $2::INTEGER) AS total
FROM traces
JOIN spans ON traces.id = spans.trace_rowid
WHERE spans.start_time >= $3::TIMESTAMP WITH TIME ZONE
    AND traces.project_rowid IN ($4::INTEGER, $5::INTEGER, $6::INTEGER)
GROUP BY traces.project_rowid

This looks to be the stats query that populates the counters for each project panel on the homepage. Is there a way to optimize the indexes so this query runs efficiently?
This slow query is a known issue in Phoenix when handling large datasets: the join between traces and spans over millions of rows becomes expensive even with indexes. The Phoenix team recommends advanced indexing (such as composite or covering indexes on the join and filter columns), query refactoring, or pre-aggregation strategies to improve performance, but there is no official documentation or silver bullet for this specific query. As workarounds, limiting the time range, reducing the number of projects in the query, or pre-aggregating token counts per project and time window can help (GitHub Issue #8068, GitHub Issue #4006).

Recent Phoenix releases have made improvements such as lazy loading, reduced page sizes, and selective attribute fetching, which may help if you upgrade. For deeper optimization, consider adding composite indexes on (trace_rowid, start_time) in spans and (project_rowid, id) in traces, and/or pre-aggregating token counts outside of the main query path (GitHub Issue #4006). Would you like more detail on index strategies or pre-aggregation approaches?

Sources:
- GitHub Issue #8068
- GitHub Issue #4006
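
A rough sketch of the composite/covering indexes described above, assuming the schema implied by the query (spans.trace_rowid, spans.start_time, the two token-count columns, and traces.project_rowid / traces.id). The index names here are made up:

CREATE INDEX CONCURRENTLY IF NOT EXISTS ix_spans_trace_time_tokens
    ON spans (trace_rowid, start_time)
    INCLUDE (llm_token_count_prompt, llm_token_count_completion);

CREATE INDEX CONCURRENTLY IF NOT EXISTS ix_traces_project_id
    ON traces (project_rowid, id);

The INCLUDE clause (PostgreSQL 11+) lets the SUMs be served by an index-only scan without touching the spans heap, and CONCURRENTLY avoids blocking writes during the build (note it cannot run inside a transaction block). Verify with EXPLAIN (ANALYZE, BUFFERS) before and after to confirm the planner actually uses the new indexes.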
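For the pre-aggregation route mentioned above, one hedged sketch is a daily materialized view refreshed outside the request path. The view name and refresh schedule are assumptions, and Phoenix itself won't query this view, so it's mainly useful for external dashboards or a patched stats query:

CREATE MATERIALIZED VIEW project_token_daily AS
SELECT
    traces.project_rowid,
    date_trunc('day', spans.start_time) AS day,
    COALESCE(SUM(spans.llm_token_count_prompt), 0)     AS prompt,
    COALESCE(SUM(spans.llm_token_count_completion), 0) AS completion
FROM traces
JOIN spans ON traces.id = spans.trace_rowid
GROUP BY 1, 2;

-- Unique index required for REFRESH ... CONCURRENTLY:
CREATE UNIQUE INDEX ON project_token_daily (project_rowid, day);

-- Run periodically (e.g. via cron or pg_cron):
REFRESH MATERIALIZED VIEW CONCURRENTLY project_token_daily;

-- The 30-day stats then reduce to a cheap range scan:
SELECT project_rowid,
       SUM(prompt)              AS prompt,
       SUM(completion)          AS completion,
       SUM(prompt + completion) AS total
FROM project_token_daily
WHERE day >= now() - interval '30 days'
GROUP BY project_rowid;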
Hey Jo P. super sorry you are hitting this. This is a known issue that we are actively working towards resolving. Is it possible for you to share some anonymized stats for your largest projects such as # of traces, token count in your most expensive project in the last 30 days? This may help us replicate on our end and make further improvements
Thanks Anthony P.! Some rough stats for the last 30 days on our dominant project:
total traces: 200,000
total tokens: 200M (170M prompt, 30M completion)
are you using SQLite or PostgreSQL?
postgres
how would you characterize the depth of most of your traces?
our traces are mostly between 2 - 10 levels deep. I'd say the median is probably around 5
thanks 🙏
And thank you so much for your help with this issue!
