Hello Phoenix Support - Disk space keeps increasing when I use the Postgres DB for observability, even though the "Maximum retention trace period" in settings is set to 20 days. Am I missing something? What should I do to clean up the disk space?
If you have set the "Maximum retention trace period" to 20 days in Phoenix, traces older than 20 days should be deleted automatically on the configured cleanup schedule. However, disk space may not decrease immediately, because PostgreSQL does not instantly reclaim space after deletions; it relies on autovacuum to clean up dead tuples and compact the database over time. You can monitor and trigger autovacuum, or run a manual VACUUM FULL to reclaim space, but be aware that VACUUM FULL locks tables and can temporarily impact performance. Also ensure your retention policy is correctly associated with your project and that no custom project-specific policy is overriding the default setting.

Additionally, Phoenix v11.5+ includes a disk usage monitor that warns admins and blocks writes when thresholds are exceeded, but it does not force immediate space reclamation. For more immediate cleanup, you can manually delete traces or projects via the UI or REST API. For more details on retention policy configuration and manual deletion, see the official documentation and release notes. If disk usage remains high after deletions, check for lingering project sessions or orphaned records, as these may require additional cleanup steps in some versions of Phoenix or PostgreSQL environments.

Would you like more detail on how to manually trigger database cleanup or verify your retention policy settings?

Sources:
- Data Retention | Arize Phoenix
- Release Notes: Data Retention Policies
- Self-Hosting FAQ: Data Retention
- GitHub Issue: Disk space not reclaimed after deletion
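To make the autovacuum/VACUUM point concrete, a few queries can be run in psql to check where the space is going and reclaim it. This is a sketch only; the `spans` table name is an assumption — list your tables first and substitute the real names:

```sql
-- Overall database size (read-only; safe to run any time)
SELECT pg_size_pretty(pg_database_size(current_database()));

-- Which tables hold the space (read-only)
SELECT relname, pg_size_pretty(pg_total_relation_size(oid)) AS total_size
FROM pg_class
WHERE relkind = 'r'
ORDER BY pg_total_relation_size(oid) DESC
LIMIT 10;

-- Reclaim space for one table. VACUUM FULL rewrites the table and takes an
-- ACCESS EXCLUSIVE lock, so run it in a maintenance window.
-- "spans" is an assumed table name; replace with one from the query above.
VACUUM FULL VERBOSE spans;
```

Note that a plain `VACUUM` (without `FULL`) only marks dead tuples as reusable; `VACUUM FULL` is what actually shrinks the files on disk, at the cost of the exclusive lock.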
are you using the latest version of Phoenix?
you can also sort the spans by date to see whether the old spans are deleted
I have checked the DB and it doesn't have data older than 20 days.
I'm using the Helm chart deployment.
ok. if the old data is deleted, it could just be that the new data is very large, or Postgres hasn't vacuumed the dead tuples yet. you can check how many dead tuples there are, and when the last autovacuum ran, with the following query.
-- Per-table TOAST sizes and dead-tuple stats for tables over ~10 MB
SELECT
    n.nspname AS schema,
    c.relname AS table_name,
    pg_size_pretty(pg_relation_size(c.oid)) AS main_table_size,
    pg_size_pretty(pg_relation_size(c.reltoastrelid)) AS toast_heap_size,
    pg_size_pretty(pg_indexes_size(c.reltoastrelid)) AS toast_index_size,
    pg_size_pretty(
        pg_relation_size(c.reltoastrelid) + pg_indexes_size(c.reltoastrelid)
    ) AS toast_total_size,
    ROUND(
        (pg_relation_size(c.reltoastrelid) + pg_indexes_size(c.reltoastrelid))::numeric
            / NULLIF(pg_total_relation_size(c.oid), 0) * 100,
        2
    ) AS toast_pct_of_total,
    ct.relname AS toast_table_name,
    COALESCE(st.n_live_tup, 0) AS live_toast_chunks,
    COALESCE(st.n_dead_tup, 0) AS dead_toast_chunks,
    COALESCE(
        ROUND(st.n_dead_tup::numeric / NULLIF(st.n_live_tup + st.n_dead_tup, 0) * 100, 2),
        0
    ) AS toast_dead_pct,
    st.last_autovacuum AS toast_last_vacuum
FROM pg_class c
JOIN pg_namespace n ON n.oid = c.relnamespace
LEFT JOIN pg_class ct ON ct.oid = c.reltoastrelid
LEFT JOIN pg_stat_all_tables st ON st.relid = ct.oid
WHERE c.relkind = 'r'                              -- ordinary tables only
  AND c.reltoastrelid > 0                          -- only tables that have a TOAST table
  AND n.nspname NOT IN ('pg_catalog', 'information_schema', 'pg_toast')
  AND pg_total_relation_size(c.oid) > 10000000     -- skip tables under ~10 MB
ORDER BY pg_relation_size(c.reltoastrelid) + pg_indexes_size(c.reltoastrelid) DESC
LIMIT 20;
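If that query shows a high `toast_dead_pct` and an old (or NULL) `toast_last_vacuum`, the dead tuples haven't been cleaned up yet, and you can vacuum the parent table by hand. A sketch of the follow-up, assuming `public.spans` turned out to be the heavy table (substitute the actual `schema.table_name` from your results):

```sql
-- A plain VACUUM removes dead tuples (including the table's TOAST data)
-- without blocking reads/writes, but it does NOT shrink the files on disk:
VACUUM (VERBOSE, ANALYZE) public.spans;

-- Only VACUUM FULL returns space to the OS. It takes an ACCESS EXCLUSIVE
-- lock and needs enough free disk to rewrite the table, so schedule it:
VACUUM FULL public.spans;
```

If autovacuum consistently lags on these tables, lowering `autovacuum_vacuum_scale_factor` for them is another option, but start with the manual VACUUM to confirm that dead tuples are actually the cause.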