Hi! Is it possible to limit the timespan of the data fetched by the UI? We've observed that the container crashes with a large amount of tokens when opening the UI (about 160 million tokens). To mitigate we do 'Clear Data' from the UI but it would be ideal to specify how much data the container and UI will attempt to fetch on load
Yes, it is possible to limit the timespan of the data fetched by the UI in Phoenix. You can specify a time range using the `start_time` and `end_time` parameters when querying spans, which can help mitigate issues with large amounts of data.

```python
from datetime import datetime, timedelta

import phoenix as px

# Instantiate the Phoenix client
px_client = px.Client()

# Define the time range
start = datetime.now() - timedelta(days=7)  # Last 7 days
end = datetime.now() - timedelta(days=1)    # Exclude the last 24 hours

# Query spans within the specified time range
phoenix_df = px_client.query_spans(start_time=start, end_time=end)
```

Would you like more detail on this?

Sources:
- Extract Data from Spans
💡 Hint: Mention RunLLM in the thread for followups.
To limit the query when the UI is opened in the web browser for a Phoenix Docker container, you would need to modify the backend code that handles data fetching for the UI. Specifically, you can implement time range filtering in the API endpoints the UI calls:

1. Modify the backend code: update the API endpoint so its query logic accepts `start_time` and `end_time` parameters. This ensures that only data within the specified time range is fetched.
2. Update the Docker container: rebuild and redeploy the container with the updated backend code.

A general approach to modifying the backend code:

1. Locate the API endpoint in the Phoenix backend that handles data fetching for the UI.
2. Add time range filtering to its query logic via `start_time` and `end_time` parameters.

Example in Python:

```python
from datetime import datetime, timedelta

import phoenix as px
from phoenix.trace.dsl import SpanQuery

# Define the time range
start = datetime.now() - timedelta(days=7)  # Last 7 days
end = datetime.now() - timedelta(days=1)    # Exclude the last 24 hours

# Build the span query
query = SpanQuery().where("span_kind == 'RETRIEVER'").select(input="input.value")

# Fetch the data, constrained to the time range
px_client = px.Client()
phoenix_df = px_client.query_spans(query, start_time=start, end_time=end)
```

Would you like more detailed steps on how to implement this in your Docker setup?

Sources:
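Independently of Phoenix's internals, the filtering step such an endpoint would perform can be sketched on its own. This is a minimal illustration with hypothetical span records; the `filter_spans` helper and the record shape are illustrative, not Phoenix APIs:

```python
from datetime import datetime, timedelta

def filter_spans(spans, start_time, end_time):
    """Keep only spans whose start timestamp falls inside [start_time, end_time)."""
    return [s for s in spans if start_time <= s["start"] < end_time]

now = datetime(2024, 9, 1, 12, 0)
spans = [
    {"id": "a", "start": now - timedelta(days=10)},   # older than the window
    {"id": "b", "start": now - timedelta(days=3)},    # inside the window
    {"id": "c", "start": now - timedelta(hours=12)},  # newer than the window
]

# Window: last 7 days, excluding the most recent 24 hours
recent = filter_spans(spans, now - timedelta(days=7), now - timedelta(days=1))
# Only span "b" survives the filter
```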
- Extract Data from Spans
Yes, forking the Arize Phoenix repository and building a custom Docker image is a viable approach for implementing custom logic, such as limiting the timespan of data fetched by the UI. The general steps:

1. Fork the repository: fork the Arize Phoenix repository on GitHub.
2. Modify the backend code: add time range filtering (`start_time` and `end_time` parameters) to the API endpoints that fetch data for the UI.
3. Build the custom Docker image: after making the necessary changes, build a new image from your forked repository.
4. Deploy the custom Docker image: run it to deploy your modified version of Phoenix.

Example commands to build and run the custom image:

```shell
# Clone your forked repository
git clone https://github.com/your-username/phoenix.git
cd phoenix

# Make your code changes

# Build the Docker image
docker build -t your-custom-phoenix .

# Run the custom Docker image
docker run -p 6006:6006 -p 4317:4317 -i -t your-custom-phoenix
```

Would you like more detailed instructions on any of these steps?

Sources:
- https://arize-ai.slack.com/archives/C04R3GXC8HK/p1720763791.977129
- https://github.com/Arize-ai/phoenix/issues/3094
Hey Mikyo, 100+ million tokens can occur in less than 2 hours in one of our environments
I can add a last hour option to unblock you for now until we get retention policies in place?
That'd be awesome. Any chance you could do trace count instead of time frame? Even a single call to one of our endpoints can generate 11 million tokens. I apologize, I imagine we're a bit of an oddball in this regard.
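Until a trace-count limit exists server-side, one client-side workaround is to cap the number of traces after fetching. A sketch with hypothetical `(trace_id, start_time)` pairs standing in for queried spans; the helper and data shape are assumptions, not Phoenix APIs:

```python
from datetime import datetime

def limit_to_recent_traces(spans, n):
    """Keep only spans belonging to the n most recently started traces."""
    # Earliest span start per trace marks when the trace began
    first_start = {}
    for trace_id, start in spans:
        if trace_id not in first_start or start < first_start[trace_id]:
            first_start[trace_id] = start
    # The n traces with the latest start times
    keep = {
        t for t, _ in
        sorted(first_start.items(), key=lambda kv: kv[1], reverse=True)[:n]
    }
    return [s for s in spans if s[0] in keep]

spans = [
    ("t1", datetime(2024, 9, 1, 10, 0)),
    ("t1", datetime(2024, 9, 1, 10, 1)),
    ("t2", datetime(2024, 9, 1, 11, 0)),
    ("t3", datetime(2024, 9, 1, 12, 0)),
    ("t3", datetime(2024, 9, 1, 12, 1)),
]

# Keep only the 2 most recent traces: t3 and t2
limited = limit_to_recent_traces(spans, 2)
```

This bounds memory by trace count rather than wall-clock time, which fits workloads where a single trace can carry millions of tokens.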
I think it's a fairly recent image version, but I'm getting the image ID to double-check
Sg - it might take me a bit more work to get the page size configurable. Let me get a quick fix out for the time range and will try to unblock you on load
awesome!
Just released an update. Hopefully it unblocks you for the time being https://github.com/Arize-ai/phoenix/releases/tag/arize-phoenix-v4.34.0
