We've self-hosted our model on vLLM; its API is OpenAI-compatible.
Here's our client for reference. We send images as base64-encoded data URLs, and we're not using any client wrapper, but we can easily switch to one if it makes things easier:
import base64

import filetype
import httpx

# VLM_MODEL, VLLM_URL, VLLM_HEALTHCHECK, VLLM_READY_TIMEOUT, ALLOWED_IMAGE_TYPES
# and wait_for_ready come from our own config/helper modules (omitted here).


class VLMClient:
    def __init__(self, vlm_model: str = VLM_MODEL, vllm_url: str = VLLM_URL):
        self._vlm_model = vlm_model
        self._vllm_client = httpx.AsyncClient(base_url=vllm_url)
        if VLLM_HEALTHCHECK:
            # Block until the vLLM server reports healthy (or the timeout expires).
            wait_for_ready(
                server_url=vllm_url,
                wait_seconds=VLLM_READY_TIMEOUT,
                health_endpoint="health",
            )

    @property
    def vlm_model(self) -> str:
        return self._vlm_model
    async def __call__(
        self,
        prompt: str,
        image_bytes: bytes | None = None,
        image_filetype: filetype.Type | None = None,
        max_tokens: int = 10,
    ) -> str:
        # Assemble the message content
        message_content: list[dict[str, str | dict]] = [
            {
                "type": "text",
                "text": prompt,
            }
        ]
        if image_bytes is not None:
            if image_filetype is None:
                image_filetype = filetype.guess(image_bytes)
            if image_filetype is None:
                raise ValueError("Could not determine image filetype")
            if image_filetype not in ALLOWED_IMAGE_TYPES:
                raise ValueError(
                    f"Image type {image_filetype} is not supported. Allowed types: {ALLOWED_IMAGE_TYPES}"
                )
            image_b64 = base64.b64encode(image_bytes).decode("utf-8")
            message_content.append(
                {
                    "type": "image_url",
                    "image_url": {
                        "url": f"data:{image_filetype.mime};base64,{image_b64}",
                    },
                }
            )
        # Put together the request payload
        payload = {
            "model": self.vlm_model,
            "messages": [{"role": "user", "content": message_content}],
            "max_tokens": max_tokens,
            # "logprobs": True,
            # "top_logprobs": 1,
        }
        response = await self._vllm_client.post("/v1/chat/completions", json=payload)
        # Fail loudly on HTTP errors instead of hitting a confusing KeyError below.
        response.raise_for_status()
        data = response.json()
        response_text: str = (
            data["choices"][0].get("message", {}).get("content", "").strip()
        )
        return response_text
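For context, a typical call looks something like this (a quick sketch; png_bytes is just a placeholder for whatever image bytes we load):

client = VLMClient()
answer = await client("Describe this image.", image_bytes=png_bytes)

And if a wrapper would make things easier on your end, the same request could go through the official openai SDK pointed at the vLLM server. This is only a sketch of what we'd switch to, assuming openai>=1.x and that the server doesn't require a real API key:

from openai import AsyncOpenAI

# Point the OpenAI client at the vLLM server's OpenAI-compatible endpoint.
oai_client = AsyncOpenAI(base_url=f"{VLLM_URL}/v1", api_key="EMPTY")

completion = await oai_client.chat.completions.create(
    model=VLM_MODEL,
    messages=[{"role": "user", "content": message_content}],  # same content list as in VLMClient
    max_tokens=10,
)
answer = completion.choices[0].message.content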