Hi guys, let me give some context. We use multimodal models to generate product tags from both text and images. Here is one of our blog posts about this project; it gives more context on what we do and why. The screenshot shows an image belonging to a given product class and how we extract tags like color, style, or theme.
As input we pass an image and a text prompt; as output we return a list of tag values. So the output is the same format we discussed with you before.
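To make the output contract concrete, here's a minimal sketch of how a model response could be parsed into tag values. This is just an illustration: the JSON schema and the tag names (`color`, `style`, `theme` from the screenshot) are my assumption about the format, not necessarily exactly what we return.

```python
import json

def parse_tags(model_response: str) -> dict[str, list[str]]:
    """Parse a model's JSON response into {tag category: [values]}.

    Assumes the prompt asks the model to answer with a JSON object like
    {"color": ["red"], "style": ["casual"], "theme": ["summer"]}.
    Tag names here are hypothetical examples, not the real schema.
    """
    tags = json.loads(model_response)
    # keep only non-empty string values per category
    return {
        category: [v for v in values if isinstance(v, str) and v]
        for category, values in tags.items()
    }

response = '{"color": ["red", "white"], "style": ["casual"], "theme": []}'
print(parse_tags(response))
```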
We are now developing automated evaluation pipelines. For now they are meant to run offline, to test different prompts before shipping them to prod. I was wondering whether I can use arize-phoenix for Gemini vision models 🙂
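Independent of whether arize-phoenix covers Gemini vision, the offline comparison itself could be as simple as set-based tag scoring per product. A minimal sketch (the metric choice and the sample tags are my assumptions, not something we've agreed on):

```python
def tag_metrics(predicted: set[str], gold: set[str]) -> dict[str, float]:
    """Set-based precision/recall/F1 for one product's predicted tags."""
    tp = len(predicted & gold)  # tags the prompt got right
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}

# compare two hypothetical prompt variants against the same gold labels
gold = {"red", "casual", "summer"}
prompt_a = {"red", "casual"}           # misses "summer"
prompt_b = {"red", "casual", "beach"}  # adds one wrong tag
print(tag_metrics(prompt_a, gold))
print(tag_metrics(prompt_b, gold))
```

Averaging these over a labeled sample would give a single score per prompt, which is all the offline comparison needs before prod.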