Sebastian S.

Commented on Update on Arize's Tracing MCP Server Release Statu...·Posted inPhoenix Support

dang, though i replied. Thanks this is exactly it https://arize.com/docs/ax/tracing-assistant ! it was for instrumenting my code.

Posted in Phoenix Support·

Sebastian S.

Update on Arize's Tracing MCP Server Release Status

Hey all did arize ever release the tracing MCP server. I recall that SallyAnn D. was working on this, it was a tool intended to live in your IDE that assisted in generating rich spans for your codebase.

3Comments

Commented on Evaluating the Hype: Insights on Multi-Agent Syste...·Posted inDiscussions

Sebastian S.

keep it coming Vibhu S.!!

Commented on Evaluating the Hype: Insights on Multi-Agent Syste...·Posted inDiscussions

Sebastian S.

My thoughts on their thoughts: - (in response to the sequence task decomposition diagram) Its hard to think about this without taking into account the outputs as part of the context. In this example you may not need the subtask 1 when prompting subtask 2 given that the output of subtast 1 is provided. - “Depending on the domain, you might even consider fine-tuning a smaller model (this is in fact something we’ve done at Cognition).” (“Cognition | Don’t Build Multi-Agents”) this violates the idea that smaller is dumber and thus best fitted for less impact jobs (in fact the team seems to support this sentiment in the section "edit apply models") - “So, builders had the large models output markdown explanations of code edits and then fed these markdown explanations to small models to actually rewrite the files. However, these systems would still be very faulty. Often times, for example, the small model would misinterpret the instructions of the large model and make an incorrect edit due to the most slight ambiguities in the instructions. Today, the edit decision-making and applying are more often done by a single model in one action.” (“Cognition | Don’t Build Multi-Agents”) would love to see where other coding agents did this. - “However, agents today are not quite able to engage in this style of long-context proactive discourse with much more reliability than you would get with a single agent. Humans are quite efficient at communicating our most important knowledge to one another, but this efficiency takes nontrivial intelligence.” (“Cognition | Don’t Build Multi-Agents”) if you operate under the aforementioned principles of context engineering then more than likely that discoruse is not proactive. Since it would be nothing more than chain of thought. This is based on the speculation that the reason this works for humans is the diversity of knowledge that makes these discussions proactive - “I don’t see anyone putting a dedicated effort to solving this difficult cross-agent context-passing problem.” (“Cognition | Don’t Build Multi-Agents”) A2A addresses context passing

Posted in Discussions·

Sebastian S.

The End of Observability: What's Next for Data Insights?

https://www.honeycomb.io/blog/its-the-end-of-observability-as-we-know-it-and-i-feel-fine

0Comments

Posted in Phoenix Support·

Sebastian S.

Status Update on Traces API for Phoenix and Arize Platform

What is the status on the traces API? (aka being able to query certain traces stored on phoenix or arize platform)

2Comments

Commented on Phoenix: Using OpenAI API for Image Generation Out...·Posted inPhoenix Support

Sebastian S.

John G. I tried with this as well and it did not work, do you have any examples of the client handling output images and whatever produced the trace: prompt = """ A children's book drawing of a veterinarian using a stethoscope to listen to the heartbeat of a baby otter. """ result = client.images.generate( model="gpt-image-1", prompt=prompt )

Commented on Phoenix: Using OpenAI API for Image Generation Out...·Posted inPhoenix Support

Sebastian S.

RunLLM could you share tutorials or guides showcasing generated images being displayed in phoenix

Posted in Phoenix Support·

Sebastian S.

Understanding Limited Views for TOOL vs. LLM Spans in Mistral OCR

What is the reasoning for having limited views for different span kinds for example TOOL span contains similar features to LLM spans but is just a lot more limited. I am currently looking into instrumenting Mistral OCR and I feel like I am forced to make calling OCR an LLM span so that I can leverage displaying images

1Comment

Commented on Phoenix: Using OpenAI API for Image Generation Out...·Posted inPhoenix Support

Sebastian S.

not for the example i showed