Sanjana Yeddula's Avatar

Sanjana Yeddula

@syeddula.bsky.social

Arize AI!

11 Followers  |  7 Following  |  20 Posts  |  Joined: 18.03.2025  |  1.7141

Latest posts by syeddula.bsky.social on Bluesky

Arize AI

app.arize.com/auth/phoenix...

27.06.2025 22:34 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Missed the news from Arize Observe 2025? Phoenix Cloud just got Spaces & Access Management!

✨ Create tailored Spaces
πŸ”‘ Manage user permissions
πŸ‘₯ Easy team collaboration

More than a feature, it’s Phoenix adapting to you.

Spin up a new Phoenix project & test it out!
@arize-phoenix.bsky.social

27.06.2025 22:34 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
Google GenAI | Phoenix Instrument LLM calls made using the Google Gen AI Python SDK

Docs: docs.arize.com/phoenix/trac...

Notebook: colab.research.google.com/github/Arize...

08.05.2025 20:41 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ†• New in OpenInference: Python auto-instrumentation for the Google GenAI SDK!

Add GenAI tracing to your @arize-phoenix.bsky.social applications in just a few lines. Works great with Span Replay so you can debug, tweak, and explore agent behavior in prompt playground.

Check Notebook + docs below!πŸ‘‡

08.05.2025 20:41 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Learn to prompt better

07.05.2025 19:26 β€” πŸ‘ 6    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0
Preview
Google Colab

Cookbook: colab.research.google.com/github/Arize...

18.04.2025 18:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Tracing and Evaluating OpenAI Agents
YouTube video by Arize AI Tracing and Evaluating OpenAI Agents

Check out the full video: youtu.be/iOGu7-HYm6s?...

18.04.2025 18:51 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Just dropped a tutorial on using the OpenAI Agents SDK + @arize-phoenix.bsky.social to go from building to evaluating agents.

βœ”οΈ Trace agent decisions at every step
βœ”οΈ Offline and Online Evals using LLM as a Judge

If you're building agents, measuring them is essential.

Full vid and cookbook below

18.04.2025 18:51 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

We've added GPT-4.1 models to the @arize-phoenix.bsky.social Prompt Playground.

My go-to way to test out these new models: grab a failed trace from a previous run, pull it into playground, switch the model and see if 4.1 can succeed where 4o failed.

Early signs are promising!

16.04.2025 18:43 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

good point - the focus of the tutorial was on general prompt optimization techniques. textgrad is awesome for gradient-based optimization, but this approach aimed to keep things more widely applicable. definitely worth exploring in a future video for more fine tuning

08.04.2025 07:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
phoenix/tutorials/evals/optimizing_llm_as_a_judge_prompts.ipynb at main Β· Arize-ai/phoenix AI Observability & Evaluation. Contribute to Arize-ai/phoenix development by creating an account on GitHub.

Notebook: github.com/Arize-ai/pho...

07.04.2025 17:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
LLM as a Judge Prompt Optimization
YouTube video by Arize AI LLM as a Judge Prompt Optimization

Full video: youtu.be/pvef59pEmvo

07.04.2025 17:15 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

LLM as a Judge allows models to evaluate outputs in a single promptβ€”but a good judging needs a good prompt

In my new tutorial, learn techniques on how to optimize your prompt so your judge can improve accuracy, cost, fairness, and robustness

better prompts ➑️ better evals

07.04.2025 17:15 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0

Notebook: github.com/Arize-ai/pho...

24.03.2025 23:27 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Notebook: github.com/Arize-ai/pho...

24.03.2025 23:25 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
ReAct Prompting
YouTube video by Arize AI ReAct Prompting

Think + Act β€” all within your prompt

In this tutorial, I apply ReAct principles to prompt LLMs to Reason + Act like humans. By specifying these steps, the LLM generates reasoning and interacts with tools for greater accuracy.

Full Video Tutorial: youtu.be/PB7hrp0mz54?...

24.03.2025 23:25 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Hey! No particular reason for sticking with 3.5 in these demos, just what I've been rolling with. phoenix prompts and these notebooks let you swap out models easily if you are interested in testing that out

I'll switch it up in some upcoming notebooks. thanks for the feedback!

20.03.2025 23:18 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Chain-of-Thought Prompting YouTube video by Arize AI

Video: www.youtube.com/watch?si=yHW...
Notebook: github.com/Arize-ai/pho...
#LLM #prompts #observability

19.03.2025 23:13 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

How much LLM reasoning can you drive through your prompt itself?

I’ve been using Chain of Thought (CoT) prompting to help LLMs replicate logical step-by-step thinking.

For the next segment in my prompting series, I use @arize-phoenix.bsky.social to test the performance of various CoT methods

19.03.2025 23:13 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Arize Phoenix – 5,000 Stars on GitHub!
YouTube video by Arize AI Arize Phoenix – 5,000 Stars on GitHub!

πŸŽ‰ 5000 Stars and Counting... πŸŽ‰

We're celebrating Phoenix reaching 5000 stars on GitHub! This milestone underscores the growing demand for robust, open-source tools that tackle the complexities of AI and LLM development

Check it out: github.com/Arize-ai/pho...

www.youtube.com/watch?v=bW5Z...

19.03.2025 17:46 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Preview
phoenix/tutorials/prompts/few_shot_prompting.ipynb at main Β· Arize-ai/phoenix AI Observability & Evaluation. Contribute to Arize-ai/phoenix development by creating an account on GitHub.

Notebook: github.com/Arize-ai/pho...
Video: www.youtube.com/watch?v=ggXc...

18.03.2025 23:50 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

How much more data does an LLM app really need?

In my latest tutorial, I explore how few-shot prompting boosts accuracy without massive datasets or retrainingβ€”using @arize-phoenix.bsky.social prompts and experiments to break it down.

This kicks off my prompting series... more to come!

18.03.2025 23:50 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

🧠 Phoenix now supports Anthropic Sonnet 3.7 & Thinking Budgets!

This makes Prompt Playground ideal for side-by-side reasoning tests: o3 vs. Anthropic vs. R1.

Plus, GPT-4.5 support keeps it up to date with the latest from OpenAI & Anthropic - test them all out in the playground! ⚑️

07.03.2025 17:29 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Some updates for Projects! Gain more flexibility and control with:

πŸ“Œ Persistent column selection for consistent views
πŸ” Filter data directly from tables with metadata and quick metadata filters
⏳ Set custom time ranges for traces & spans
🌳 Option to filter spans by root spans

Check out the demoπŸ‘‡

07.03.2025 23:39 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Prompt Optimization Techniques Explore different prompt optimization techniques and learn how Arize Phoenix and DSPy can be used to automate and enhance the process.

Prompt optimization is essential, and automating it with frameworks like DSPy gives you scalable and data-driven improvements.

There's also a tutorial linked in here where you can use Phoenix to compare the performance of different techniques. πŸ‘‡

arize.com/blog/prompt-...

17.03.2025 21:22 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

@syeddula is following 7 prominent accounts