Sid's Avatar

Sid

@sidvenkatayogi.bsky.social

2 Followers  |  4 Following  |  2 Posts  |  Joined: 23.01.2025  |  1.5517

Latest posts by sidvenkatayogi.bsky.social on Bluesky

Preview
Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence In high-stakes domains like medicine, it may be generally desirable for models to faithfully adhere to the context provided. But what happens if the context does not align with model priors or safety ...

πŸ“ŽPaper: arxiv.org/abs/2601.11886
πŸ§‘β€πŸ’»Code/data: github.com/KaijieMo-kj/...

w/
@kaijie-mo.bsky.social @sidvenkatayogi.bsky.social
@chantalsh.bsky.social @ramezkouzy.bsky.social
@cocoweixu.bsky.social @byron.bsky.social @jessyjli.bsky.social

21.01.2026 19:07 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Hello world πŸ‘‹
My first paper at UT Austin!

We ask: what happens when medical β€œevidence” fed into an LLM is wrong? Should your AI stay faithful, or should it play it safe when the evidence is harmful?

We show that frontier LLMs accept counterfactual medical evidence at face value.🧡

21.01.2026 18:45 β€” πŸ‘ 14    πŸ” 6    πŸ’¬ 3    πŸ“Œ 2
Preview
GitHub - sidvenkatayogi/pixie Contribute to sidvenkatayogi/pixie development by creating an account on GitHub.

Github:
github.com/sidvenkatayo...
Process:
sidvenkatayogi.github.io/posts/2025/0...
Download PIXIE:

21.07.2025 19:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Introducing PIXIE
YouTube video by Siddhartha Venkatayogi Introducing PIXIE

just released PIXIE, a novel, intuitive visual tool for visual creatives to browse/search for inspiration and reference. The better way to view your Pinterest boards.

Here’s the demo:

21.07.2025 19:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@sidvenkatayogi is following 4 prominent accounts