Juan Diego Rodriguez (@ COLM 2025)'s Avatar

Juan Diego Rodriguez (@ COLM 2025)

@juand-r.bsky.social

CS PhD student at UT Austin in #NLP Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models! Other interests: math, philosophy, cinema https://www.juandiego-rodriguez.com/

4,230 Followers  |  2,102 Following  |  535 Posts  |  Joined: 30.10.2023  |  2.1808

Latest posts by juand-r.bsky.social on Bluesky

Post image

if you're interesting in gaining a better intuition for how llms behave at inference time, you should try logitloom🌱, the open-source tool i made for exploring token trajectory trees (aka looming) on base and instruct models! more info in thread

🌱 vgel.me/logitloom
πŸ’» github.com/vgel/logitloom

08.10.2025 01:36 β€” πŸ‘ 95    πŸ” 23    πŸ’¬ 5    πŸ“Œ 1
Post image Post image Post image

At @colmweb.org all week πŸ₯―🍁! Presenting 3 mechinterp + actionable interp papers at @interplay-workshop.bsky.social

1. BERTology in the Modern World w/ @bearseascape.bsky.social
2. MICE for CATs
3. LLM Microscope w/ Jiarui Liu, Jivitesh Jain, @monadiab77.bsky.social

Reach out to chat! #COLM2025

06.10.2025 22:08 β€” πŸ‘ 7    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Excited to present this at #COLM2025 tomorrow! (Tuesday, 11:00 AM poster session)

06.10.2025 20:40 β€” πŸ‘ 10    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0

Here’s a #COLM2025 feed!

Pin it πŸ“Œ to follow along with the conference this week!

06.10.2025 20:26 β€” πŸ‘ 24    πŸ” 17    πŸ’¬ 2    πŸ“Œ 1
Preview
Strahler number - Wikipedia

en.m.wikipedia.org/wiki/Strahle...

06.10.2025 20:38 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

On my way to #COLM2025 🍁

Check out jessyli.com/colm2025

QUDsim: Discourse templates in LLM stories arxiv.org/abs/2504.09373

EvalAgent: retrieval-based eval targeting implicit criteria arxiv.org/abs/2504.15219

RoboInstruct: code generation for robotics with simulators arxiv.org/abs/2405.20179

06.10.2025 15:50 β€” πŸ‘ 12    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Preview
Language Models Fail to Introspect About Their Knowledge of Language There has been recent interest in whether large language models (LLMs) can introspect about their own internal states. Such abilities would make LLMs more interpretable, and also validate the use of s...

I’m at #COLM2025 from Wed with:

@siyuansong.bsky.social Tue am introspection arxiv.org/abs/2503.07513

@qyao.bsky.social Wed am controlled rearing: arxiv.org/abs/2503.20850

@sashaboguraev.bsky.social INTERPLAY ling interp: arxiv.org/abs/2505.16002

I’ll talk at INTERPLAY too. Come say hi!

06.10.2025 15:57 β€” πŸ‘ 20    πŸ” 6    πŸ’¬ 1    πŸ“Œ 0

Excited to present this at COLM tomorrow! (Tuesday, 11:00 AM poster session)

06.10.2025 15:21 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Yes, smartphones are a great example.
As far as computer technology more generally, they are often invisible to many people... They do not realize that our modern world would just stop working without them.

06.10.2025 14:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

(honest question, genuinely curious about your opinion)-- do you think text/image/video generation has improved people's well-being directly in certain ways they are ignoring? (people who are not programmers or researchers)

06.10.2025 13:51 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ€– Yeah, this place, like most of social media, is crawling with Russians, Chinese, Israelis, and others running their games. (See: readsludge.com/2025/09/15/d... and www.voanews.com/a/bluesky-co...)

Wish we had more time to hunt the bots. Stay sharp out there.

05.10.2025 18:42 β€” πŸ‘ 15    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

πŸ‘€πŸ₯³

06.10.2025 02:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ‘€

05.10.2025 13:51 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Americans are β€˜deer in the headlights’ in face of Trump assault on free speech, Maria Ressa tells Jon Stewart Nobel prize winner says US institutions have collapsed much quicker than expected under the Trump administration

The Nobel prize winner Maria Ressa has said Americans are like β€œdeer in the headlights” amid the collapse of US institutions and free speech under the Trump administration, particularly after Jimmy Kimmel’s suspension.

04.10.2025 22:59 β€” πŸ‘ 284    πŸ” 108    πŸ’¬ 6    πŸ“Œ 9

I spoke to a Venezuelan woman who was arrested in this raid and later released with her 4yo son. She said agents broke down their door, pointed guns at them and made sexualized remarks about Venezuelan women. When she returned to her apartment it was boarded up and all her possessions were gone.

04.10.2025 14:30 β€” πŸ‘ 5564    πŸ” 3300    πŸ’¬ 242    πŸ“Œ 207
04.10.2025 09:17 β€” πŸ‘ 2913    πŸ” 1478    πŸ’¬ 18    πŸ“Œ 18

One more thought: AI tools are a very useful research accelerator for an expert, and I plan to use them whenever I can. But at the moment it is very easy to be led down false paths if you let them get ahead of yourself and lure you too far from your expertise.

04.10.2025 19:04 β€” πŸ‘ 10    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

Nikhil's recent paper is a tour de force in causal analysis! They show that LLMs keep track of what characters know in a story using "pointer" mechanisms. Definitely worth checking out.

24.06.2025 17:48 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

I’m excited for COLM this week!

Looking forward to chatting with people about interpretability, data efficient training, cog sci and LLM consistency.

04.10.2025 14:53 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Stefan Zweig, The World of Yesterday, p. 436

04.10.2025 13:20 β€” πŸ‘ 89    πŸ” 21    πŸ’¬ 1    πŸ“Œ 1
Post image Post image Post image Post image

Some important findings in this paper:
1) Working with AI boosts the performance of people solving math, science & ethics questions
2) The biggest boost is for the hardest problems
3) High performers remain highest performing, but low performers gain more
4) People who are good with AI gain most

24.09.2025 00:29 β€” πŸ‘ 97    πŸ” 24    πŸ’¬ 2    πŸ“Œ 3
Video thumbnail

Abughazaleh: I think Kristi Noem should be tried at The Hague. And if the response from ICE to people exercising their first amendment right is to drive vehicles through them, they should not be an agency in the US.

04.10.2025 02:04 β€” πŸ‘ 25215    πŸ” 6877    πŸ’¬ 666    πŸ“Œ 358

The best writing I’ve seen on this topic is the essay β€œTechnically Radical: On the Unrecognized Potential of Tech Workers and Hackers” by @mutual-a.bsky.social

wedontagree.net/technically-...

03.10.2025 00:06 β€” πŸ‘ 12    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Gift 🎁 Article

www.nytimes.com/2025/09/30/t...

01.10.2025 19:32 β€” πŸ‘ 30    πŸ” 12    πŸ’¬ 0    πŸ“Œ 1
trade meme. open ai receives: total sum of creative output from all humanity, $500 billion valuation
you receive: polluted internet, polluted world, collapse of society and nature of truth, no jobs, can put your face in my slop app

trade meme. open ai receives: total sum of creative output from all humanity, $500 billion valuation you receive: polluted internet, polluted world, collapse of society and nature of truth, no jobs, can put your face in my slop app

i made this meme which is better than the article:

01.10.2025 20:32 β€” πŸ‘ 912    πŸ” 219    πŸ’¬ 5    πŸ“Œ 2
Preview
This social app can put your face into fake movie scenes, memes and arrest videos The new Sora social app from ChatGPT maker OpenAI encourages users to upload video of their face so their likeness can be put into AI-generated clips.

OpenAI is essentially a social arsonist, developing and releasing tools that hyper scale the most racist, misogynistic, and toxic elements of society, lowering the barriers for all manner of abuse. The so called guardrails make a pinky swear look like an ironclad contract.

02.10.2025 12:47 β€” πŸ‘ 512    πŸ” 207    πŸ’¬ 8    πŸ“Œ 26
Post image

As part of #SeattleAIWeek, we're hosting "AI Innovation in the Open" on Oct. 30 from 2-4:30pmβ€”an afternoon of live demos and hands-on tutorials at Ai2 HQ. πŸ‘‡

02.10.2025 19:06 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

thanks!

02.10.2025 16:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
daniel:// stenberg://
@bagder@mastodon.social

Joshua Rogers sent us a *massive* list of potential issues in #curl that he found using his set of AI assisted tools. Code analyzer style nits all over. Mostly smaller bugs, but still bugs and there could be one or two actual security flaws in there. Actually truly awesome findings.

I have already landed 22(!) bugfixes thanks to this, and I have over twice that amount of issues left to go through. Wade through perhaps.

Credited "Reported in Joshua's sarif data" if you want to look for yourself

daniel:// stenberg:// @bagder@mastodon.social Joshua Rogers sent us a *massive* list of potential issues in #curl that he found using his set of AI assisted tools. Code analyzer style nits all over. Mostly smaller bugs, but still bugs and there could be one or two actual security flaws in there. Actually truly awesome findings. I have already landed 22(!) bugfixes thanks to this, and I have over twice that amount of issues left to go through. Wade through perhaps. Credited "Reported in Joshua's sarif data" if you want to look for yourself

Joshua Rogers, using AI tooling responsibly and professionally, reported 22+ genuine issues in curl that are now being addressed

Especially notable because curl had problems with floods of garbage slop AI "security issues" in the past that were nothing of the sort simonwillison.net/2025/Oct/2/c...

02.10.2025 15:15 β€” πŸ‘ 114    πŸ” 18    πŸ’¬ 2    πŸ“Œ 3

🎯

02.10.2025 15:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@juand-r is following 20 prominent accounts