@rao2z.bsky.social
AI researcher & teacher at SCAI, ASU. Former President of AAAI & Chair of AAAS Sec T. Here to tweach #AI. YouTube Ch: http://bit.ly/38twrAV Twitter: rao2z
My talk at Samsung AI Forum yesterday
www.youtube.com/watch?v=L2nA...
In the year since LRMs ("reasoning models") hit the scene, we have been trying to understand, analyze and demystify them.. Here are our efforts to date--conveniently all in one place..👇
www.linkedin.com/posts/subbar...
𝐏𝐞𝐫𝐟𝐨𝐫𝐦𝐚𝐭𝐢𝐯𝐞 𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠? The anthropomorphization of LRM intermediate tokens as thinking begat a cottage industry to "get efficiency by shortening thinking." We ask: 𝗜𝘀 𝗖𝗼𝗧 𝗹𝗲𝗻𝗴𝘁𝗵 𝗿𝗲𝗮𝗹𝗹𝘆 𝗮 𝗿𝗲𝗳𝗹𝗲𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝗽𝗿𝗼𝗯𝗹𝗲𝗺 𝗵𝗮𝗿𝗱𝗻𝗲𝘀𝘀 𝗼𝗿 𝗶𝘀 𝗶𝘁 𝗺𝗼𝗿𝗲 𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝘃𝗲? 👉 www.linkedin.com/posts/subbar...
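To make the question concrete, here is a minimal sketch (hypothetical helpers, not the paper's pipeline) of one way to test it: if CoT length genuinely tracked problem hardness, the two should be strongly rank-correlated across instances.

```python
# A hedged sketch of the "is CoT length performative?" question.
# `hardness` and `trace_length` are hypothetical per-instance measures,
# e.g., optimal plan length vs. number of intermediate tokens emitted.
from scipy.stats import spearmanr

def length_hardness_correlation(instances, hardness, trace_length):
    h = [hardness(i) for i in instances]
    t = [trace_length(i) for i in instances]
    rho, p = spearmanr(h, t)   # rank correlation, robust to scale
    return rho, p              # a low rho suggests length is performative
```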
10.09.2025 16:50 — 👍 4 🔁 0 💬 0 📌 1
Rejecting papers in #AI Conferences because of "resource constraints" is shooting ourselves in the foot as a community; use Findings.. #SundayHarangue 👇
x.com/rao2z/status...
Proofs are not reasoning traces & I/O Format Language shouldn't be much of an issue for LLMs + other things #SundayHarangue (Special IMO edition). 🧵 👇
x.com/rao2z/status...
Both LLMs and LRMs are upper-bounded by humanity's knowledge closure. True scientific discoveries are, by definition, outside that closure. Ergo, LLMs/LRMs are great force multipliers for us; but they don't support the "Nobel this weekend" hype..
👉 www.linkedin.com/posts/subbar...
Computational Complexity is the wrong measure for LRMs (as it was for LLMs)--think distributional distance instead #SundayHarangue (yes, we're back!)
👉 x.com/rao2z/status...
A̶̶̶I̶̶̶ ̶ ̶ ̶ ̶(̶A̶r̶t̶i̶f̶i̶c̶i̶a̶l̶ ̶I̶n̶t̶e̶l̶l̶i̶g̶e̶n̶c̶e̶)̶
̶̶̶A̶̶̶G̶̶̶I̶̶̶ ̶(̶A̶r̶t̶i̶f̶i̶c̶i̶a̶l̶ ̶G̶e̶n̶e̶r̶a̶l̶ ̶I̶n̶t̶e̶l̶l̶i̶g̶e̶n̶c̶e̶)̶
̶̶̶A̶̶̶S̶̶̶I̶̶̶ ̶(̶A̶r̶t̶i̶f̶i̶c̶i̶a̶l̶ ̶S̶u̶p̶e̶r̶ ̶I̶n̶t̶e̶l̶l̶i̶g̶e̶n̶c̶e̶)
ASDI (Artificial Super Duper Intelligence)
Don't get stuck with yesterday's hypeonyms!
Dare to get to the next level!
#AIAphorisms
This series of lectures was given the same week as all that brouhaha over the Apple illusion paper (I was giving these lectures during the day and talking to reporters in the evening 😅). As such, they are pretty up-to-date! 3/
x.com/rao2z/status...
The lectures start with a "big picture" overview (Lecture 1); focus on standard LLMs and their limitations, and LLM-Modulo as a test-time scaling approach (Lecture 2); and end with a critical appraisal of the test-time scaling and RL post-training techniques (Lecture 3). 2/
19.06.2025 22:27 — 👍 0 🔁 0 💬 1 📌 0
For anyone interested, here are the videos of the three ~50-minute lectures on the reasoning/planning capabilities of LLMs/LRMs that I gave at #ACDL2025 at the Riva del Sole resort last week. 1/
www.youtube.com/playlist?lis...
...it basically confirmed what is already well-established: LLMs (& LRMs & "LLM agents") have trouble w/ problems that require many steps of reasoning/planning.
See, e.g., lots of recent papers by Subbarao Kambhampati's group at ASU. (2/2)
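The evaluation pattern behind such findings is simple to sketch: bucket instances by the number of reasoning/planning steps required and watch accuracy per bucket fall as depth grows. Below, `solve_with_llm`, `check`, and the problem format are hypothetical stand-ins, not any specific paper's harness.

```python
# A minimal sketch of accuracy-vs-reasoning-depth evaluation:
# group problems by required solution depth, measure accuracy per group.
from collections import defaultdict

def accuracy_by_depth(problems, solve_with_llm, check):
    hits, totals = defaultdict(int), defaultdict(int)
    for p in problems:
        d = p["num_steps"]                 # e.g., optimal plan length
        totals[d] += 1
        hits[d] += int(check(p, solve_with_llm(p)))
    # Accuracy dropping off sharply with d is the signature noted above.
    return {d: hits[d] / totals[d] for d in sorted(totals)}
```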
An AGI-wannabe reasoning model whining that it couldn't handle a problem because its context window isn't big enough is like a superman-wannabe little kid protesting that he couldn't add those numbers because he doesn't have enough fingers and toes.. #AIAphorisms
16.06.2025 00:47 — 👍 3 🔁 0 💬 0 📌 0
"our counter-intuitive results demonstrate ways in which common interpretations of Large Reasoning Models may be anthropomorphizations or simplifications" arxiv.org/abs/2505.13775
01.06.2025 13:30 — 👍 55 🔁 11 💬 2 📌 1
The transformer expressiveness results are often a bit of a red herring, as there tends to be a huge gap between what can be expressed in transformers and what can be learned with gradient descent. Mind the Gap, a new paper with Lucas Saldyt, dives deeper into this issue 👇👇
x.com/SaldytLucas/...
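As a toy illustration of that gap (a sketch under my own assumptions, not the paper's experiments): PARITY is expressible by small transformers in principle, yet a small transformer trained with plain gradient descent often hovers near chance on held-out strings.

```python
# A toy sketch of the expressiveness-vs-learnability gap on PARITY.
# Whether this exact run learns depends on hyperparameters; the point
# is the gap between what is expressible and what SGD tends to find.
import torch
import torch.nn as nn

torch.manual_seed(0)
SEQ_LEN, D_MODEL = 20, 32

def make_batch(n):
    x = torch.randint(0, 2, (n, SEQ_LEN))
    return x, x.sum(dim=1) % 2          # label = parity of the bits

class TinyTransformer(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(2, D_MODEL)
        self.pos = nn.Parameter(0.02 * torch.randn(SEQ_LEN, D_MODEL))
        layer = nn.TransformerEncoderLayer(
            D_MODEL, nhead=4, dim_feedforward=64, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(D_MODEL, 2)

    def forward(self, x):
        h = self.encoder(self.embed(x) + self.pos)
        return self.head(h.mean(dim=1))  # mean-pool, then classify

model = TinyTransformer()
opt = torch.optim.Adam(model.parameters(), lr=3e-4)
loss_fn = nn.CrossEntropyLoss()
x_train, y_train = make_batch(2000)
x_test, y_test = make_batch(500)

for step in range(2000):                 # vanilla minibatch training
    idx = torch.randint(0, len(x_train), (64,))
    loss = loss_fn(model(x_train[idx]), y_train[idx])
    opt.zero_grad(); loss.backward(); opt.step()

with torch.no_grad():
    acc = (model(x_test).argmax(dim=1) == y_test).float().mean().item()
print(f"held-out parity accuracy: {acc:.2f} (chance = 0.50)")
```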
Anthropomorphization of intermediate tokens as reasoning/thinking traces isn't quite a harmless fad, and may be pushing LRM research into questionable directions.. So we decided to put together a more complete argument. Paper 👉 arxiv.org/pdf/2504.09762 (Twitter thread: x.com/rao2z/status...)
28.05.2025 13:41 — 👍 10 🔁 1 💬 0 📌 1
Longer thread here
x.com/rao2z/status...
This RLiNo? paper (arxiv.org/abs/2505.13697), led by Soumya Samineni and Durgesh Kalwar, dives into the MDP model used in the RL post-training methods inspired by DeepSeek R1, and asks if some of the idiosyncrasies of RL aren't just consequences of the simplistic structural assumptions made..
25.05.2025 22:51 — 👍 4 🔁 0 💬 1 📌 0
Do Intermediate Tokens Produced by LRMs (need to) have any semantics? Our new study 👇
Thread 👉 x.com/rao2z/status...
Delighted to share that Siddhant Bhambri & Mudit Verma's critical evaluation and refutation of the reasoning claims of ReAct has been accepted to #TMLR (Transactions on Machine Learning Research)
👉 https://openreview.net/forum?id=aFAMPSmNHR
Solving Single Agent Fully Observable Deterministic (SAFODP) Problems with Dec-POMDP approaches #SundayHarangue #allegory
x.com/rao2z/status...
IMHO, the whole idea of connecting the "length of intermediate tokens" produced by LRMs to inference-time compute is a mind-boggling demonstration of circular reasoning--one that comes from the assumptions baked into the MDP and reward models.. 👇
x.com/rao2z/status...
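For what those assumptions look like, here is a minimal sketch (my reading of the R1-style setup, not any paper's code) of the token-level MDP: states are prefixes, actions append a token, transitions are deterministic, and reward is outcome-only, so nothing in the objective ties the intermediate tokens to any semantics, or their length to "compute."

```python
# A hedged sketch of the token-level MDP commonly assumed in R1-style
# RL post-training; `verifier` is a hypothetical outcome checker.
from dataclasses import dataclass

@dataclass(frozen=True)
class State:
    prompt: str
    generated: tuple   # tokens emitted so far (the "trace")

def step(state, token, verifier, eos="<eos>"):
    """Deterministic transition: the action just appends one token."""
    nxt = State(state.prompt, state.generated + (token,))
    done = (token == eos)
    # Sparse, outcome-only reward: every intermediate token gets 0, so
    # the objective is agnostic to what the trace says, or how long it is.
    reward = float(verifier(nxt.prompt, nxt.generated)) if done else 0.0
    return nxt, reward, done
```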
It ain't "The Bitter Lesson" if you are in the loop curating the training data for your LLM, y'all.. Pick your lesson, will ya? #SundayHarangue (h/t @kstechly.bsky.social)
05.05.2025 11:44 — 👍 4 🔁 3 💬 0 📌 0
Don't use summarizers for the papers by @rao2z.bsky.social, because the reasoning traces therein are, unlike those of the LRMs & LLMs under investigation, substantively meaningful, semantically well-ordered, and stylistically compelling and engaging!
#AI #LLMs #CoT
arxiv.org/abs/2504.09762
Here is a recording of my talk at @msftresearch.bsky.social last week, titled "(How) Do LLMs Reason/Plan?" (Also gave a version of it as a distinguished lecture at Oracle today..)
www.youtube.com/watch?v=0u2h...
A preprint available at arxiv.org/abs/2504.09762
15.04.2025 17:23 — 👍 3 🔁 0 💬 0 📌 1
(With @kstechly.bsky.social & Karthik Valmeekam)
13.04.2025 17:39 — 👍 0 🔁 0 💬 0 📌 0
Our invited commentary for the Annals of the New York Academy of Sciences, titled "(How) Do reasoning models reason?", is now online
👉 nyaspubs.onlinelibrary.wiley.com/doi/epdf/10....
It is a written version of my recent talks (and #SundayHarangues) on the latest developments in LRMs..
Woo hoo.. Our first #TMLR paper!🤗 On the planning and scheduling abilities of LRMs o1 & R1 (w/ Karthik, Kaya, Atharva)
👉 openreview.net/forum?id=FkK...
Even a jaded researcher like me has to admit that Transactions on Machine Learning Research is a veritable oasis among #AI publication venues! 🙏
AI Hype: The phenomenon where experts without expertise hype up imminent arrival of expertise without experts. #AIAphorisms
30.03.2025 08:26 — 👍 10 🔁 4 💬 0 📌 0