Charts and graphs help people analyze data, but can they also help AI?
In a new paper, we provide initial evidence that it does! GPT 4.1 and Claude 3.5 describe three synthetic datasets more precisely and accurately when raw data is accompanied by a scatter plot. Read more inπ§΅!
04.08.2025 14:17 β π 7 π 2 π¬ 1 π 0
What AI Thinks It Knows About You
What happens when people can see what assumptions a large language model is making about them?
AI is often thought of as a black box -- no way to know what's going on inside. That's changing in eye-opening ways. Researchers are finding "beliefs" models are forming as they converse, and how those beliefs correlate to what the models say and how they say it.
www.theatlantic.com/technology/a...
21.05.2025 15:32 β π 32 π 13 π¬ 4 π 2
Historical popularity chart showing the popularity of Oliver rising to meet the previously much greater popularity of Olivia
The interactive NameGrapher is updated with 2024 baby name popularity stats! Come explore--and marvel that Oliver and Olivia have converged namerology.com/baby-name-gr...
12.05.2025 19:09 β π 8 π 2 π¬ 1 π 0
A wonderful visualization for those of us obsessed by sunlight and geography!
12.05.2025 13:36 β π 25 π 1 π¬ 1 π 0
An incredibly rich, detailed view of neural net internals! There are so many insights in these papers. And the visualizations of "addition circuit" features are just plain cool!
27.03.2025 20:00 β π 17 π 2 π¬ 0 π 0
Great news, congrats! And glad youβll still be in the neighborhood!
27.03.2025 16:13 β π 1 π 0 π¬ 0 π 0
I'd be curious about advice on teaching non-coders how to test programs they've written with AI. I'm not thinking unit tests so much as things like making sure you can drill down for verifiable details in a visualizationβbasic practices that are good on their own, but also help catch errors.
24.03.2025 19:45 β π 10 π 0 π¬ 2 π 0
Now that we have vibe coding, we need vibe testing!
24.03.2025 19:45 β π 23 π 4 π¬ 7 π 0
Oh, that looks super relevant and fascinating, reading through it now...
21.03.2025 20:19 β π 1 π 0 π¬ 0 π 0
Ha! I think (!) that for me, the word "calculate" connotes narrow precision and correctness, whereas "think" is more expansive but also implies more fuzziness and the possibility of being wrong. That said, your observation does give me pause!
21.03.2025 20:10 β π 0 π 0 π¬ 0 π 0
Interesting question! I haven't calculated this, but @yidachen.bsky.social might know
21.03.2025 19:36 β π 1 π 0 π¬ 0 π 0
Colorful depictions of reasoning progress: most of the time the system settles on the correct answer but sometimes it vacillates in interesting ways.
This is a common pattern, but we're also seeing some others! Here are similar views for multiple-choice abstract algebra questions (green is the correct answer; other colors are incorrect answers) You can see many more at yc015.github.io/reasoning-pr... cc @yidachen.bsky.social
21.03.2025 19:18 β π 5 π 0 π¬ 3 π 0
GitHub - ARBORproject/arborproject.github.io
Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.
Very cool! You're definitely not alone in finding this fascinating. If you're looking for other people interested in this kind of thing, drop by the Arbor Project page, if you haven't already. github.com/ArborProject...
13.03.2025 17:44 β π 3 π 0 π¬ 1 π 0
The wind map at hint.fm/wind/ has been running since 2012, relying on weather data from NOAA. We added a notice like this today. Thanks to @cambecc.bsky.social for the inspiration.
03.03.2025 01:57 β π 78 π 22 π¬ 1 π 1
It's based on a data set of multiple-choice questions that have a known right answer, so this visualization only works when you have labeled ground truth. Definitely wouldn't shock me if those answers were labeled by grad students, though!
26.02.2025 01:03 β π 3 π 0 π¬ 0 π 0
Great questions! Maybe it would be faster... or maybe it's doing something important under the hood that we can't see? I genuinely have no idea.
25.02.2025 21:36 β π 1 π 0 π¬ 1 π 0
We also see cases where it starts out with the right answer, but eventually "convinces itself" of the wrong answer! I would love to understand the dynamics better.
25.02.2025 21:34 β π 1 π 0 π¬ 0 π 0
Reasoning or Performing Β· ARBORproject arborproject.github.io Β· Discussion #11
Research Question When asked the DeepSeek Distilled R1 models a challenging abstract algebra question, they often generated hundreds of tokens of CoT before providing the final answer. Yet, on some...
You can see the model go down the wrong path, "realize" it's not right, then find the correct answer! To see more visualizations, or if you have related ideas, join the discussion here!
github.com/ARBORproject... (vis by @yidachen.bsky.social in conversation with @diatkinson.bsky.social )
25.02.2025 18:44 β π 11 π 0 π¬ 1 π 0
Neat visualization that came up in the ARBOR project: this shows DeepSeek "thinking" about a question, and color is the probability that, if it exited thinking, it would give the right answer. (Here yellow means correct.)
25.02.2025 18:44 β π 80 π 15 π¬ 7 π 2
Chain of Thought for Tsumego (Go Life or Death) Problems
Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.
Thank you! That's a great write-up, and this is definitely an interesting experiment. The distinction between how the model might do parsing vs. solving is very much worth thinking about. I added a few thoughts on the wiki page. github.com/ARBORproject...
22.02.2025 15:52 β π 2 π 0 π¬ 1 π 0
Great thread describing the new ARBOR open interpretability project, which has some fascinating projects already. Take a look!
20.02.2025 22:50 β π 8 π 2 π¬ 0 π 0
Observations
Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.
Excellent idea! I just added an "observations" index page on the wiki for things like that.
github.com/ARBORproject...
20.02.2025 21:50 β π 1 π 0 π¬ 1 π 0
ARBORproject arborproject.github.io Β· Discussions
Explore the GitHub Discussions forum for ARBORproject arborproject.github.io. Discuss code, ask questions & collaborate with the developer community.
Take a look at some initial research projects, and see if there's one you'd like to work on:
github.com/ARBORproject...
Or propose your own idea! There are many ways to contribute, and we welcome all of them.
20.02.2025 19:55 β π 7 π 2 π¬ 1 π 0
GitHub - ARBORproject/arborproject.github.io
Contribute to ARBORproject/arborproject.github.io development by creating an account on GitHub.
Today we're launching a multi-lab open collaboration, the ARBOR project, to accelerate AI interpretability research for reasoning models. Please join us!
github.com/ARBORproject...
(ARBOR = Analysis of Reasoning Behavior through Open Research)
20.02.2025 19:55 β π 44 π 9 π¬ 1 π 0
Ah, thatβs a great connection!
20.01.2025 02:59 β π 2 π 0 π¬ 0 π 0
Out: rectangular maps of the globe
In: rectangular maps of butterfly wings!
19.01.2025 19:48 β π 13 π 0 π¬ 2 π 0
I'm realizing that many people hear "generative art" (a term going back at least to the 60s!) as synonymous with generative AI. Is this how cryptographers feel about the new meaning of "crypto"?
17.01.2025 12:36 β π 25 π 1 π¬ 3 π 0
What a beauty! This is comet C/2024 G3 (ATLAS) passing through the field of view of the LASCO C3 coronagraph.
It wasn't for certain whether it would survive it's closest approach to the sun on January 13th, but it did and delivered us a spectacular show!
#comet #C2024G3 π
16.01.2025 21:14 β π 531 π 199 π¬ 15 π 16
Neuroscientist. Cognitive psychologist in memory, healthy ageing & dementia. Consultant. FENS-Kavli scholar https://fenskavlinetwork.org/
https://dorothytse.com/
https://ageingbetter.wixsite.com/ageing-better-with-a
Helena Vasconcelos' research account. Email helenav@cs.stanford.edu.
https://helenavasc.com
Sakana AI is an AI R&D company based in Tokyo, Japan. πΌπ§
https://sakana.ai/careers
Assistant professor of computer science at Technion
https://belinkov.com/
Associate Professor of Computer Science @University of Massachusetts Amherst | Co-Director of the HCI-VIS Lab. Former Harvard Radcliffe Fellow | Currently on sabbatical @Inria Saclay
Asst Prof at UniversitΓ© de MontrΓ©al, Associate Member of Mila-Quebec AI Institute. PhD from Cornell InfoSci. Creator of ChainForge. Programming and culture, LLM evaluation tooling.
Art from the MoMA's Paintings and Sculpture collection.
The Museum of Modern Art (MoMA) is an art museum located in New York City. #artbots by @nuwaves-future.bsky.social
https://www.moma.org
Mathematician at Stanford
AI Researcher @ Mistral AI | Formally IBM Research | Former Mathematician/Logician/Data scientist | Building AI for math and reasoning
Math professor. My research is mostly in number theory. I am also very involved in Math olympiads.
We are a research institute investigating the trajectory of AI for the benefit of society.
epoch.ai
Machine Learning Librarian at @hf.co
Research director @Inria, Head of @flowersInria
lab, prev. @MSFTResearch @SonyCSLParis
Artificial intelligence, cognitive sciences, sciences of curiosity, language, self-organization, autotelic agents, education, AI and society
http://www.pyoudeyer.com
SVP of Open-Endedness at Lila Sciences. In the past: Maven CEO, Lead at OpenAI, head of basic/core research at Uber AI, professor at UCF.
Stuff I helped invent: NEAT, CPPNs, HyperNEAT, novelty search, POET, Picbreeder.
Book: Why Greatness Cannot Be Plann
Embodied lifelong learning (compositionality, RL, TAMP, robotics). Assistant Professor at Stony Brook ECE. Postdoc at MIT CSAIL, PhD from GRASP lab at Penn.
https://jorge-a-mendez.github.io
RL researcher looking for DACs // What is this AutoRL anyway?
she/her
Currently: Leibniz Uni Hannover
Previously: Uni Freiburg (Master's) | Meta AI London (Intern)
Always & Forever: AutoRL.org
Group Leader in TΓΌbingen, Germany
Iβm π«π· and I work on RL and lifelong learning. Mostly posting on ML related topics.
Research Scientist @ Google DeepMind, in open-ended learning, and AI for Scientific Discovery.
I can be described as a multi-agent artificial general intelligence.
OK, so some people pointed out that I am not in fact artificial, contradicting my bio. To them I would reply that I am likely also a cognitive gadget.
www.jzleibo.com