@juliusad.bsky.social
ML researcher, building interpretable models at Guide Labs (guidelabs.bsky.social).
Looks like Tesla's models sometimes confuse train tracks with road lanes.
04.01.2025 21:23
OLMo 2 tech report is out!
We get into the weeds with this one, with 50+ pages on 4 crucial components of the LLM development pipeline:
The LCMs are cool, though it is early days. They give us a knob (concept representations) to understand and change the model's outputs. There is no reason why an LCM should not also have a CoT (or be able to reason via search/planning)... we just have to ask it :)
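To make the "knob" idea concrete, here is a toy activation-steering sketch with an ordinary decoder-only LM. This is not Meta's LCM pipeline (LCMs work in a sentence/concept embedding space); the model name, layer index, prompts, and steering strength below are all placeholder assumptions.

```python
# Toy "concept knob": nudge a hidden layer of a plain causal LM along a concept
# direction during generation. NOT Meta's LCM mechanism; just an illustration of
# changing outputs via a concept-level representation. Everything here
# (model, LAYER, prompts, strength) is a placeholder assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any causal LM works
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

LAYER = 6  # which transformer block to steer (assumption)

def mean_hidden(text: str) -> torch.Tensor:
    """Mean hidden state after block LAYER for a piece of text."""
    ids = tok(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**ids, output_hidden_states=True)
    # +1 because hidden_states[0] is the embedding layer output.
    return out.hidden_states[LAYER + 1].mean(dim=1).squeeze(0)

# Concept direction = difference of means between contrasting texts.
concept = mean_hidden("The weather is wonderful and everyone is happy.") \
        - mean_hidden("The weather is terrible and everyone is miserable.")
concept = concept / concept.norm()

def steer_hook(module, inputs, output):
    # GPT-2 blocks return a tuple; element 0 is the hidden states.
    hidden = output[0] + 4.0 * concept  # 4.0 is the "knob"; needs tuning
    return (hidden,) + output[1:]

handle = model.transformer.h[LAYER].register_forward_hook(steer_hook)
try:
    ids = tok("My day so far has been", return_tensors="pt")
    gen = model.generate(**ids, max_new_tokens=30, do_sample=False)
    print(tok.decode(gen[0], skip_special_tokens=True))
finally:
    handle.remove()
```

The same spirit would apply to an LCM, except the knob would live in the concept-embedding space rather than in a token model's hidden states.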
03.01.2025 23:32
The reasoning models are cool, though: they explicitly enforce dependence on the model's CoT, so here it should be a reliable explanation (? not sure tho). Played with 'thinking' Gemini: it generates pages of CoT sometimes, and now we have to figure out what (and which part) is relevant.
03.01.2025 23:32
This reminds me of all the issues with heatmaps and probes. The model really has no incentive to rely on its CoT unless it is explicitly asked to do so via fine-tuning or some kind of penalty.
03.01.2025 23:32
You always ask the right questions :) I don't think the chain-of-thought of current models (except the reasoning ones) gives reliable insight into them. The issue is that the CoT is an output (and input) of the model, and you can change it in all sorts of ways without affecting the model's output.
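One crude way to probe this, as a behavioral check rather than a true causal intervention on internals: regenerate with the model's own CoT corrupted and see whether the final answer moves. The `ask` callable, the prompt wording, and the `flip_numbers` corruption below are placeholder assumptions, not anything from this thread.

```python
# Sketch of a CoT-reliance check: get an answer with the model's own CoT, then
# re-ask the same question with that CoT corrupted and prefilled, and compare the
# final answers. `ask` is whatever model call you use (placeholder); the prompt
# format and the toy corruption are assumptions for illustration only.
import re
from typing import Callable, Tuple

def split_cot_and_answer(output: str) -> Tuple[str, str]:
    """Assumes the model was told to finish with a line 'Answer: <answer>'."""
    cot, _, answer = output.rpartition("Answer:")
    return cot.strip(), answer.strip()

def flip_numbers(cot: str) -> str:
    """Toy corruption: bump every number in the trace by one."""
    return re.sub(r"\d+", lambda m: str(int(m.group()) + 1), cot)

def cot_reliance_check(ask: Callable[[str], str], question: str,
                       corrupt: Callable[[str], str] = flip_numbers) -> bool:
    """True if corrupting the model's own CoT changes its final answer."""
    base_prompt = (f"{question}\nThink step by step, "
                   "then finish with a line 'Answer: <answer>'.")
    cot, answer = split_cot_and_answer(ask(base_prompt))

    # Re-ask with a corrupted version of the model's own reasoning prefilled.
    corrupted_prompt = (f"{question}\nHere is the reasoning so far:\n"
                        f"{corrupt(cot)}\nNow finish with a line 'Answer: <answer>'.")
    _, corrupted_answer = split_cot_and_answer(ask(corrupted_prompt))
    return corrupted_answer != answer
```

If the answer never changes under corruption, that is evidence the stated CoT is not load-bearing; if it flips, the answer at least depends on the trace in some way.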
03.01.2025 23:32
It is too early to tell :) I like the papers on your list, but I think only a few of them were instant "classics".
Having said that, I like the Large Concept Models paper from Meta.
Is the final output actually "causally" dependent on the long CoT generated? How key are these traces to the search/planning clearly happening here? So many questions, so few answers.
21.12.2024 19:12
Great to see clarification comments. o3 is impressive nonetheless.
Played around with o1 and the 'thinking' Gemini model. The CoT output (for Gemini) can be confusing and convoluted, but it got 3/5 problems right. Stopped on the remaining 2.
These models are an impressive interpretability test bed.
New paper. We show that the representations of LLMs, up to 3B params(!), can be engineered to encode biophysical factors that are meaningful to experts.
We don't have to hope Adam magically finds models that learn useful features; we can optimize for models that encode for interpretable features!
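The general recipe, sketched below, is not the paper's actual method, just the generic version of "optimizing for interpretable features": add an auxiliary loss that makes a chosen hidden layer predict expert-defined concept labels, alongside the usual LM objective. The model, layer, label count, and loss weight are placeholder assumptions.

```python
# Generic sketch of optimizing for interpretable features: LM loss plus an
# auxiliary loss that makes a pooled hidden layer linearly predict known concept
# labels. NOT the paper's method; model, LAYER, N_CONCEPTS, LAMBDA are placeholders.
import torch
import torch.nn as nn
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"                      # placeholder
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

LAYER = 6                                # hidden layer to supervise (assumption)
N_CONCEPTS = 4                           # number of expert-defined factors (assumption)
probe = nn.Linear(model.config.hidden_size, N_CONCEPTS)

opt = torch.optim.AdamW(list(model.parameters()) + list(probe.parameters()), lr=1e-5)
LAMBDA = 0.1                             # weight of the concept term (assumption)

def training_step(text: str, concept_labels: torch.Tensor) -> float:
    """One step: next-token LM loss + concept-probe loss on the mean hidden state."""
    ids = tok(text, return_tensors="pt")
    out = model(**ids, labels=ids["input_ids"], output_hidden_states=True)
    pooled = out.hidden_states[LAYER].mean(dim=1)          # (1, hidden_size)
    concept_loss = nn.functional.binary_cross_entropy_with_logits(
        probe(pooled), concept_labels.unsqueeze(0)
    )
    loss = out.loss + LAMBDA * concept_loss
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

# e.g. training_step("some sequence description ...", torch.tensor([1., 0., 0., 1.]))
```

A linear probe is the simplest choice here; the point is that encoding the concepts becomes part of the training objective instead of something you hope emerges on its own.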
Pinging into the void.
18.11.2024 03:31