Lucas Dixon

@iislucas.bsky.social

Machine learning, interpretability, visualization, Language Models, People+AI research

504 Followers  |  437 Following  |  14 Posts  |  Joined: 17.12.2023  |  2.0102

Latest posts by iislucas.bsky.social on Bluesky

Read smarter, not harder, with Lumi As research engineers, our reading lists are always exciting… and also way too long to finish. While we’d love to go through every research…

Lumi helps you understand a paper more quickly, ask questions, and get to the parts of the original content that you are looking for. Would love to hear what you think!

Read more: medium.com/people-ai-re...

Github: github.com/PAIR-code/lumi

01.10.2025 15:56 - 👍 0    🔁 0    💬 0    📌 0
Lumi: A reading prototype by Google PAIR Explore research papers with AI features including annotations, granular summaries, and custom Q&A. Prototype by People & AI Research (PAIR) at Google

New open-source AI-assisted reading experience we built for arXiv papers: lumi.withgoogle.com

01.10.2025 15:55 - 👍 1    🔁 0    💬 1    📌 0
An image with the Vancouver skyline and the words "sign up to review". At the top are the logos of both the Actionable Interpretability workshop (a magnifying glass) and the ICML conference (a brain).

🚨 We're looking for more reviewers for the workshop!
📆 Review period: May 24-June 7

If you're passionate about making interpretability useful and want to help shape the conversation, we'd love your input.

💡🔍 Self-nominate here:
docs.google.com/forms/d/e/1F...

20.05.2025 00:05 - 👍 6    🔁 5    💬 0    📌 0
ARBORproject arborproject.github.io · Discussions Explore the GitHub Discussions forum for ARBORproject arborproject.github.io. Discuss code, ask questions & collaborate with the developer community.

Take a look at some initial research projects, and see if there's one you'd like to work on:
github.com/ARBORproject...
Or propose your own idea! There are many ways to contribute, and we welcome all of them.

20.02.2025 19:55 - 👍 8    🔁 2    💬 1    📌 0

Great thread describing the new ARBOR open interpretability project, which has some fascinating projects already. Take a look!

20.02.2025 22:50 - 👍 8    🔁 2    💬 0    📌 0

Looking for a small or medium sized VLM? PaliGemma 2 spans more than 150x of compute!

Not sure yet if you want to invest the time 🪄finetuning🪄 on your data? Give it a try with our ready-to-use "mix" checkpoints:

🤗 huggingface.co/blog/paligem...
🎤 developers.googleblog.com/en/introduci...

19.02.2025 17:47 - 👍 19    🔁 7    💬 0    📌 0

In December, I posted about our new paper on mastering board games using internal + external planning. πŸ‘‡

Here's a talk about it, now on YouTube, given by my awesome colleague John Schultz!

www.youtube.com/watch?v=JyxE...

17.01.2025 17:26 - 👍 35    🔁 11    💬 1    📌 0

Yeah, I'm skeptical of how good LLMs alone can be, but when they get to use existing search-based theorem provers and lookup tools (SAT, induction provers, etc.), I would expect them to do a good deal better w.r.t. gap sizes and the ability to find counterexamples.

09.01.2025 08:07 - 👍 9    🔁 0    💬 0    📌 0

I have a hope that modern AI/LLMs might help here: by helping translate informal papers to formal mathematical statements, and informal proofs to formal proofs, and thereby help highlight gaps and find counterexamples. A few others are interested in this... @wattenberg.bsky.social maybe?

02.01.2025 11:33 - 👍 12    🔁 0    💬 2    📌 0

Love the idea! Are there any stats / evals for it? And how does one get to play with it? :)

21.12.2024 09:49 - 👍 3    🔁 0    💬 0    📌 0
Research scholar program Overview

Google Research Scholar programme applications are open until 27th Jan. The programme supports early-career professors (PhD received within seven years of submission).
research.google/programs-and...

21.12.2024 09:46 - 👍 12    🔁 2    💬 0    📌 0

What's in an attention head? 🤯

We present an efficient framework – MAPS – for inferring the functionality of attention heads in LLMs ✨directly from their parameters✨

A new preprint with Amit Elhelo 🧵 (1/10)

18.12.2024 17:55 - 👍 62    🔁 13    💬 1    📌 0

We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus!
medium.com/people-ai-re...

13.12.2024 18:57 - 👍 36    🔁 8    💬 2    📌 5

I think this (LLMs make writing small scripts super easy) gets more profound again when we start to make lots of tiny voice-powered apps/agents... would love to see more prototyping tools for this and play with them. Send me pointers!

11.12.2024 17:49 - 👍 3    🔁 0    💬 0    📌 0

Option 1.

11.12.2024 00:35 - 👍 0    🔁 0    💬 0    📌 0

I just wish search was semantic instead of substring! Still fun to see and explore! Btw - which embedding model did you use and what input text per paper?

11.12.2024 00:33 - 👍 1    🔁 0    💬 1    📌 0

That's neat! Did you also try a few different other styles/mood boards?

30.11.2024 08:45 - 👍 0    🔁 0    💬 0    📌 0

🚨 New Paper 🚨
Can LLMs perform latent multi-hop reasoning without exploiting shortcuts? We find the answer is yes – they can recall and compose facts not seen together in training, without guessing the answer, but success greatly depends on the type of the bridge entity (80% for country, 6% for year)! 1/N

27.11.2024 17:26 - 👍 67    🔁 14    💬 3    📌 1
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step When leveraging language models for reasoning tasks, generating explicit chain-of-thought (CoT) steps often proves essential for achieving high accuracy in final outputs. In this paper, we investigate...

arxiv.org/abs/2405.14838: using a supervised learning curriculum that incrementally eliminates the start of a CoT, they are able to train gpt2-small to do 9-digit multiplication without CoT; a fascinating and impressive result!

29.11.2024 09:27 - 👍 2    🔁 0    💬 0    📌 0
A Visual Dive into Conditional Flow Matching | ICLR Blogposts 2025 Conditional flow matching (CFM) was introduced by three simultaneous papers at ICLR 2023, through different approaches (conditional matching, rectifying flows and stochastic interpolants). The m...

I'm learning a lot from this visually rich blog post. Also, I'm charmed by the rotating list of equal-contribution authors. Good knights-of-the-round-table energy!
dl.heeere.com/conditional-...

27.11.2024 18:37 - 👍 23    🔁 5    💬 2    📌 0

I want to describe my experience of coding with AI, because it seems to differ from other people's expectations. Earlier this morning, I saw a beautiful image here, based on roots of polynomials: bsky.app/profile/scon...
I wanted to try this idea myself, but with animation in a Javascript context!

17.11.2024 17:05 - 👍 58    🔁 10    💬 2    📌 1

It's so beautiful to see this kind of fluid interaction with huge data!

21.11.2024 18:07 - 👍 16    🔁 2    💬 1    📌 0
Many circles of different sizes, representing a visualization of inequality

The Gini coefficient is the standard way to measure inequality, but what does it mean, concretely? I made a little visualization to build intuition:
www.bewitched.com/demo/gini

23.11.2024 15:35 - 👍 199    🔁 57    💬 10    📌 8
Galilean Moons Timeline A meditative visualization of Jupiter's Galilean moons

A meditative toy, visualizing Jupiter's Galilean moons:
www.bewitched.com/demo/jupiter/

24.11.2024 15:29 - 👍 46    🔁 13    💬 5    📌 2

And here is the context page for the DeepMind student researcher programme: deepmind.google/about/studen...

21.11.2024 09:50 - 👍 0    🔁 0    💬 0    📌 0
Student Researcher, 2025 - Google Careers

People can now apply for Student Researcher roles (basically a kind of internship) at Google/DeepMind (until Dec 13)
www.google.com/about/career...

20.11.2024 21:03 - 👍 1    🔁 0    💬 1    📌 0

Neat! Are there any human-eval results comparing the output w.r.t. things like hallucinations and human enjoyment? I'd love to see evals here!

30.10.2024 09:51 - 👍 0    🔁 0    💬 0    📌 0

@iislucas is following 19 prominent accounts