Andrea Panizza's Avatar

Andrea Panizza

@andreapanizza.bsky.social

ML, trekking, enjoying life

98 Followers  |  143 Following  |  170 Posts  |  Joined: 18.11.2024  |  2.0462

Latest posts by andreapanizza.bsky.social on Bluesky

Post image 31.05.2025 10:31 β€” πŸ‘ 10152    πŸ” 1464    πŸ’¬ 162    πŸ“Œ 72

Interviewer: Can you explain this gap in your resume?

LM researcher: You're right to wonder about the gaps in my resume! They are more common than people think, and there are many valid reasons why someone might have them. Here are some of the most frequent reasons you might see a gap:

12.05.2025 23:39 β€” πŸ‘ 129    πŸ” 11    πŸ’¬ 1    πŸ“Œ 3
Post image

Finally found the time to watch it in full: one of the most interesting and thought provoking LLM conference I’ve seen in a while. You see LLM as latent attention graphs and many structural oddities suddenly falls into place. www.youtube.com/watch?v=J1YC...

12.04.2025 06:45 β€” πŸ‘ 40    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

This is beautiful 🀩 I would have paid for a similar treatment from my team

10.04.2025 19:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Parents Gently Explain To Child That Their Money In Heaven Now HUNTSVILLE, ALβ€”In an effort to comfort the child by telling her the funds had gone to a far better place, local parents Blake and Allison McKee gently explained to their daughter Friday that their mon...

Parents Gently Explain To Child That Their Money In Heaven Now

04.04.2025 21:30 β€” πŸ‘ 18524    πŸ” 3483    πŸ’¬ 162    πŸ“Œ 189

PS I forgot an important caveat: the compute budget should be the same for both methods being HPO'ed. If the nihilist feels like being a PITA, I'll add a curve of the performance of the two methods as a function of the compute budget. If they still don't give up, I give up πŸ€·β€β™‚οΈ life goes on regardless.

01.04.2025 12:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

cross-validation precisely to answer this kind of objection. Is HPO is unfeasible, I agree that the result is just preliminary evidence. I note that since HPO is costly, there's merit in methods which show superior performance with default HP, but I don't insist b/c it's a valid objection. /2

01.04.2025 12:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
We need to talk about random seeds Modern neural network libraries all take as a hyperparameter a random seed, typically used to determine the initial state of the model parameters. This opinion piece argues that there are some safe us...

I start by acknowledging the validity of their objection, because inn general it's true that with different HP, results may have been different (even random seeds may be considered HP, see arxiv.org/abs/2210.13393). Secondly, if we're in a situation where HPO is feasible, I use nested 1/

01.04.2025 12:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

More than 250 people have already enrolled in the Causal Secrets Mini-Course!

All but one review so far are 5-star.

It's free for everyone!

Share it with your friend!

https://bit.ly/4ic4VK4

#CausalSky

11.03.2025 09:29 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 1
Post image Post image

One of the first papers I've seen with RLVR / reinforcement finetuning of vision language models

Looks about as simple as we would expect it to be, lots of details to uncover.

Liu et al. Visual-RFT: Visual Reinforcement Fine-Tuning
buff.ly/DbGuYve
(posted a week ago, oops)

10.03.2025 15:44 β€” πŸ‘ 16    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Preview
Cutting-edge web scraping techniques at NICAR Here's the handout for a workshop I presented this morning at [NICAR 2025](https://www.ire.org/training/conferences/nicar-2025/) on web scraping, focusing on lesser know tips and tricks that became po...

Here's the handout for my "Cutting-edge web scraping techniques" workshop at #NICAR2025 this morning github.com/simonw/nicar...

Plus some extra notes on the custom software I built to support the workshop: simonwillison.net/2025/Mar/8/c...

08.03.2025 19:28 β€” πŸ‘ 147    πŸ” 18    πŸ’¬ 2    πŸ“Œ 2
Post image

This thing now deserves its own name

06.03.2025 21:04 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Career Update: Google DeepMind -> Anthropic TODO

Nicholas Carlini moves to Anthrophic.

nicholas.carlini.com/writing/2025...

05.03.2025 21:22 β€” πŸ‘ 18    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0

Ah, during the time of the big excitement about L5! I remember talking to people in Houston, expecting FSD to be solved soon...things proved to be harder, but a lot of progress has been made!

05.03.2025 17:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

I already advertised for this document when I posted it on arXiv, and later when it was published.

This week, with the agreement of the publisher, I uploaded the published version on arXiv.

Less typos, more references and additional sections including PAC-Bayes Bernstein.

arxiv.org/abs/2110.11216

05.03.2025 01:16 β€” πŸ‘ 109    πŸ” 22    πŸ’¬ 1    πŸ“Œ 3

Thanks!

05.03.2025 07:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Very well written! Did you work in the sector?

05.03.2025 07:42 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Preview
Self Driving Cars are At A Transition Point Cruise leaves the game as Waymo and Tesla ramp up

My self driving car writeup from December (needs an update) open.substack.com/pub/itcanthi...

05.03.2025 05:25 β€” πŸ‘ 10    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Looks like a book or a very long review paper! Can you share the link?

04.03.2025 08:53 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is revised down to -2.8%, and partly precipitated the big flush today. Probably some sovereign wealth fund exited positions across Nasdaq and S&P given the volume of sales.

03.03.2025 21:40 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

If you look at most of the models we've received from OpenAI, Anthropic, and Google in the last 18 months you'll hear a lot of "Most of the improvements were in the post-training phase."

Here's a simple analogy for how so many gains can be made on mostly the same base model:

03.03.2025 16:22 β€” πŸ‘ 28    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1
Preview
Which Attention Heads Matter for In-Context Learning? Large language models (LLMs) exhibit impressive in-context learning (ICL) capability, enabling them to perform new tasks using only a few demonstrations in the prompt. Two different mechanisms have be...

More evidence of the importance of training analysis for interp! Induction heads might serve as *preliminary* function vector heads (which directly compute in-context learning tasks). Ultimately, LMs rely on FV heads more than IH heads for ICL. from @kayoyin.bsky.social

03.03.2025 16:51 β€” πŸ‘ 15    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0
Preview
Young Coders Are Using AI for Everything, Giving "Blank Stares" When Asked How Programs Actually Work Young programmers "can't actually program" because they're too reliant on AI models, writes developer Namanyay Goel.

I'm sure this is fine

futurism.com/young-coders...

24.02.2025 21:43 β€” πŸ‘ 2199    πŸ” 461    πŸ’¬ 173    πŸ“Œ 140
Preview
Minimax Optimal Kernel Two-Sample Tests with Random Features Reproducing Kernel Hilbert Space (RKHS) embedding of probability distributions has proved to be an effective approach, via MMD (maximum mean discrepancy) for nonparametric hypothesis testing problems ...

Impressive piece of work by Soumya Mukherjee and Bharath Sriperumbudur: arxiv.org/abs/2502.20755
Minimal optimal kernel two-sample tests with random Fourier features.

03.03.2025 07:05 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image

Our Workshop on Uncertainty Quantification for Computer Vision goes to @cvprconference.bsky.social this year!
We have a super line-up of speakers and a call for papers.
This is a chance for your paper to shine at #CVPR2025

⏲️ Submission deadline: 14 March
πŸ’» Page: uncertainty-cv.github.io/2025/

28.02.2025 07:28 β€” πŸ‘ 33    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0
Post image

A new paper by Vovk that continues exploring properties of so-called "randomness predictors" (compared to "conformal predictors").
www.arxiv.org/abs/2502.19254

27.02.2025 23:25 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Preview
Volume estimates for unions of convex sets, and the Kakeya set conjecture in three dimensions We study sets of $Ξ΄$ tubes in $\mathbb{R}^3$, with the property that not too many tubes can be contained inside a common convex set $V$. We show that the union of tubes from such a set must have almos...

I am happy to announce that the Kakeya set conjecture, one of the most sought after open problems in geometric measure theory, has now been proven (in three dimensions) by Hong Wang and Joshua Zahl! arxiv.org/abs/2502.17655 I discuss some ideas of the proof at terrytao.wordpress.com/2025/02/25/t...

26.02.2025 04:49 β€” πŸ‘ 156    πŸ” 36    πŸ’¬ 0    πŸ“Œ 4

@andreapanizza is following 20 prominent accounts