
@rogergrosse.bsky.social

1,721 Followers  |  77 Following  |  6 Posts  |  Joined: 17.11.2024

Latest posts by rogergrosse.bsky.social on Bluesky

The Nintendo is closer in time to the first transistor than to today.

18.12.2024 15:36 | 👍 5    🔁 0    💬 0    📌 0

Conferences are basically a way for a group of people to temporarily have a lower opportunity cost on their time.

07.12.2024 13:28 | 👍 10    🔁 0    💬 1    📌 0
Post image

thinking of calling this "The Illusion Illusion"

(more examples below)

01.12.2024 14:33 | 👍 1584    🔁 387    💬 60    📌 91

Oh yeah, GANs, those were the days.

27.11.2024 19:05 | 👍 16    🔁 0    💬 1    📌 0
Post image

🚨 New #NeurIPS2024 paper “Training Data Attribution via Approximate Unrolling” 🚨

Introducing SOURCE: A method to understand how individual training examples influence neural net behavior, allowing us to make AI models more transparent and trustworthy!

📄 Full paper: openreview.net/pdf?id=3NaqG...

27.11.2024 17:41 | 👍 19    🔁 2    💬 1    📌 0

I just created a Project with a system prompt describing my interests and a doc with my publication list (titles + abstracts). Then I paste the email feed into the chat each day. Nothing fancy.

26.11.2024 20:01 | 👍 5    🔁 0    💬 0    📌 0

I have Claude filter my arXiv feed each day. It mostly works pretty well, except that it always hallucinates that "Studying LLM Generalization with Influence Functions" is in my feed and tells me I should read it.

26.11.2024 19:41 | 👍 9    🔁 0    💬 3    📌 0
Link preview: Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
The capabilities and limitations of Large Language Models have been sketched out in great detail in recent years, providing an intriguing yet conflicting picture. On the one hand, LLMs demonstrate a g...

Some very nice work from Cohere and UCL using influence functions to analyze math reasoning abilities in LLMs. Factual queries turn up docs containing the facts, but reasoning queries turn up similar cognitive strategies, suggesting generalization. arxiv.org/abs/2411.12580

22.11.2024 13:51 | 👍 16    🔁 2    💬 0    📌 0
