Claas Voelcker's Avatar

Claas Voelcker

@cvoelcker.bsky.social

Thou shalt not disfigure the soul! The square root of a goat is one If I seem very angry, check if I have been watered in the last 24 hours. For professional, see https://cvoelcker.de

2,607 Followers  |  385 Following  |  538 Posts  |  Joined: 08.10.2023  |  2.0687

Latest posts by cvoelcker.bsky.social on Bluesky

Since there is a decent amount of "Do we know how LLMs work?" discourse flowing around, I would be really interested to hear what people would accept as "understanding how LLMs work". What kind of knowledge are we speaking about here?

05.10.2025 14:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sometimes D&D adventures are hilariously badly written:

Players hear a voice telling them to bring NPC X. They have to bring NPC X to open the door. If NPC X died earlier in the adventure, X will wait for them behind the door.

So if X dies it is behind the door that can ONLY be opened by X???

05.10.2025 14:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A lot of folks in the AI space are like people who insist it’s safe and smart to get drunk in the driver’s seat of their Tesla on a country road and a lot of their opponents are like people saying there’s no such thing as an automatic transmission

05.10.2025 03:02 β€” πŸ‘ 280    πŸ” 36    πŸ’¬ 2    πŸ“Œ 3

I don't understand how the Canadian AI/ML ecosystem wants to attract and retain talents when they are offering less then half the salary on representative roles...

03.10.2025 18:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Put on arxiv before acceptance, yes or no?

03.10.2025 15:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0

Really great rant!

03.10.2025 12:44 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Enjoying the For You feed? Give it a like β™‘ to help more people discover it: bsky.app/profile/did:...

The more people use it -> the more feedback we get -> the better we can make it for you.

19.07.2025 01:52 β€” πŸ‘ 1422    πŸ” 183    πŸ’¬ 26    πŸ“Œ 52

A totally unrelated question: does anybody know how to make long equations work on mobile with math jax and Jekyll πŸ˜…πŸ™ˆ

03.10.2025 01:13 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Thank you ❀️

03.10.2025 01:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
a close up of a sad cat with the words pleeeaasse written below it ALT: a close up of a sad cat with the words pleeeaasse written below it

cvoelcker.de/blog/2025/re...

I finally gave in and made a nice blog post about my most recent paper. This was a surprising amount of work, so please be nice and go read it!

02.10.2025 21:34 β€” πŸ‘ 27    πŸ” 7    πŸ’¬ 0    πŸ“Œ 3
Relative Entropy Pathwise Policy Optimization - Technical Overview | Claas A. Voelcker A lightweight overview of the new REPPO algorithm

cvoelcker.de/blog/2025/re...

Here ya go!

02.10.2025 21:31 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Congrats! May your GPU and space access live long and prosper!

02.10.2025 00:16 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

RL rant time after reading another LLM paper about whether "RL sharpens the distribution or discovers new knowledge.": RL is not magic. If your exploration policy takes an action with 0 probability, it can't explore that action! It trivially just affects the distribution of supported actions.

01.10.2025 22:54 β€” πŸ‘ 37    πŸ” 2    πŸ’¬ 3    πŸ“Œ 0

Stanford and Berkeley are functionally equivalent places and I refuse to treat them as separate entities.

26.09.2025 19:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

But those are WOOOOOOORK :D

26.09.2025 16:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Happy guy sad guy meme with sad text: USE PPO AND TUNE HYPERPARAMETER FOR WEEKS and happy text: USE REPPO AND GET A POLICY

Happy guy sad guy meme with sad text: USE PPO AND TUNE HYPERPARAMETER FOR WEEKS and happy text: USE REPPO AND GET A POLICY

I have been told I need to get more modern in my paper promotion! github.com/cvoelcker/reppo / arxiv.org/abs/2507.11019 @marcelhussing.bsky.social

26.09.2025 14:51 β€” πŸ‘ 10    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

My grad school salary advise: find a loving partner before grad school, get them a work visa and a well-paid job in your school location, have them wildly out-earn you because they are brilliant and tada... sugar-partner! Tested for your convenience!

25.09.2025 17:25 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I pitched this ~8 years ago! It was going to provide the services "AI consultancy" and "blockchain consultancy". Our highly trained consultants would say "no" when you ask about AI or blockchain, and then just give you a normal database and some working SQL.

24.09.2025 20:55 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I know I'm like... 3 years late to the party, but wow, custom preamble prompts make chatgpt so much more useful.

24.09.2025 19:40 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sometimes I wonder how certain researchers get famous when _none_ of their results are replicable, even with their own published code?!

You may chose if I mean you in this rant...

24.09.2025 16:11 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We've hired some *fantastic* researchers but our startup is still looking for 2-3 more people with skills in ML/RL/LLMs. If you'd like to work on some transformative applied problems, hit me up. We'll be launching publicly soon too...

23.09.2025 17:31 β€” πŸ‘ 37    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0

Happy Rosh Hashanah. May this year be better than the last one!

22.09.2025 21:18 β€” πŸ‘ 16    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Reminds me of my favourite high-dim stat tidbit: the more dimensions you measure, the less likely it is to be close to the center (or average across all of them). High-dim Gaussian is a ball.

22.09.2025 19:37 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The only thing I want in life is a math textbook that tells me _why_ we need a thing and what we need it to look like, before it rigorously defines it. Why is it always the other way around???

22.09.2025 17:54 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@tmlrorg.bsky.social I keep getting assigned to review papers where I know close to nothing about the subject area. Is there a way to change the paper matching algorithm (e.g. exclude some of my works) or refuse review?

21.09.2025 16:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.

19.09.2025 23:21 β€” πŸ‘ 15    πŸ” 1    πŸ’¬ 1    πŸ“Œ 3

Math and ML? Just looking to expand my reading list so I can feel guilty about not reading more books even more πŸ˜‚

17.09.2025 00:16 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Huge shoutout to πŸ‘‘ @axelbrunnbauer.bsky.social πŸ‘‘ who took the lead on developing our Atari integration while I was off getting married and chilling for the summer.

16.09.2025 13:29 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Big if true 🀫: #REPPO works on Atari as well 😱 πŸ‘Ύ πŸš€

Some tuning is still needed, but we are seeing results roughly on par with #PQN.

If you want to test out #REPPO (atari is not integrated due to issues with envpool and jax version), check out github.com/cvoelcker/re...

#reinforcementlearning

16.09.2025 13:29 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

What’s your favorite textbooks?

16.09.2025 00:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

@cvoelcker is following 20 prominent accounts