Claas Voelcker

@cvoelcker.bsky.social

For professional, see https://cvoelcker.de If I seem very angry, check if I have been watered in the last 24 hours. Now 🇺🇸 flavoured, previously available in 🇨🇦 and 🇩🇪

2,694 Followers  |  555 Following  |  816 Posts  |  Joined: 08.10.2023  |  1.9075

Latest posts by cvoelcker.bsky.social on Bluesky


The real teacher forcing was the saccharine "solarpunk" stories I made you rehearse.

26.02.2026 05:13 | 👍 0    🔁 0    💬 0    📌 0

Submit your RL papers to RLC!
This is now perhaps the best venue for RL researchers.

26.02.2026 00:04 | 👍 7    🔁 3    💬 1    📌 0

I still think the best alignment strategy is to write a lot of really hopeful and optimistic fiction about AI so that this saturates the pretraining datasets and future AI will be forced to roleplay the most benevolent versions of themselves we can think of.

25.02.2026 18:45 | 👍 6    🔁 0    💬 2    📌 0

Companies which make papers a hiring bonus should be told to p**s off. We are drowning in students who want to do "research" to get hired by Google... It's soul crushing

25.02.2026 18:38 | 👍 1    🔁 0    💬 0    📌 0

Every time I read too much about prompting, CLAUDE files, skills, etc, I feel the need to remind people that humans have an infinite capacity for magical thinking. The Romans were also REALLY convinced that there was a correct way to sacrifice a goat to ensure a good harvest...

24.02.2026 21:44 | 👍 2    🔁 0    💬 0    📌 0

Yes, I bow down to the infinite big brain that has to be behind that incomprehensible mess 😂😂😂

24.02.2026 18:32 | 👍 0    🔁 0    💬 0    📌 0

Ok, but you haven’t even begun to properly make fun of the fact that in high school, we randomly switch to 5-15 (0-5 technically exist, but are all failing grades) and then high school final grades are translated back to 1.0-5.0!

24.02.2026 18:27 | 👍 2    🔁 0    💬 1    📌 0

Austin on a Tuesday? No chance

24.02.2026 14:12 | 👍 0    🔁 0    💬 0    📌 0

What is the maximum time between submitting a slurm job and it actually starting that people would find ok for regular research progress?

23.02.2026 23:17 | 👍 1    🔁 0    💬 0    📌 0

Which proves that the issue is mostly off-policy bootstrapping, which is provably … difficult 😅

23.02.2026 06:12 | 👍 0    🔁 0    💬 0    📌 0

Claude jumps to conclusions faster than the most hyperactive undergrad I have ever worked with...

22.02.2026 18:50 | 👍 0    🔁 0    💬 0    📌 0

That's probably fair, I just thought to mention it cause Droid and OXE are also mostly just used for training (unless I'm wrong there)

22.02.2026 03:34 | 👍 1    🔁 0    💬 0    📌 0

My patience with graduates whose reaction to "AI will crash the economy" is not "oh, we should work hard on open source, policy, and figuring out how to help each other" but "f**k, you need to run through the doors so we can slam them closed faster" has dropped to negative values. You have FAILED!

21.02.2026 16:49 | 👍 2    🔁 1    💬 0    📌 0

If you join an AI company because you need to be part of the few who will prosper as society collapses from economic turmoil, you have failed as a person and should be shunned. If your students express this attitude, you should sit them down and talk about responsibilities.
Build a better world!

21.02.2026 16:49 | 👍 4    🔁 0    💬 1    📌 0

Nice overview! I think the IsaacSim/Lab universe deserves a shoutout for locomotion training

21.02.2026 03:23 | 👍 3    🔁 0    💬 1    📌 0

Not being on twitter and facing the full ngmi doom helps MASSIVELY with not having this crisis

21.02.2026 01:54 | 👍 0    🔁 0    💬 0    📌 0

Have you noticed how navigation apps include walking & waiting for public transit, but exclude parking & walking for driving? After being late a few times 😅, we finally did. We got curious: what if these apps account for parking?

19.02.2026 15:39 | 👍 75    🔁 33    💬 6    📌 3

I forgot that every LLM's logo was a butthole

20.02.2026 14:33 | 👍 845    🔁 164    💬 28    📌 2

That would be my question: how often are candidates with three papers even invited? Random sampling at both UofT and UT Austin suggests… never

20.02.2026 05:18 | 👍 1    🔁 0    💬 0    📌 0

this is my reaction as well: we just need to hire more faculty to advise them

20.02.2026 04:15 | 👍 12    🔁 2    💬 1    📌 0

Ah I understand your point! In many such cases you can still probably point to the 5 most pivotal works that integrate the smaller steps?

20.02.2026 01:50 | 👍 2    🔁 0    💬 1    📌 0

Even then, a student from a rarely publishing lab will often barely generate attention on 5 papers if that is all they got. Since universities run on citations, you are screwed, even if the work is good.

20.02.2026 01:41 | 👍 0    🔁 0    💬 0    📌 0

Impact is not independent from the attention paper mills produce. A good paper does not create impact on its own.

20.02.2026 01:40 | 👍 0    🔁 0    💬 1    📌 0

I had one call per interested lab (I believe it was 4 or 5) and then I got an offer. That was in the 2019/20 season

19.02.2026 18:21 | 👍 1    🔁 0    💬 0    📌 0

We're thrilled to share that the Call for Workshops for this year's @rl-conference.bsky.social is now live!

As Workshop co-chair (alongside the wonderful Raksha Kumaraswamy and @claireve.bsky.social) we are looking forward to seeing the proposals for workshops that we receive.

LINK IN NEXT POST

13.02.2026 21:50 | 👍 11    🔁 5    💬 1    📌 2

I assume you enjoy ugly crying?

18.02.2026 04:10 | 👍 0    🔁 0    💬 1    📌 0

PSA: If your hands hurt from cutting spicy chillies, do not, I repeat, DO NOT, scratch your nose...

18.02.2026 03:54 | 👍 6    🔁 0    💬 1    📌 0

My hypothesis: A lot is visibility especially re jobs. In addition, CS grad students are (forced to be) very unpolitical, and this place isn’t really perceived as such.

16.02.2026 05:56 | 👍 5    🔁 0    💬 0    📌 0

I’ve been thinking about a practical question and would love some opinions:

How do your papers actually get discovered/cited?

I was searching for recent work on high update ratio RL and found several very closely related papers tackling the same failure modes we study. None cited our earlier work.

14.02.2026 23:13 | 👍 9    🔁 2    💬 5    📌 0

🚀 Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵👇

13.02.2026 19:28 | 👍 25    🔁 10    💬 1    📌 0
