Since there is a decent amount of "Do we know how LLMs work?" discourse flowing around, I would be really interested to hear what people would accept as "understanding how LLMs work". What kind of knowledge are we speaking about here?
05.10.2025 14:58 β π 1 π 0 π¬ 0 π 0
Sometimes D&D adventures are hilariously badly written:
Players hear a voice telling them to bring NPC X. They have to bring NPC X to open the door. If NPC X died earlier in the adventure, X will wait for them behind the door.
So if X dies it is behind the door that can ONLY be opened by X???
05.10.2025 14:52 β π 1 π 0 π¬ 0 π 0
A lot of folks in the AI space are like people who insist itβs safe and smart to get drunk in the driverβs seat of their Tesla on a country road and a lot of their opponents are like people saying thereβs no such thing as an automatic transmission
05.10.2025 03:02 β π 280 π 36 π¬ 2 π 3
I don't understand how the Canadian AI/ML ecosystem wants to attract and retain talents when they are offering less then half the salary on representative roles...
03.10.2025 18:07 β π 1 π 0 π¬ 0 π 0
Put on arxiv before acceptance, yes or no?
03.10.2025 15:34 β π 1 π 0 π¬ 3 π 0
Really great rant!
03.10.2025 12:44 β π 2 π 0 π¬ 0 π 0
Enjoying the For You feed? Give it a like β‘ to help more people discover it: bsky.app/profile/did:...
The more people use it -> the more feedback we get -> the better we can make it for you.
19.07.2025 01:52 β π 1422 π 183 π¬ 26 π 52
A totally unrelated question: does anybody know how to make long equations work on mobile with math jax and Jekyll π
π
03.10.2025 01:13 β π 3 π 0 π¬ 1 π 0
Thank you β€οΈ
03.10.2025 01:09 β π 0 π 0 π¬ 0 π 0
a close up of a sad cat with the words pleeeaasse written below it
ALT: a close up of a sad cat with the words pleeeaasse written below it
cvoelcker.de/blog/2025/re...
I finally gave in and made a nice blog post about my most recent paper. This was a surprising amount of work, so please be nice and go read it!
02.10.2025 21:34 β π 27 π 7 π¬ 0 π 3
Congrats! May your GPU and space access live long and prosper!
02.10.2025 00:16 β π 3 π 0 π¬ 0 π 0
RL rant time after reading another LLM paper about whether "RL sharpens the distribution or discovers new knowledge.": RL is not magic. If your exploration policy takes an action with 0 probability, it can't explore that action! It trivially just affects the distribution of supported actions.
01.10.2025 22:54 β π 37 π 2 π¬ 3 π 0
Stanford and Berkeley are functionally equivalent places and I refuse to treat them as separate entities.
26.09.2025 19:58 β π 1 π 0 π¬ 0 π 0
But those are WOOOOOOORK :D
26.09.2025 16:55 β π 1 π 0 π¬ 0 π 0
Happy guy sad guy meme with sad text: USE PPO AND TUNE HYPERPARAMETER FOR WEEKS and happy text: USE REPPO AND GET A POLICY
I have been told I need to get more modern in my paper promotion! github.com/cvoelcker/reppo / arxiv.org/abs/2507.11019 @marcelhussing.bsky.social
26.09.2025 14:51 β π 10 π 2 π¬ 1 π 0
My grad school salary advise: find a loving partner before grad school, get them a work visa and a well-paid job in your school location, have them wildly out-earn you because they are brilliant and tada... sugar-partner! Tested for your convenience!
25.09.2025 17:25 β π 6 π 0 π¬ 1 π 0
I pitched this ~8 years ago! It was going to provide the services "AI consultancy" and "blockchain consultancy". Our highly trained consultants would say "no" when you ask about AI or blockchain, and then just give you a normal database and some working SQL.
24.09.2025 20:55 β π 7 π 0 π¬ 0 π 0
I know I'm like... 3 years late to the party, but wow, custom preamble prompts make chatgpt so much more useful.
24.09.2025 19:40 β π 3 π 0 π¬ 0 π 0
Sometimes I wonder how certain researchers get famous when _none_ of their results are replicable, even with their own published code?!
You may chose if I mean you in this rant...
24.09.2025 16:11 β π 6 π 0 π¬ 0 π 0
We've hired some *fantastic* researchers but our startup is still looking for 2-3 more people with skills in ML/RL/LLMs. If you'd like to work on some transformative applied problems, hit me up. We'll be launching publicly soon too...
23.09.2025 17:31 β π 37 π 8 π¬ 0 π 0
Happy Rosh Hashanah. May this year be better than the last one!
22.09.2025 21:18 β π 16 π 1 π¬ 0 π 0
Reminds me of my favourite high-dim stat tidbit: the more dimensions you measure, the less likely it is to be close to the center (or average across all of them). High-dim Gaussian is a ball.
22.09.2025 19:37 β π 5 π 0 π¬ 1 π 0
The only thing I want in life is a math textbook that tells me _why_ we need a thing and what we need it to look like, before it rigorously defines it. Why is it always the other way around???
22.09.2025 17:54 β π 8 π 0 π¬ 0 π 0
@tmlrorg.bsky.social I keep getting assigned to review papers where I know close to nothing about the subject area. Is there a way to change the paper matching algorithm (e.g. exclude some of my works) or refuse review?
21.09.2025 16:31 β π 1 π 0 π¬ 0 π 0
Proud to announce that Meta-World+ was accepted to NeurIPs, Datasets and Benchmarks! Meta-World is a common benchmark for multi-task and meta-RL research! However, it was very difficult to do effective science with Meta-World as different versions produce different results.
19.09.2025 23:21 β π 15 π 1 π¬ 1 π 3
Math and ML? Just looking to expand my reading list so I can feel guilty about not reading more books even more π
17.09.2025 00:16 β π 0 π 0 π¬ 0 π 0
Huge shoutout to π @axelbrunnbauer.bsky.social π who took the lead on developing our Atari integration while I was off getting married and chilling for the summer.
16.09.2025 13:29 β π 3 π 0 π¬ 1 π 0
Big if true π€«: #REPPO works on Atari as well π± πΎ π
Some tuning is still needed, but we are seeing results roughly on par with #PQN.
If you want to test out #REPPO (atari is not integrated due to issues with envpool and jax version), check out github.com/cvoelcker/re...
#reinforcementlearning
16.09.2025 13:29 β π 7 π 1 π¬ 1 π 0
Whatβs your favorite textbooks?
16.09.2025 00:45 β π 1 π 0 π¬ 1 π 1
engineer living in Seattle (posts never represent employer). Transfem person (she/they), liberal, autistic. RTs not endorsements. Here to make friends & talk about Chris Nolan films. Anti-doomer. None of us are immune to the effects of social media.
Professor of Computer Vision/Machine Learning at Imagine/LIGM, Γcole nationale des Ponts et ChaussΓ©es @ecoledesponts.bsky.social Music & overall happiness π³πͺ» Born well below 350ppm
πParis π https://davidpicard.github.io/
Researcher on MDPs and RL. Retired prof. #orms #rl
Transactions on Machine Learning Research (TMLR) is a new venue for dissemination of machine learning research
https://jmlr.org/tmlr/
the economics of clairo blog post coming soon
Blogging at https://someunpleasant.substack.com/
Como todos los hombres de Babilonia, he sido procΓ³nsul; como todos, esclavo; tambiΓ©n he conocido la omnipotencia, el oprobio, las cΓ‘rceles.
very sane ai newsletter: verysane.ai
PhD candidate @polimi | Reinforcement Learning @rl3polimi | I do stuff, I see stuff. Some with purpose, most by chance.
https://ricczamboni.github.io
Reinforcement Learning PhD Student at the University of Tokyo, Prev: Intern at Sakana AI, PFN, M.Sc/B.Sc. from TU Munich
johannesack.github.io
Kempner Institute research fellow @Harvard interested in scaling up (deep) reinforcement learning theories of human cognition
prev: deepmind, umich, msr
https://cogscikid.com/
Wir schauen uns hier gerade um. https://www.queer.de
Final year Ph.D. candidate in NLP, CV at JHU. Researching transparent reasoning, multimodality, and fact verification. #NLProc
https://katesanders9.github.io/
PhD Candidate at Toronto Metropolitan University. Reinforcement learning π, machine learning. He/him.
reggiemclean.ca
princeton physics phd
mit '23 physics + math
RL, interpretable AI4Science, stat phys
Intensivist, ethicist, epidemiologist, math enthusiast. Es kΓΆnnte auch anders sein. (I call them tweets and they're my own)
Subscribe to my tech and online culture newsletter UserMag.co
Listen/watch Power User podcast on all platforms!!
Support my work on Patreon: https://www.patreon.com/c/taylorlorenz
Ex-philosopher, ex-Tweeter.
Email: info@contrapoints.com
Independent journalist covering internet culture, politics, and media @spitfirenews.com. Buy me a coffee: https://ko-fi.com/kattenbarge
Bilder, Geschichten, Begegnungen, Kunst, Kultur, Fotografie, Musik, Reisen, Buddhismus (Zen)β¦ liebe das #Meer und das #Ruhrgebiet!! Mit attestierter HochsensibilitΓ€tβ¦
#wirsindmehr #noafd