Scores of R1, Flash-thinking, Claude 3.7, QwQ, o1-pro, o3-mini on USAMO 2025. Scores less than 5% of max score.
Tests on USAMO immediately after problems were posted yield surprisingly bad model performance. Suggests there's much more training on test than expected.
arxiv.org/abs/2503.219...
31.03.2025 19:08
Merry Christmas to all the cats enjoying their empty boxes this year! (And to everyone else too!)
25.12.2024 16:28
Merry Christmas.
Cartoon by the wonderful Liz Climo
25.12.2024 20:40
but the Greatest Gift of All is nine months of no Christmas music
25.12.2024 18:05
A weird thing with AI is that so much has gotten so mind blowing so quick we are kind of immune to it by now...
I mean, that AI can make weird but realistic videos and funny songs from a prompt like this is crazy!!!
25.12.2024 22:06
YouTube video by Last Week in AI
Current of Dreams - AI Music Video
My favorite AI creation so far
youtu.be/QIxwtzKAXTg?...
14.12.2024 20:58
thinking of calling this "The Illusion Illusion"
(more examples below)
01.12.2024 14:33
01.12.2024 20:49
you can post jokes on here if you want. it's legal
22.11.2024 12:14
Some machine learners were once children. Here's where you can find them:
go.bsky.app/F6mM37U
19.11.2024 23:31
Yeah, thanks!
19.11.2024 21:49
Hi #AI folks on Bluesky!
Been a looooong while since I stopped being active on Twitter/Mastodon, but hey let's give this a try and see if it sticks.
Here are some hashtags in case that does something: #AIResearch #DeepLearning #MachineLearning #Robotics #LLM
19.11.2024 18:54