ββI feel like ARC is goodhart's law all over again. As soon as people started targetting it, we started beating it.
20.12.2024 18:10 β π 4 π 0 π¬ 0 π 0@voxmenthe.bsky.social
ML Research Engineer - occasional posts on NLP, RL, ML, LLMs etc Also occasional random stuff
ββI feel like ARC is goodhart's law all over again. As soon as people started targetting it, we started beating it.
20.12.2024 18:10 β π 4 π 0 π¬ 0 π 0Very useful tool!
04.12.2024 08:47 β π 3 π 0 π¬ 0 π 0Explain key equations in an intuitive yet rigorous way. More on how results achieved, less on result details. Outro reviews technical details of the method. No small talk. No gushing, or broad implications. Just focused and engaging technical discussion.
Works pretty well!
3/3
Podcast for AI researchers. Experts ask advanced questions. Systematically cover each section. Explain insights, methods and how to implement. Cover the paper in detail, especially the mechanics and components of how the method is implemented.
2/3
I spent a looong time iterating on this prompt for customizing NotebookLM notebooklm.google.com to do high quality summaries of research papers:
1/3
Yes so much of LLM capability still comes down to properly constructed and curated data
27.11.2024 18:46 β π 0 π 0 π¬ 0 π 0So maybe agentic systems need a strong set of subroutines designed to leverage external information sources, apply different reasoning paths - i.e. think more outside the box
23.11.2024 23:13 β π 1 π 0 π¬ 2 π 0I am really curious *why* - like what specifically is it that the humans do with the the really long periods that AI doesn't?
23.11.2024 22:49 β π 0 π 0 π¬ 1 π 0I found this post from @hamel.bsky.social helpful: hamel.dev/blog/posts/l...
23.11.2024 22:31 β π 1 π 0 π¬ 0 π 0Playing with silent CoT prompts today. I don't expect much but you never know....
23.11.2024 22:29 β π 2 π 0 π¬ 0 π 0I'd love to see more people explore this! I'm surprised more people aren't doing something like what you mentioned with using it with standard pretrained LLMs. I could imagine some interesting combinations with sampling strategies @xjdr.bsky.social
23.11.2024 04:01 β π 3 π 0 π¬ 0 π 0Sonnet still really impressive on this "benchmark"
23.11.2024 02:27 β π 0 π 0 π¬ 0 π 0