Jeff Coggshall @NeurIPS's Avatar

Jeff Coggshall @NeurIPS

@voxmenthe.bsky.social

ML Research Engineer - occasional posts on NLP, RL, ML, LLMs etc Also occasional random stuff

353 Followers  |  3,072 Following  |  12 Posts  |  Joined: 08.11.2024  |  1.951

Latest posts by voxmenthe.bsky.social on Bluesky

​​I feel like ARC is goodhart's law all over again. As soon as people started targetting it, we started beating it.

20.12.2024 18:10 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Very useful tool!

04.12.2024 08:47 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Explain key equations in an intuitive yet rigorous way. More on how results achieved, less on result details. Outro reviews technical details of the method. No small talk. No gushing, or broad implications. Just focused and engaging technical discussion.

Works pretty well!

3/3

04.12.2024 06:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Podcast for AI researchers. Experts ask advanced questions. Systematically cover each section. Explain insights, methods and how to implement. Cover the paper in detail, especially the mechanics and components of how the method is implemented.

2/3

04.12.2024 06:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I spent a looong time iterating on this prompt for customizing NotebookLM notebooklm.google.com to do high quality summaries of research papers:

1/3

04.12.2024 06:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yes so much of LLM capability still comes down to properly constructed and curated data

27.11.2024 18:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

So maybe agentic systems need a strong set of subroutines designed to leverage external information sources, apply different reasoning paths - i.e. think more outside the box

23.11.2024 23:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

I am really curious *why* - like what specifically is it that the humans do with the the really long periods that AI doesn't?

23.11.2024 22:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Creating a LLM-as-a-Judge That Drives Business Results – A step-by-step guide with my learnings from 30+ AI implementations.

I found this post from @hamel.bsky.social helpful: hamel.dev/blog/posts/l...

23.11.2024 22:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Playing with silent CoT prompts today. I don't expect much but you never know....

23.11.2024 22:29 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'd love to see more people explore this! I'm surprised more people aren't doing something like what you mentioned with using it with standard pretrained LLMs. I could imagine some interesting combinations with sampling strategies @xjdr.bsky.social

23.11.2024 04:01 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sonnet still really impressive on this "benchmark"

23.11.2024 02:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@voxmenthe is following 20 prominent accounts