Max Bartolo's Avatar

Max Bartolo

@maxbartolo.bsky.social

Building robust LLMs @Cohere

297 Followers  |  27 Following  |  15 Posts  |  Joined: 20.11.2024  |  1.7366

Latest posts by maxbartolo.bsky.social on Bluesky

Post image

Thrilled to share our new preprint on Reinforcement Learning for Reverse Engineering (RLRE) πŸš€

We demonstrate that human preferences can be reverse engineered effectively by pipelining LLMs to optimise upstream preambles via reinforcement learning πŸ§΅β¬‡οΈ

22.05.2025 15:01 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Massive shoutout to all our fantastic contributors, collaborators and partners who made this possible! πŸ™

27.03.2025 15:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Model weights are available for research purposes at:
πŸ”— Command A: huggingface.co/CohereForAI/...
πŸ”—Command R7B: huggingface.co/CohereForAI/...

27.03.2025 15:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ“„ You can find the full tech report at cohere.com/research/pap...

27.03.2025 15:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

I'm excited to share the tech report for our @cohere.com @cohereforai.bsky.social Command A and Command R7B models. We highlight our novel approach to model training including self-refinement algorithms and model merging techniques at scale. Read more below! ⬇️

27.03.2025 15:01 β€” πŸ‘ 11    πŸ” 4    πŸ’¬ 1    πŸ“Œ 3
Post image

I really enjoyed my MLST chat with Tim @neuripsconf.bsky.social about the research we've been doing on reasoning, robustness and human feedback. If you have an hour to spare and are interested in AI robustness, it may be worth a listen 🎧

Check it out at youtu.be/DL7qwmWWk88?...

19.03.2025 15:11 β€” πŸ‘ 8    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

That's very cool! There's definitely a lot happening in the space and most people are doing some version of this, but I haven't come across a well-organised collection of tools like this yet -- could be quite impactful!

10.03.2025 17:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Check out @lisaalaz.bsky.social's internship work with us @cohere.com questioning the rationale behind rationales πŸ”₯

13.02.2025 16:18 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Super excited to see PRISM recognised as a #NeurIPS2024 best paper. This was an incredible large-scale effort by @hannahrosekirk.bsky.social and fantastic collaborators. If you're interested in human feedback, check it out, there are 100+ pages of detailed insights! πŸ”₯

11.12.2024 16:23 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Our paper PRISM alignment won a best paper award at #neurips2024!

All credits to @hannahrosekirk.bsky.social A.Whitefield, P.RΓΆttger, A.M.Bean, K.Margatina, R.Mosquera-Gomez, J.Ciro, @maxbartolo.bsky.social H.He, B.Vidgen, S.Hale

Catch Hannah tomorrow at neurips.cc/virtual/2024/poster/97804

11.12.2024 16:20 β€” πŸ‘ 67    πŸ” 9    πŸ’¬ 2    πŸ“Œ 0

Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! 🧞

04.12.2024 16:13 β€” πŸ‘ 94    πŸ” 18    πŸ’¬ 3    πŸ“Œ 3
Preview
an advertisement for vancouver in british columbia canada ALT: an advertisement for vancouver in british columbia canada

Looking forward to @neuripsconf.bsky.social #NeurIPS #NeurIPS2024 in Vancouver next week! ❄️

Reach out (or pop by the @cohere.com booth) if you want to chat about human feedback, robustness and reasoning, prompt optimisation, adversarial data, glitch tokens, evaluation, or anything else!

02.12.2024 17:11 β€” πŸ‘ 11    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Couldn't agree with you more, Laura is incredible!

01.12.2024 12:11 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sparks of multi-hop reasoning ✨

29.11.2024 09:41 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Fun to see Douwe's Dynabench plot continue to inspire new groundbreaking benchmarking work!

24.11.2024 22:11 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Awesome, thanks!

20.11.2024 23:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@mariaa.bsky.social I'm new here so apologies if this is a noob question, but is there a way I can recommend folks to be added to starter packs?

20.11.2024 23:41 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🚨 LLMs can learn to reason from procedural knowledge in pretraining data! 🚨 I particularly enjoy research where the evidence contradicts our initial hypothesis. If you're interested in LLM reasoning, check out the 60+ pages of in-depth work at arxiv.org/abs/2411.12580

20.11.2024 17:21 β€” πŸ‘ 67    πŸ” 7    πŸ’¬ 4    πŸ“Œ 1

We launched Judge Arena with @huggingface.bsky.social
@clefourrier.bsky.social - a platform that lets you easily compare models as judges side-by-side and vote for the best evaluation

Check out the live leaderboard and start voting now πŸ€—

19.11.2024 19:08 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1

@maxbartolo is following 19 prominent accounts