Max Bartolo @maxbartolo - Bluesky Profile

Thrilled to share our new preprint on Reinforcement Learning for Reverse Engineering (RLRE) 🚀

We demonstrate that human preferences can be reverse engineered effectively by pipelining LLMs to optimise upstream preambles via reinforcement learning 🧵⬇️

22.05.2025 15:01 — 👍 9 🔁 1 💬 1 📌 0

Massive shoutout to all our fantastic contributors, collaborators and partners who made this possible! 🙏

27.03.2025 15:01 — 👍 1 🔁 0 💬 0 📌 0

Model weights are available for research purposes at:
🔗 Command A: huggingface.co/CohereForAI/...
🔗Command R7B: huggingface.co/CohereForAI/...

27.03.2025 15:01 — 👍 1 🔁 0 💬 1 📌 0

📄 You can find the full tech report at cohere.com/research/pap...

27.03.2025 15:01 — 👍 1 🔁 0 💬 1 📌 0

I'm excited to share the tech report for our @cohere.com @cohereforai.bsky.social Command A and Command R7B models. We highlight our novel approach to model training including self-refinement algorithms and model merging techniques at scale. Read more below! ⬇️

27.03.2025 15:01 — 👍 11 🔁 4 💬 1 📌 3

I really enjoyed my MLST chat with Tim @neuripsconf.bsky.social about the research we've been doing on reasoning, robustness and human feedback. If you have an hour to spare and are interested in AI robustness, it may be worth a listen 🎧

Check it out at youtu.be/DL7qwmWWk88?...

19.03.2025 15:11 — 👍 8 🔁 3 💬 0 📌 0

That's very cool! There's definitely a lot happening in the space and most people are doing some version of this, but I haven't come across a well-organised collection of tools like this yet -- could be quite impactful!

10.03.2025 17:27 — 👍 1 🔁 0 💬 0 📌 0

Check out @lisaalaz.bsky.social's internship work with us @cohere.com questioning the rationale behind rationales 🔥

13.02.2025 16:18 — 👍 4 🔁 1 💬 0 📌 0

Super excited to see PRISM recognised as a #NeurIPS2024 best paper. This was an incredible large-scale effort by @hannahrosekirk.bsky.social and fantastic collaborators. If you're interested in human feedback, check it out, there are 100+ pages of detailed insights! 🔥

11.12.2024 16:23 — 👍 9 🔁 1 💬 0 📌 0

Our paper PRISM alignment won a best paper award at #neurips2024!

All credits to @hannahrosekirk.bsky.social A.Whitefield, P.Röttger, A.M.Bean, K.Margatina, R.Mosquera-Gomez, J.Ciro, @maxbartolo.bsky.social H.He, B.Vidgen, S.Hale

Catch Hannah tomorrow at neurips.cc/virtual/2024/poster/97804

11.12.2024 16:20 — 👍 67 🔁 9 💬 2 📌 0

Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! 🧞

04.12.2024 16:13 — 👍 96 🔁 18 💬 3 📌 3

an advertisement for vancouver in british columbia canada ALT: an advertisement for vancouver in british columbia canada

Looking forward to @neuripsconf.bsky.social #NeurIPS #NeurIPS2024 in Vancouver next week! ❄️

Reach out (or pop by the @cohere.com booth) if you want to chat about human feedback, robustness and reasoning, prompt optimisation, adversarial data, glitch tokens, evaluation, or anything else!

02.12.2024 17:11 — 👍 10 🔁 0 💬 0 📌 0

Couldn't agree with you more, Laura is incredible!

01.12.2024 12:11 — 👍 3 🔁 0 💬 0 📌 0

Sparks of multi-hop reasoning ✨

29.11.2024 09:41 — 👍 8 🔁 2 💬 0 📌 0

Fun to see Douwe's Dynabench plot continue to inspire new groundbreaking benchmarking work!

24.11.2024 22:11 — 👍 4 🔁 0 💬 0 📌 0

Awesome, thanks!

20.11.2024 23:45 — 👍 1 🔁 0 💬 0 📌 0

@mariaa.bsky.social I'm new here so apologies if this is a noob question, but is there a way I can recommend folks to be added to starter packs?

20.11.2024 23:41 — 👍 1 🔁 0 💬 1 📌 0

🚨 LLMs can learn to reason from procedural knowledge in pretraining data! 🚨 I particularly enjoy research where the evidence contradicts our initial hypothesis. If you're interested in LLM reasoning, check out the 60+ pages of in-depth work at arxiv.org/abs/2411.12580

20.11.2024 17:21 — 👍 67 🔁 7 💬 4 📌 1

We launched Judge Arena with @huggingface.bsky.social
@clefourrier.bsky.social - a platform that lets you easily compare models as judges side-by-side and vote for the best evaluation

Check out the live leaderboard and start voting now 🤗

19.11.2024 19:08 — 👍 10 🔁 3 💬 0 📌 1

Max Bartolo

Latest posts by maxbartolo.bsky.social on Bluesky

@maxbartolo is following 19 prominent accounts