Thrilled to share our new preprint on Reinforcement Learning for Reverse Engineering (RLRE) π
We demonstrate that human preferences can be reverse engineered effectively by pipelining LLMs to optimise upstream preambles via reinforcement learning π§΅β¬οΈ
22.05.2025 15:01 β π 9 π 1 π¬ 1 π 0
Massive shoutout to all our fantastic contributors, collaborators and partners who made this possible! π
27.03.2025 15:01 β π 1 π 0 π¬ 0 π 0
Model weights are available for research purposes at:
π Command A: huggingface.co/CohereForAI/...
πCommand R7B: huggingface.co/CohereForAI/...
27.03.2025 15:01 β π 1 π 0 π¬ 1 π 0
π You can find the full tech report at cohere.com/research/pap...
27.03.2025 15:01 β π 1 π 0 π¬ 1 π 0
I'm excited to share the tech report for our @cohere.com @cohereforai.bsky.social Command A and Command R7B models. We highlight our novel approach to model training including self-refinement algorithms and model merging techniques at scale. Read more below! β¬οΈ
27.03.2025 15:01 β π 11 π 4 π¬ 1 π 3
I really enjoyed my MLST chat with Tim @neuripsconf.bsky.social about the research we've been doing on reasoning, robustness and human feedback. If you have an hour to spare and are interested in AI robustness, it may be worth a listen π§
Check it out at youtu.be/DL7qwmWWk88?...
19.03.2025 15:11 β π 8 π 3 π¬ 0 π 0
That's very cool! There's definitely a lot happening in the space and most people are doing some version of this, but I haven't come across a well-organised collection of tools like this yet -- could be quite impactful!
10.03.2025 17:27 β π 1 π 0 π¬ 0 π 0
Check out @lisaalaz.bsky.social's internship work with us @cohere.com questioning the rationale behind rationales π₯
13.02.2025 16:18 β π 4 π 1 π¬ 0 π 0
Super excited to see PRISM recognised as a #NeurIPS2024 best paper. This was an incredible large-scale effort by @hannahrosekirk.bsky.social and fantastic collaborators. If you're interested in human feedback, check it out, there are 100+ pages of detailed insights! π₯
11.12.2024 16:23 β π 9 π 1 π¬ 0 π 0
Our paper PRISM alignment won a best paper award at #neurips2024!
All credits to @hannahrosekirk.bsky.social A.Whitefield, P.RΓΆttger, A.M.Bean, K.Margatina, R.Mosquera-Gomez, J.Ciro, @maxbartolo.bsky.social H.He, B.Vidgen, S.Hale
Catch Hannah tomorrow at neurips.cc/virtual/2024/poster/97804
11.12.2024 16:20 β π 67 π 9 π¬ 2 π 0
Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! π§
04.12.2024 16:13 β π 94 π 18 π¬ 3 π 3
an advertisement for vancouver in british columbia canada
ALT: an advertisement for vancouver in british columbia canada
Looking forward to @neuripsconf.bsky.social #NeurIPS #NeurIPS2024 in Vancouver next week! βοΈ
Reach out (or pop by the @cohere.com booth) if you want to chat about human feedback, robustness and reasoning, prompt optimisation, adversarial data, glitch tokens, evaluation, or anything else!
02.12.2024 17:11 β π 11 π 0 π¬ 0 π 0
Couldn't agree with you more, Laura is incredible!
01.12.2024 12:11 β π 3 π 0 π¬ 0 π 0
Sparks of multi-hop reasoning β¨
29.11.2024 09:41 β π 9 π 2 π¬ 0 π 0
Fun to see Douwe's Dynabench plot continue to inspire new groundbreaking benchmarking work!
24.11.2024 22:11 β π 4 π 0 π¬ 0 π 0
Awesome, thanks!
20.11.2024 23:45 β π 1 π 0 π¬ 0 π 0
@mariaa.bsky.social I'm new here so apologies if this is a noob question, but is there a way I can recommend folks to be added to starter packs?
20.11.2024 23:41 β π 1 π 0 π¬ 1 π 0
π¨ LLMs can learn to reason from procedural knowledge in pretraining data! π¨ I particularly enjoy research where the evidence contradicts our initial hypothesis. If you're interested in LLM reasoning, check out the 60+ pages of in-depth work at arxiv.org/abs/2411.12580
20.11.2024 17:21 β π 67 π 7 π¬ 4 π 1
We launched Judge Arena with @huggingface.bsky.social
@clefourrier.bsky.social - a platform that lets you easily compare models as judges side-by-side and vote for the best evaluation
Check out the live leaderboard and start voting now π€
19.11.2024 19:08 β π 10 π 3 π¬ 0 π 1
Strengthening Europe's Leadership in AI through Research Excellence | ellis.eu
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
We build secure, scalable, and private enterprise-grade AI technology to solve real-world business problems. Join us: http://cohere.com/careers
Senior Research Scientist at Cohere. PhD at UCL. He/him.
Lead pre-training @Cohere
Research Scientist @ Google DeepMind. Previously @ OpenAI. Building AGI. π€
PhD supervised by Tim RocktΓ€schel and Ed Grefenstette, part time at Cohere. Language and LLMs. Spent time at FAIR, Google, and NYU (with Brenden Lake). She/her.
Breakthrough AI to solve the world's biggest problems.
βΊ Join us: http://allenai.org/careers
βΊ Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm
βοΈ Assistant Professor of Computer Science at CU Boulder π©βπ» NLP, cultural analytics, narratives, online communities π https://maria-antoniak.github.io π¬ books, bikes, games, art
Natural Language Processing PhD Student @ Heidelberg University.
https://schumann.pub
#NLP #NLProc #ML #AI
Professor at the University of Copenhagen. Explainable AI, Natural Language Processing, ML. Head of copenlu.bsky.social lab.
#NLProc #NLP #XAI
http://isabelleaugenstein.github.io/
Assistant Professor confused by the concept of consciousness but talkingtorobots.com in the meantime
NYU professor, Google research scientist. Good at LaTeX.
Assistant Professor in Computer Science at USC | NLP, ML
Postdoc @ai2.bsky.social & @uwnlp.bsky.social
Associate professor at IT University of Copenhagen: NLP, language models, interpretability, AI & society. Co-editor-in-chief of ACL Rolling Review. #NLProc #NLP
Asst prof @ University of Utah Β· NLP Β· she/her ππ·