Cohere Labs's Avatar

Cohere Labs

@cohereforai.bsky.social

@Cohere.com's non-profit research lab and open science initiative that seeks to solve complex machine learning problems. Join us in exploring the unknown, together. https://cohere.com/research

461 Followers  |  12 Following  |  169 Posts  |  Joined: 10.12.2024  |  2.0476

Latest posts by cohereforai.bsky.social on Bluesky

πŸ“œPaper link: https://arxiv.org/abs/2510.00931

Led by: Ammar Khairi, @juliakreutzer.bsky.social, Daniel D'souza @mziizm.bsky.social

02.10.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We are excited to present FusioN as a plug-and-play replacement to Best-of-N, shifting from a monolithic selection framework to collaborative synthesis one that embraces the diverse strengths of today’s leading open LLMs.

02.10.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

How does FusioN use the same sample pool more effectively than BoN?

🧩While BoN picks just one sample per problem, FusioN synthesises one output from all samples – treating them as collaborators whose strengths can be integrated, not competitors in a zero-sum game.

02.10.2025 10:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Want the wisdom-of-the-crowd in 1 model?

πŸ§‘β€πŸŽ“πŸ§‘πŸ½β€πŸŽ“πŸ‘¨πŸΎβ€πŸŽ“Fusion-of-N distills multiple teachers into richer synthetic data than BoN, training students that achieve bigger downstream gains, even surpassing teachers on multilingual factual reasoning 🌎

02.10.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Test-time scaling doesn't need to waste samples, Fusion-of-N turns every sample into signal; outperforming BoN across tasks, languages and models. πŸš€

Fusion-of-N boosts CommandA win-rates vs Gemini-2.5 Pro +8.3% across 11 languages – a +4.0% improvement over BoN πŸ₯‡

02.10.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Fusion-of-N uses an LLM (the fusor) to merge multiple candidate answers into one πŸ’Ž

Instead of selecting only one response, Fusion-of-N creates an even better answer by integrating insights across all samples πŸ…

02.10.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Is Best-of-N really the best use of your inference compute?

Introducing Fusion-of-N: a simple and powerful way to advance inference and distillation beyond Best-of-N.

02.10.2025 10:00 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1

Apply now: https://jobs.ashbyhq.com/cohere/7ec9eaf4-8cfc-4977-9041-86f73e7ab10b

30.09.2025 10:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We’re not your average lab. We’re a hybrid research environment dedicated to revolutionizing the ML space.

And we’re hiring a Senior Research Scientist to co-create with us.

If you believe in research as a shared, global effort β€” this is your chance.

30.09.2025 10:00 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Led by: Srishti Gureja, Elena Tommasone, Jingyi He, @sarahooker.bsky.social, Matthias Galle, and @mziizm.bsky.social

πŸ“„ Paper: https://arxiv.org/abs/2509.20837

29.09.2025 10:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ”Ή The future of synthetic training hinges on rethinking verification. It’s calibrated verification: complex, diverse test suites combined with flexible signals that break the Verification Ceiling and improve code LLMs.

29.09.2025 10:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ”Ή We also find that LLMs can serve as soft verifiers. Their judgments recover useful data and often match or surpass formal unit tests selection.

29.09.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ”Ή Relaxing verification thresholds boosts performance but only with sufficiently complex test suites. Correctness still matters, but how we define it is the real issue.

29.09.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We find:

πŸ”Ή Rigid verification risks biasing toward easy problems, while richer correctness signals preserve both quality and diversity.

29.09.2025 10:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

What if the way we verify synthetic code is limiting model performance?

In our latest work we uncover the Verification Ceiling Problem: strict β€œall tests must pass” rules throw away useful data, while weak tests let errors through.

29.09.2025 10:00 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

I'm excited to share that I'll be stepping into the role of Head of @cohereforai.bsky.social. It's an honor and a responsibility to lead such an extraordinary group of researchers pushing the boundaries of AI research.

05.09.2025 17:26 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image

Papers In The Park 14. Last one of the season! Still great weather. Surprising. Anthony is presenting the β€œWhy Language Models Hallucinate”.

Thanks to @cohereforai.bsky.social for the copies and pizza.

13.09.2025 16:14 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

🚨 Rare opportunity: Cohere Labs is hiring a Research Scientist!

If you’re passionate about studying fundamental AI problems and working in a globally collaborative, open-science environment, this is for you.

Apply here: jobs.ashbyhq.com/cohere/7ec9e...

24.09.2025 14:30 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

It’s papers in the park 7! Thanks to @cohereforai.bsky.social for the papers and the pizza, and to Alvin and Anthony for organizing.

It’s easily one of funnest paper reads in the city!

26.07.2025 15:32 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Breaking into AI research is harder than ever, and early-career researchers face fewer chances to get started.

Entry points matter.

We started the Scholars Program 3 years ago to give new researchers a real shot β€” excited to open applications for year 4✨

13.08.2025 14:42 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Check out the full blogpost here: https://cohere.com/blog/elo-ratings-beyond-arena-style-evaluations

Great to collaborate with Adithya Venkatadri Hulagadri, @mziizm.bsky.social‬, @jiangangngui.bsky.social‬, and @juliakreutzer.bsky.social‬ on this exploration.

15.08.2025 05:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In this blogpost we propose a 3rd path:
βœ… Balanced sampling across languages/tasks
βœ… Offline pseudo-pairwise comparisons (Bradley-Terry)
βœ… Confidence intervals & transparent breakdowns

The result? Rankings that better reflect real model utility.

15.08.2025 05:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

From circular wins/losses across skills, to tie-handling pitfalls, to prompt-spamming in arenas, Elo struggles when competition isn’t a single, binary game.

We show how multilingual, multi-task evaluation breaks its core assumptions.

15.08.2025 05:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

While effective for chessβ™ŸοΈ, Elo ratings struggle with LLM evaluation due to volatility and transitivity issues.

New post in collaboration with AI Singapore explores why Elo falls short for AI leaderboards and how we can do better.

15.08.2025 05:04 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

Still have questions about the Scholars Program? Join our information session on August 15th at 11am ET to get all the answers you need!

Register now - https://tinyurl.com/CohereLabsScholarsInfo

13.08.2025 13:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Accepted scholars will join our worldclass research team from Jan to Aug 2026. This full-time, paid opportunity reflects the program's intensity and dedication, setting it apart from other labs.

Apply here: https://jobs.ashbyhq.com/cohere/a77c6864-5a43-44c1-81dc-a66e23bdd9a6

13.08.2025 13:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Scholars will gain access to a robust experimental framework, empowering them to contribute to our ongoing commitment to responsible, fundamental research in machine learning. πŸ”₯ This is your chance to make a real impact and change the course of ML research.

13.08.2025 13:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The Scholars Program offers a unique, full-time opportunity to work alongside leading researchers in ML. πŸ“š

Our mission is to identify and nurture emerging talent from across the globe, driving innovative research that pushes the boundaries of AI. 🌌

cohere.com/research/scholars-program

13.08.2025 13:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Applications are now open for the next cohort of the Cohere Labs Scholars Program! 🌟

This is your chance to collaborate with some of the brightest minds in AI & chart new courses in ML research. Let's change the spaces breakthroughs happen.

Apply by Aug 29.

13.08.2025 13:32 β€” πŸ‘ 2    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

β€œWhen Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs”

Led by: Ammar Khairi, Daniel D'souza, Ye Shen, Julia Kreutzer, Sara Hooker

πŸ“œPaper link: arxiv.org/abs/2506.20544

26.06.2025 16:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@cohereforai is following 12 prominent accounts