facebook/collaborative_agent_bench · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
ColBench: Technical framework for multi-turn LLM reasoning evaluation
- Reliable simulation with LLMs as human collaborators
- Functional verifiers measuring similarity to reference artefacts
- Supports both backend programming and visual frontend design
huggingface.co/datasets/fac...
24.03.2025 09:24 — 👍 14 🔁 3 💬 0 📌 0
Creative AI meetup: The Return · Luma
This event will host talks artists and researchers presenting AI technologies and their creative applications.
Event schedule:
18:30 Arrival
19:00 Terence…
Time for my #CreativeAI meetup to return 🤖🥳
After many years, join us on 5th March at Newspeak House with @terencebroad.bsky.social from UAL CCI, the AI research artist @cheesetalk.bsky.social and Thu Nguyen-Phuoc from Meta
Sign up here: bit.ly/4jTb7rD
13.02.2025 17:36 — 👍 4 🔁 1 💬 0 📌 1
I‘m still not sure about bluesky- is this working for folks? Who should I follow?
11.02.2025 14:02 — 👍 0 🔁 0 💬 0 📌 0
Very interesting work!
29.01.2025 14:24 — 👍 0 🔁 0 💬 0 📌 0
Screenshot of this text: Total annotations submitted: 50,035 Languages with annotations: 115 Total contributors: 419
🎉 50,000+ annotations reached! The FineWeb2-C community is helping build better language models on annotation at a time.
📊 Current stats:
- 115 languages represented
- 419 amazing contributors
- 24 languages with complete datasets
But we're not done yet! 🧵
16.01.2025 17:32 — 👍 18 🔁 6 💬 1 📌 0
Microsoft Responsible AI Mixer in NYC happening now!
23.01.2025 23:44 — 👍 1 🔁 0 💬 0 📌 0
On the Origin of Deep Learning
arxiv.org/pdf/1702.07800
22.01.2025 13:20 — 👍 1 🔁 0 💬 0 📌 0
Wait..
22.01.2025 01:38 — 👍 0 🔁 0 💬 0 📌 0
Happy New Year, friends!
01.01.2025 13:26 — 👍 0 🔁 0 💬 0 📌 0
woohooo! 🙌
19.12.2024 12:44 — 👍 0 🔁 0 💬 0 📌 0
Friday read: The o1 System Card cdn.openai.com/o1-system-ca...
06.12.2024 14:35 — 👍 1 🔁 0 💬 0 📌 0
The Lichess database of games, puzzles, and engine evaluations is now on @hf.co - https://huggingface.co/Lichess. Billions of chess data points to download, query, and stream and we're excited to see what you'll build with it! ♟️ 🤗
06.12.2024 09:46 — 👍 94 🔁 23 💬 3 📌 2
Slick!
24.11.2024 18:14 — 👍 1 🔁 0 💬 0 📌 0
23 // Kaggle Competitions Grandmaster & ML/AI Researcher. Building video games @ Iconic, machine reasoning @ Cambridge, bioscience @ ForecomAI.
https://mxbi.net / tw: @mikb0b
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
research scientist at ScaDS.AI Leipzig in nlp, ir, and ml. @hf.co fellow. @lichess.org team member. @kaggle.com datasets expert.
https://lichess.org The free chess server. No paywall, no tracking, no ads. Just the good stuff. User support requests should be directed to https://lichess.org/contact
Chief Scientist @ Distributional.com @dbnlAI.bsky.social #MLSky #StatSky
Founder @ datascientific.com
Founder wimlds.org & co-founder rladies.org
PhD @ UC Berkeley
🏡 🌈 Oakland, California.
Women+ in Machine Learning & Data Science (WiMLDS) Org. | meetup community of women & non-binary folks | Est 2013 👩🏿💻👩💻👩🏽💻 #GenderEquality #Inclusion
https://wimlds.org
building something new
reposting art, research
prev: ed tech startup (10M users, acquired), yc, that forbes list, mit
https://www.leandra.dev/
I work at Sakana AI 🐟🐠🐡 → @sakanaai.bsky.social
https://sakana.ai/careers
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/
Ph.D. in NLP Interpretability from Mila. Previously: independent researcher, freelancer in ML, and Node.js core developer.
VP - Product, Developer Platform @ Meta.
Former VP at Google DeepMind, Former MSFT.
Opinions here are mine.
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian.
Anon feedback: https://admonymous.co/giffmana
📍 Zürich, Suisse 🔗 http://lucasb.eyer.be
EU Policy Lead & Applied Researcher @ Hugging Face 🤗
Computer Scientist, PhD
Wikipedia & languages are my ♡
Democratizing machine learning through Gradio, acquired by Hugging Face 🤗
Passionate about AI & Journalism / Previously @hf.co @radiocanadainfo @ledevoir & others
Researcher trying to shape AI towards positive outcomes. ML & Ethics +birds. Generally trying to do the right thing. TIME 100 | TED speaker | Senate testimony provider | Navigating public life as a recluse.
Former: Google, Microsoft; Current: Hugging Face