Jonathan Bragg @jbragg - Bluesky Profile

Latest posts by jbragg.bsky.social on Bluesky

Brooke Vlahos, Peter Clark, Doug Downey, @yoavgo.bsky.social Ashish Sabharwal, Daniel S. Weld

06.11.2025 17:01 — 👍 0 🔁 0 💬 0 📌 0

Amanpreet Singh, Harshit Surana, Aryeh Tiktinsky, Rosni Vasu @guywiener.bsky.social Chloe Anastasiades, Stefan Candra, Jason Dunkelberger, Dan Emery, Rob Evans, Malachi Hamada, Regan Huff, Rodney Kinney, Matt Latzke, Jaron Lochner, Ruben Lozano-Aguilera, Cecile Nguyen, Smita Rao, Amber Tanaka...

06.11.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0

🙏 Many thanks to my @ai2.bsky.social teammates—Mike D’Arcy @nbalepur.bsky.social Dan Bareket, Bhavana Dalvi @sergeyf.bsky.social Dany Haddad, Jena D. Hwang, @peterjansen-ai.bsky.social Varsha Kishore, Bodhisattwa Majumder @arnaik19.bsky.social Sigal Rahamimov, Kyle Richardson...

06.11.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0

GitHub - allenai/agent-baselines Contribute to allenai/agent-baselines development by creating an account on GitHub.

We tested 22 agent classes—more *kinds* than other benchmarks

🤖AgentBaselines makes them reusable, incl. our SOTA science agents: github.com/allenai/agent-baselines

📚Blog: allenai.org/blog/astabench
📄Paper: arxiv.org/abs/2510.21652
📊Leaderboard: huggingface.co/spaces/allenai/asta-bench-leaderboard

06.11.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0

🛠️AstaBench is the first to provide reproducible (date-limited) large-scale search tools—plus a full scientific research environment for agents.

📊Our leaderboard highlights agents that use these tools, enabling more controlled measurement of *AI*. (We measure LLM costs too.)

06.11.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0

AstaBench with abstract measurement icons

Agent benchmarks don't measure true *AI* advances

We built one that's hard & trustworthy:
👉 AstaBench tests agents w/ *standardized tools* on 2400+ scientific research problems
👉 SOTA results across 22 agent *classes*
👉 AgentBaselines agents suite

🆕 arxiv.org/abs/2510.21652

🧵👇

06.11.2025 17:01 — 👍 7 🔁 1 💬 1 📌 0

@kylelo.bsky.social your gifs are an unapproved manipulation of my human attention

09.10.2025 21:06 — 👍 2 🔁 0 💬 0 📌 0

@jbragg is following 20 prominent accounts

Yoav Goldberg
@yoavgo

Nishant Balepur
@nbalepur

CS PhD Student. Trying to find that dog in me at UMD. Babysitting (aligning) + Bullying (evaluating) LLMs nbalepur.github.io

Jack Hessel
@jmhessel

jmhessel.com @Anthropic. Seattle bike lane enjoyer. Opinions my own.

Savvas Petridis
@savvaspetridis

Research Scientist at Google DeepMind, in the People + AI Research (PAIR) team. savvaspetridis.github.io

Ai2
@ai2

Breakthrough AI to solve the world's biggest problems. › Join us: http://allenai.org/careers › Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm

Adam Marcus
@marcua.net

Hummus, people, and data. Co-Founder & CTO of B12. Previously Locu, MIT CSAIL. He/him. https://marcua.net/ Queens is the future.

Kurt Luther
@kurtluther

Associate Professor of Computer Science, Virginia Tech

Jon Froehlich
@jonfroehlich

🎓 HCI Professor, UW CS 🛠️ Director, makeabilitylab.cs.uw.edu ♿️ Co-founder, projectsidewalk.org 🤖 Visiting Researcher, Google Research

Kira Goldner
@kiragoldner

Assistant Professor at BU CDS EconCS | Theory of CS | MD+AI+DS4SG | MD4SG co-founder Previously Columbia, UW, Oberlin. Views are mine alone. www.kiragoldner.com

Paul Krugman
@pkrugman

Ex NY Times, now author of Substack Paul Krugman. Nobel laureate and, according to Donald Trump, "Deranged BUM"

Catherine Rampell
@crampell

Economics editor at The Bulwark. MS NOW (formerly MSNBC) anchor. Previously WaPo op-ed columnist and NYT reporter. Econ, politics, immigration, tax, etc. + occasional theater nerdery.

Peter Henderson
@peterhenderson

Assistant Professor the Polaris Lab @ Princeton (https://www.polarislab.org/); Researching: RL, Strategic Decision-Making+Exploration; AI+Law

Shannon Shen
@shannonshen

PhD Student @MIT | Previous @allen_ai | #NLP #HCI | www.szj.io

David Jurgens
@davidjurgens

Associate prof at @UMich in SI and CSE working in computational social science and natural language processing. PI of the Blablablab blablablab.si.umich.edu

Ben Lee
@bcgl

Assistant Professor @ the University of Washington iSchool | formerly an Innovator in Residence @ Library of Congress | essays in WIRED, Gawker, The New Republic, Longreads, Current Affairs, etc. 🌐 www.bcglee.com

Aakanksha Naik
@arnaik19

Research Scientist at the Allen Institute for AI (AI2), interested in information extraction, NLP for healthcare and transfer learning, PhD from CMU LTI. Website: https://www.cs.cmu.edu/~anaik/

Joseph Chang
@josephc

https://josephcc.com

Lucy Lu Wang
@lucylw

Asst Prof @uwischool.bsky.social; #NLP #healthinformatics #accessibility #scholcomm 🚴🏔️🍄❄️⛷️🧶⚫️⚪️📚🍸in Seattle; llwang.net; she/her

Amy Zhang
@axz

Associate professor of social computing at UW CSE, leading @socialfutureslab.bsky.social social.cs.washington.edu

Maria Antoniak
@mariaa

asst prof of computer science at cu boulder nlp, cultural analytics, narratives, communities books, bikes, games, art https://maria-antoniak.github.io