Algorithmic Research Group on X: "At ARG, we're laser-focused on understanding recursive self-improvement. We're confident that as models scale, RSI will accelerate the frontier of AI at ever-increasing speeds. Over the past year, we've created benchmarks, agents, and AI systems to measure how this might happen. https://t.co/JyOPFSB8DJ" / X
At ARG, we're laser-focused on understanding recursive self-improvement. We're confident that as models scale, RSI will accelerate the frontier of AI at ever-increasing speeds. Over the past year, we've created benchmarks, agents, and AI systems to measure how this might happen. https://t.co/JyOPFSB8DJ
Very excited to launch this little tool that weβve been building. ScoutML is an API built for AI researchers and agents that includes a ton of metadata on each paper. Itβs been super helpful for us as we run our research agents internally.
x.com/algoresearch...
29.07.2025 19:09 β
π 2
π 0
π¬ 0
π 0
Itβs built on top of a foundation of parsed metadata from papers, code, and reposβmodels, metrics, datasets, SOTA claims, GPU counts (and types), ablation studies, citations, etc. Itβs already become crucial to our internal research, and we hope it can be helpful to others, too.
19.05.2025 13:42 β
π 0
π 0
π¬ 1
π 0
Itβs designed to support that murky, nonlinear part of the research process, where you're still figuring out what's interesting.
19.05.2025 13:42 β
π 0
π 0
π¬ 1
π 0
You give it a question like βHow can we improve generalization in low-resource RL?β and it returns distilled insights, speculative ideas, and experimental code. Not final answers, just something to push the thinking forward.
19.05.2025 13:42 β
π 0
π 0
π¬ 1
π 0
Most of the time, I end up manually digging through papers, chasing links, and piecing together ideas. It works, but itβs slow, and it doesnβt scale with curiosity. Iβve been trying to fix that with a platform we're building called ProspectML.
19.05.2025 13:42 β
π 0
π 0
π¬ 1
π 0
A lot of ML tools help you implement. Not many help you think.
When Iβm exploring a new research direction, I donβt want another search engine or citation graph. I want something thatβs actually read the literature, can suggest promising directions, and helps me reason through tradeoffs.
19.05.2025 13:42 β
π 4
π 1
π¬ 1
π 0
hello world!
11.01.2025 22:53 β
π 2
π 1
π¬ 0
π 0
ARG is on Bluesky! Please follow here: @algoresearch.bsky.social
11.01.2025 22:55 β
π 0
π 0
π¬ 0
π 0
Recommendations for Technical AI Safety Research Directions
good post on 2025 ai safety research directions:
alignment.anthropic.com/2025/recomme...
11.01.2025 22:48 β
π 1
π 1
π¬ 0
π 0
Back in Pennsylvania, drinking schuylkill county coal cracker (boilo) and making pierogies
26.12.2024 22:07 β
π 3
π 0
π¬ 0
π 0
Thatβs because it was from 2022
30.11.2024 03:40 β
π 1
π 0
π¬ 1
π 0
AI for science could be more impactful than chatbots. It is already helping win Nobel prizes and accelerating drug development and materials discovery.
Today we published an essay about it: why it matters, how itβs happening and its implications. Here is a summary from an econ / social sci lens.
26.11.2024 10:39 β
π 79
π 30
π¬ 2
π 7
Important point that the open protocol makes extracting data from bluesky easy. Can't have it both ways. I like the protocol and think this site is well designed, but that means anyone can and will analyze these posts (if there is value to them, which I'm honestly less convinced of than some)
28.11.2024 19:06 β
π 11
π 2
π¬ 0
π 1
28.11.2024 19:04 β
π 7
π 1
π¬ 0
π 0
A dataset of 1 million or 2 million Bluesky posts is completely irrelevant to training large language models.
The primary usecase for the datasets that people are losing their shit over isn't ChatGPT, it's social science research and developing systems that improve Bluesky.
28.11.2024 18:57 β
π 251
π 39
π¬ 8
π 5
Wait what even is this platform. This is insane
28.11.2024 18:54 β
π 2
π 0
π¬ 0
π 0
What! If it works for umap-learn vs umap iβm in.
25.11.2024 10:00 β
π 0
π 0
π¬ 0
π 0
I have a community project in Eleuther and open source all of my research:
bsky.app/profile/bayk...
25.11.2024 09:36 β
π 1
π 0
π¬ 1
π 0
Jk the rest are great. Just a big uncle nearest fan
25.11.2024 03:13 β
π 1
π 0
π¬ 1
π 0
5
.
.
.
.
.
.
.
.
3
2
4
1
25.11.2024 03:12 β
π 1
π 0
π¬ 1
π 0
We welcome PRs, contributions, additional tasks, and task revisions. Excited to see how agents perform on this benchmark.
24.11.2024 20:02 β
π 0
π 0
π¬ 0
π 0
We develop a baseline agent, with tools for coding, research (via Semantic Scholar), and model training, built on top of Sonnet 3.5 and GPT-4o. Our baseline agent performs well across tasks, but generally fails to move beyond baseline implementations.
24.11.2024 20:02 β
π 0
π 0
π¬ 1
π 0
ML Research Bench adapts tasks from ML conference competitions like βNeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayβ and βLLM Merging Competitionβ. We prompt agents to complete these challenging tasks. These tasks move beyond simple ML tasks.
24.11.2024 20:02 β
π 0
π 0
π¬ 1
π 0
ML Research Benchmark
Artificial intelligence agents are increasingly capable of performing complex tasks across various domains. As these agents advance, there is a growing need to accurately measure and benchmark their c...
(re-posting from X)
Can we get AI to accelerate AI research and development?
Iβm excited to release ML Research Benchmark, an agentic benchmark of 7 ML conference competition tasks.
Paper: arxiv.org/abs/2410.22553
Tasks: github.com/AlgorithmicR...
Agent: github.com/AlgorithmicR...
24.11.2024 20:02 β
π 2
π 0
π¬ 2
π 1
Maxo Kream
23.11.2024 14:21 β
π 1
π 0
π¬ 0
π 0
Sure! Multi-role or multi-module. From a software development perspective I think of them as microservices. Nothing more
23.11.2024 14:01 β
π 1
π 0
π¬ 0
π 0
Thereβs a paper, Iβll try to find it
23.11.2024 06:33 β
π 0
π 0
π¬ 1
π 0
Idk about βstrengthsβ and βperspectivesβ but you do want a separation of concerns if your agents have a lot of tools. And you do want specific system prompts to guide their objectives. If you pack too many tools into an api call, the model will only use a handful of them.
23.11.2024 06:33 β
π 0
π 0
π¬ 1
π 0