Akash Swamy @akashswamy - Bluesky Profile

GitHub Copilot: The agent awakens Introducing agent mode for GitHub Copilot in VS Code, announcing the general availability of Copilot Edits, and providing a first look at our SWE agent.

GitHub’s Copilot has just undergone a substantial upgrade. It now features Agent mode, with multi-file edits using various LLMs. While the pricing seems competitive compared to its closest competitor, Cursor, it’s unclear what’s the editing limit per month.

github.blog/news-insight...

08.02.2025 12:11 — 👍 0 🔁 0 💬 0 📌 0

I haven’t seen o3 yet & have been critical of benchmarks for AI but they did test against some of the hardest & best

On GPQA, PhDs with access to the internet got 34% outside their specialty, up to 81% inside. o3 is 87%.

Frontier Math went from the best AI at 2% to 25%

Some other big ones, too

21.12.2024 06:27 — 👍 113 🔁 16 💬 6 📌 0

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.

OpenAI’s o3 model surpassed expectations on the Arc-AGI benchmark with impressive reasoning skills. Not AGI (we still don’t know what that is), but a big leap. Fingers crossed for o3/o3-mini public access in the future.
#openai

arcprize.org/blog/oai-o3-...

20.12.2024 21:07 — 👍 2 🔁 0 💬 0 📌 0

People using Spotify have a delightful surprise this year in the form of NotebookLM wrapped podcast. Spotify Wrapped has always been an excellent summary of my listening trends, but this time, you can actually listen to two AI-generated podcasters presenting it to you.
#spotify #notebooklm

04.12.2024 14:35 — 👍 1 🔁 0 💬 0 📌 0

GitHub - allenai/OLMo: Modeling, training, eval, and inference code for OLMo Modeling, training, eval, and inference code for OLMo - allenai/OLMo

We just updated the OLMo repo at github.com/allenai/OLMo!
There are now several training configs that together reproduce the training runs that lead to the final OLMo 2 models.
In particular, all the training data is available, tokenized and shuffled exactly as we trained on it!

02.12.2024 20:13 — 👍 54 🔁 11 💬 0 📌 0

OLMo 2: The best fully open language model to date | Ai2 Our next generation of fully-open base and instruct models sit at the Pareto frontier of performance and training efficiency.

Interesting development last week on small language models (SLMs). The trend is clear: models getting better with flop efficiency and reasoning capabilities while maintaining smaller param size. Agentic workflow could become cheaper and better with these developments.
#llm #ai
allenai.org/blog/olmo2

30.11.2024 11:30 — 👍 2 🔁 0 💬 0 📌 0

The Shift from Models to Compound AI Systems The BAIR Blog

It’s uncertain whether the scaling law will hold true, but we might witness numerous intriguing techniques in the application layer.

bair.berkeley.edu/blog/2024/02...

#llm #compoundai

26.11.2024 10:41 — 👍 0 🔁 0 💬 0 📌 0

LLM Explorer: A Curated Large Language Model Directory. LLM List. 38371 Open-Source Language Models. Browse 38371 open-source large and small language models conveniently grouped into various categories and llm lists complete with benchmarks and analytics.

An amazing source for comparing LLM inference frameworks, hosting costs (inference) and serverless options. llm.extractum.io
#llm #llmops #serverless #inference

25.11.2024 16:40 — 👍 1 🔁 0 💬 0 📌 0

Akash Swamy

Latest posts by akashswamy.bsky.social on Bluesky

@akashswamy is following 20 prominent accounts