
Akash Swamy

@akashswamy.bsky.social

Product Manager @ Graphcore | My opinions are my own | I talk about AI, LLMs, Engineering, Products and, of course, Accelerators

53 Followers  |  421 Following  |  6 Posts  |  Joined: 20.11.2024  |  1.6923

Latest posts by akashswamy.bsky.social on Bluesky

GitHub Copilot: The agent awakens Introducing agent mode for GitHub Copilot in VS Code, announcing the general availability of Copilot Edits, and providing a first look at our SWE agent.

GitHub's Copilot has just undergone a substantial upgrade. It now features Agent mode, with multi-file edits using various LLMs. While the pricing seems competitive compared to its closest competitor, Cursor, it's unclear what the monthly editing limit is.

github.blog/news-insight...

08.02.2025 12:11 | 👍 0    🔁 0    💬 0    📌 0

I haven't seen o3 yet & have been critical of benchmarks for AI but they did test against some of the hardest & best

On GPQA, PhDs with access to the internet got 34% outside their specialty, up to 81% inside. o3 is 87%.

Frontier Math went from the best AI at 2% to 25%

Some other big ones, too

21.12.2024 06:27 | 👍 113    🔁 16    💬 6    📌 0
OpenAI o3 Breakthrough High Score on ARC-AGI-Pub OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.

OpenAI's o3 model surpassed expectations on the ARC-AGI benchmark with impressive reasoning skills. Not AGI (we still don't know what that is), but a big leap. Fingers crossed for o3/o3-mini public access in the future.
#openai

arcprize.org/blog/oai-o3-...

20.12.2024 21:07 | 👍 2    🔁 0    💬 0    📌 0

Spotify users have a delightful surprise this year: a Wrapped podcast powered by NotebookLM. Spotify Wrapped has always been an excellent summary of my listening trends, but this time, you can actually listen to two AI-generated podcasters presenting it to you.
#spotify #notebooklm

04.12.2024 14:35 | 👍 1    🔁 0    💬 0    📌 0
GitHub - allenai/OLMo: Modeling, training, eval, and inference code for OLMo

We just updated the OLMo repo at github.com/allenai/OLMo!
There are now several training configs that together reproduce the training runs that led to the final OLMo 2 models.
In particular, all the training data is available, tokenized and shuffled exactly as we trained on it!

02.12.2024 20:13 | 👍 54    🔁 11    💬 0    📌 0
OLMo 2: The best fully open language model to date | Ai2 Our next generation of fully-open base and instruct models sit at the Pareto frontier of performance and training efficiency.

Interesting development last week on small language models (SLMs). The trend is clear: models are getting better at FLOP efficiency and reasoning while keeping parameter counts small. Agentic workflows could become cheaper and better with these developments.
#llm #ai
allenai.org/blog/olmo2

30.11.2024 11:30 | 👍 2    🔁 0    💬 0    📌 0
The Shift from Models to Compound AI Systems The BAIR Blog

It's uncertain whether the scaling laws will continue to hold, but we might see many intriguing techniques emerge in the application layer.

bair.berkeley.edu/blog/2024/02...

#llm #compoundai

26.11.2024 10:41 | 👍 0    🔁 0    💬 0    📌 0
LLM Explorer: A Curated Large Language Model Directory. LLM List. 38371 Open-Source Language Models. Browse 38371 open-source large and small language models conveniently grouped into various categories and llm lists complete with benchmarks and analytics.

An amazing source for comparing LLM inference frameworks, hosting costs (inference) and serverless options. llm.extractum.io
#llm #llmops #serverless #inference

25.11.2024 16:40 | 👍 1    🔁 0    💬 0    📌 0

@akashswamy is following 20 prominent accounts