GitHub Copilot: The agent awakens
Introducing agent mode for GitHub Copilot in VS Code, announcing the general availability of Copilot Edits, and providing a first look at our SWE agent.
GitHubโs Copilot has just undergone a substantial upgrade. It now features Agent mode, with multi-file edits using various LLMs. While the pricing seems competitive compared to its closest competitor, Cursor, itโs unclear whatโs the editing limit per month.
github.blog/news-insight...
08.02.2025 12:11 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I havenโt seen o3 yet & have been critical of benchmarks for AI but they did test against some of the hardest & best
On GPQA, PhDs with access to the internet got 34% outside their specialty, up to 81% inside. o3 is 87%.
Frontier Math went from the best AI at 2% to 25%
Some other big ones, too
21.12.2024 06:27 โ ๐ 113 ๐ 16 ๐ฌ 6 ๐ 0
OpenAI o3 Breakthrough High Score on ARC-AGI-Pub
OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.
OpenAIโs o3 model surpassed expectations on the Arc-AGI benchmark with impressive reasoning skills. Not AGI (we still donโt know what that is), but a big leap. Fingers crossed for o3/o3-mini public access in the future.
#openai
arcprize.org/blog/oai-o3-...
20.12.2024 21:07 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
People using Spotify have a delightful surprise this year in the form of NotebookLM wrapped podcast. Spotify Wrapped has always been an excellent summary of my listening trends, but this time, you can actually listen to two AI-generated podcasters presenting it to you.
#spotify #notebooklm
04.12.2024 14:35 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
GitHub - allenai/OLMo: Modeling, training, eval, and inference code for OLMo
Modeling, training, eval, and inference code for OLMo - allenai/OLMo
We just updated the OLMo repo at github.com/allenai/OLMo!
There are now several training configs that together reproduce the training runs that lead to the final OLMo 2 models.
In particular, all the training data is available, tokenized and shuffled exactly as we trained on it!
02.12.2024 20:13 โ ๐ 54 ๐ 11 ๐ฌ 0 ๐ 0
OLMo 2: The best fully open language model to date | Ai2
Our next generation of fully-open base and instruct models sit at the Pareto frontier of performance and training efficiency.
Interesting development last week on small language models (SLMs). The trend is clear: models getting better with flop efficiency and reasoning capabilities while maintaining smaller param size. Agentic workflow could become cheaper and better with these developments.
#llm #ai
allenai.org/blog/olmo2
30.11.2024 11:30 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
The Shift from Models to Compound AI Systems
The BAIR Blog
Itโs uncertain whether the scaling law will hold true, but we might witness numerous intriguing techniques in the application layer.
bair.berkeley.edu/blog/2024/02...
#llm #compoundai
26.11.2024 10:41 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Co-founder at @BotCity (YC W22)
OSS Maintainer at MarvinJ and Marvin
Computer Scientist, AI, Open Source
Research director | @McGillU @Mila_Quebec @IVADO_Qc | My team designs machine learning frameworks to understand biological systems from new angles of attack
Developing the next generation of AI compute.
Part of the SoftBank Group.
Author of "The PRFAQ Framework" (www.theprfaq.com). 18yrs of startups in Seattle & London, MSFT, Amazon. Runner. Cook. Geek.
Tech, software, AI/ML, UX, product, innovation, startups, leadership.
๐ Seattle
๐ง๐ท๐บ๐ธ
https://calbucci.com/link
Technology news and analysis with a focus on founders and startup teams.
Got a tip? http://techcrunch.com/tips
The AI-powered developer platform to build, scale, and deliver secure software.
AI Engineer @ Google ๐จโ๐ป โ Educator ๐จโ๐ซ โ Traveller โ๏ธ โ Hobby photographer ๐ท โ Foodie ๐ฎ โ Film fan ๐ฟ โ Boardgamer ๐ฒ โ Londoner๐โโ๏ธ
Medium: https://heiko-hotz.medium.com/
Github: https://github.com/heiko-hotz
LI: https://www.linkedin.com/in/heikohotz/
AI Scientist at Mistral AI.
Past: Google DeepMind.
๐ง๐ท in ๐ฌ๐ง
Researcher at Cohere | Multilingual LLM evaluation
I build tools that propel communities forward
Europe's empowering cloud provider ๐
I work on AI at OpenAI.
Former VP AI and Distinguished Scientist at Microsoft.
Head of Post-Training @LiquidAI
๐ Blog: https://mlabonne.github.io/blog
๐ป GitHub: https://github.com/mlabonne
๐ค HF: https://huggingface.co/mlabonne
I'm an author, futurist, thinker, systems architect and applied AI engineer.
https://danieljeffries.substack.com/
I write about product, tech, AI and more for 48,000+ readers on my Substack: https://departmentofproduct.substack.com/
Ex PM: shutl, ebay, C4, fintech / banking.
We analyse the business of Semiconductors and share insights about the most important industry, ever.
Creator of aidisruptor.ai - my community of 4,000+ everyday people learning how to use AI tools through practical guides.
Gen AI Product Lead @ Google | I certify AI Product Managers | Harvard Business School | ML PhD | Fortune 40u40 | http://marily.substack.com.