Yonatan Lavy

@yonatanlavy.bsky.social

I build stuff for you to build stuff easier

602 Followers  |  4,197 Following  |  131 Posts  |  Joined: 05.01.2025  |  2.0021

Latest posts by yonatanlavy.bsky.social on Bluesky

Post image

Thanks Sonnet 4.5

04.10.2025 07:49 - 👍 1    🔁 0    💬 0    📌 0
Dotallio: Instantly build personal AI apps to organize, analyze, and automate your work. Transform unstructured data into actionable insights.

As promised, here's the link to Dotallio.

Get the power of GPT-5 without the robotic feel. It's free to use:
dotallio.com

07.08.2025 19:03 - 👍 0    🔁 0    💬 0    📌 0

Final verdict: OpenAI made a major step forward.

Even if the capabilities alone aren't jaw-dropping, the combination of quality, price, and accessibility is a quiet revolution. This will power a new wave of real-time applications. The future is exciting.

What's your take? As hyped as you expected?

07.08.2025 18:28 - 👍 0    🔁 0    💬 1    📌 0

And yes, it's still addicted to the em dash, the classic signature of AI-generated text.

(Which is a perfect, shameless plug for my app, Dotallio 😉.
It cleans up all those AI artifacts and gives you human-sounding output from powerful models like GPT-5. It's free. Link in the last reply!)

07.08.2025 18:28 - 👍 0    🔁 0    💬 0    📌 0

So, am I blown away? Not entirely. The demos are impressive, but it's not a quantum leap that leaves everyone in the dust.

The race is tight, and we'll likely see answers from Google (Gemini 3) and Anthropic (Opus 4.2) soon. For UI/frontend design, Opus 4.1 still has the edge.

07.08.2025 18:28 - 👍 1    🔁 0    💬 0    📌 0

The new API features are a huge win. Control over "reasoning effort" and "verbosity" is a welcome upgrade. Plus, "tool calls preamble" lets you force custom, structured output using Regex. This is a game-changer for reliability and unique use cases no one else offers.

07.08.2025 18:28 - 👍 0    🔁 0    💬 0    📌 0
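
For anyone who wants to try the "reasoning effort" and "verbosity" controls from the post above, here is a rough sketch of what a request body might look like. It is not taken from the thread. The field names (`reasoning.effort`, `text.verbosity`) and the allowed values are assumptions based on how OpenAI's Responses API is commonly described, so check the current API reference before relying on them. `build_request` is a hypothetical helper that only builds a dictionary and sends nothing over the network.

```python
import json

# Assumed allowed values; verify against the current API reference.
REASONING_EFFORTS = {"minimal", "low", "medium", "high"}
VERBOSITY_LEVELS = {"low", "medium", "high"}


def build_request(prompt: str, effort: str = "medium", verbosity: str = "medium") -> dict:
    """Build a request body with reasoning-effort and verbosity settings.

    Hypothetical helper for illustration: it validates the two knobs and
    returns a plain dict. Nothing is sent anywhere.
    """
    if effort not in REASONING_EFFORTS:
        raise ValueError(f"unknown reasoning effort: {effort!r}")
    if verbosity not in VERBOSITY_LEVELS:
        raise ValueError(f"unknown verbosity: {verbosity!r}")
    return {
        "model": "gpt-5",                    # model name as mentioned in the thread
        "input": prompt,
        "reasoning": {"effort": effort},     # assumed field name
        "text": {"verbosity": verbosity},    # assumed field name
    }


if __name__ == "__main__":
    body = build_request("Summarize this thread.", effort="minimal", verbosity="low")
    print(json.dumps(body, indent=2))
```

Keeping the two settings as separate arguments mirrors how the post treats them: one controls how hard the model thinks, the other how long the answer is.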

It's also FREE on ChatGPT. Not a demo, not a nerfed version. The full model. This democratizes access and will massively accelerate adoption and innovation across the industry.

07.08.2025 18:28 - 👍 0    🔁 0    💬 0    📌 0

For devs, this is a dream. It matches top-tier models like Anthropic's Opus 4.1 but at a tenth of the cost via API.

This will fuel companies like Lovable or Bolt that build entire apps in real-time, making high-performance AI economically viable.

07.08.2025 18:28 - 👍 0    🔁 0    💬 0    📌 0
Post image

Okay, GPT-5 is live. But is it the game-changer we all expected?

After an initial dive into the specs, the model, and the API, here's my take: The real story isn't one single feature, but the entire package.

🧵 of my thoughts below 👇

07.08.2025 18:28 - 👍 1    🔁 0    💬 6    📌 0
Post image Post image

GPT-5 is at the same level as Opus 4.1 (on SWE-bench).

@OpenAI and @AnthropicAI seem to be staying near the top together. It's amazing news that the top competitors are this close to each other.

07.08.2025 17:14 - 👍 1    🔁 0    💬 0    📌 0
picture

picture

Claude Code is dancing for me

What does your AI IDE do for you?

24.06.2025 15:00 - 👍 2    🔁 0    💬 0    📌 0

https://github.com/vercel/ai/issues/6589
^^ Issue opened

03.06.2025 14:00 - 👍 2    🔁 0    💬 0    📌 0

5/5
Thankfully, open source alternatives are starting to catch up in quality.

Maybe it is time to pay more attention to them and invest in solutions that give us more transparency and control.

Would love to hear how others handled this if it happened to you.

GitHub issue in the next comment

03.06.2025 14:00 - 👍 0    🔁 0    💬 0    📌 0

4/5
This experience really makes me question how much trust we put in these big platforms.

When the infrastructure is out of your hands, you are always at risk of being blindsided by changes you cannot control.

03.06.2025 14:00 - 👍 0    🔁 0    💬 0    📌 0

3/5
I reverted to OpenAI for now, but honestly, I expected more from Google.

If you are releasing a model, revising its version, showcasing it at your keynote, and encouraging everyone to build on it, the least you can do is announce breaking changes and give companies time to adapt.

03.06.2025 14:00 - 👍 0    🔁 0    💬 0    📌 0

2/5
I opened an issue and immediately saw a wave of other companies reporting the same thing.

Imagine that your app stops working overnight and you have no idea why. It is a strange feeling when something so critical just fails out of nowhere.

03.06.2025 14:00 - 👍 0    🔁 0    💬 0    📌 0

1/5
Yesterday everything worked perfectly in production. Today, Gemini just stopped working.
No code changes. No notice.

I spent hours digging through my own code, convinced I must have broken something, only to finally realize the issue was coming from Google's side.

03.06.2025 14:00 - 👍 0    🔁 0    💬 0    📌 0

Google broke my app without any warning.

Full breakdown below 🧵

03.06.2025 14:00 - 👍 1    🔁 0    💬 6    📌 0
Post image

lies.

30.05.2025 16:34 - 👍 0    🔁 0    💬 0    📌 0

how does it look?

28.05.2025 18:49 - 👍 1    🔁 0    💬 0    📌 0
Post image

Have you ever wanted open source, AI-native presentations?

28.05.2025 18:46 - 👍 1    🔁 1    💬 0    📌 1

Fine, I'll build it myself

28.05.2025 18:41 - 👍 0    🔁 0    💬 0    📌 0

Are there seriously no open source alternatives to Google Slides????

26.05.2025 15:37 - 👍 0    🔁 0    💬 0    📌 1
picture

picture

I love that I'm able to have 3 agents running in parallel on my codebase in Cursor.

Each has its own agent mode: planning, executing, or critiquing the execution of major features.

For planning I still use Dotallio, but for the code itself, Cursor is hands down the best.

20.05.2025 14:00 - 👍 1    🔁 0    💬 0    📌 0

5/
For more on the latest cutting-edge AI, or on my journey as I build @Dotallio, feel free to follow me.

15.04.2025 14:04 - 👍 1    🔁 0    💬 0    📌 0

4/
It seems that OpenAI hasn't released a new model, but has just made the latest 4o available through the API and rebranded it without letting us know.

What do you think about it?

link for the 4o benchmarks - https://artificialanalysis.ai/models/comparisons/gpt-4o-chatgpt-03-25-vs-grok-3

15.04.2025 14:04 - 👍 1    🔁 0    💬 0    📌 0
picture

picture

picture

picture

picture

picture

3/
Now let's look at the knowledge cutoff date.

If you access the "latest" 4o model via the API, it reports "April 2024".

But the 4o model accessed via chat reports "June 2024", the same as 4.1!

15.04.2025 14:04 - 👍 0    🔁 0    💬 0    📌 0
picture

picture

2/
In the benchmark above, 4.1 scores about 66% on GPQA Diamond.

I found one GPQA result for the updated March 2025 4o,
AND IT MATCHES EXACTLY. So the 4.1 model shows the same performance as the March 2025 GPT-4o.

(You can see more in the link at the end.)

15.04.2025 14:04 - 👍 0    🔁 0    💬 0    📌 0
picture

picture

1/
OpenAI claims massive improvements in GPT-4.1 over GPT-4o, but in the small details, the subheader says "2024-11-20".

But they updated 4o just recently, around when image generation came out.

Let's dig in deeper...

15.04.2025 14:04 - 👍 0    🔁 0    💬 0    📌 0
picture

picture

OpenAI just lied.

They "launched" GPT-4.1, even though it appears to be the SAME EXACT MODEL as the updated 4o launched in March.

Let me show you the exact details 🧵👇

15.04.2025 14:04 - 👍 2    🔁 0    💬 6    📌 0

@yonatanlavy is following 18 prominent accounts