Petr Šimeček's Avatar

Petr Šimeček

@psimecek.bsky.social

Marie Skłodowska-Curie Fellow. Stupid tweets are my own. All the glory belongs to CEITEC and Mediaboard.

32 Followers  |  51 Following  |  42 Posts  |  Joined: 17.12.2024  |  1.8713

Latest posts by psimecek.bsky.social on Bluesky

Post image

Czech contribution to AI discussion

06.08.2025 13:33 — 👍 0    🔁 0    💬 0    📌 0
Preview
Introducing gpt-oss gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models

Was excited about OpenAI's new open-source models until I hit "trained on a mostly English, text-only dataset" For Czech gpt-oss-20b is genuinely bad, gpt-oss-120b is passable but far from impressive. Running formal benchmarks, but casual testing already tells the story 🇨🇿

openai.com/index/introd...

06.08.2025 13:19 — 👍 0    🔁 0    💬 0    📌 0
Post image

This is actually pretty good illustration of AI-assisted coding success / failure (ignore the term "vibe coding" and be honest - we have all been on the 4th line)

Source: www.reddit.com/r/vibecoding...

10.06.2025 08:11 — 👍 1    🔁 0    💬 0    📌 0

- Co vy na té fildě vlastně děláte.
= My učíme lidi číst a psát.

06.05.2025 17:57 — 👍 1    🔁 0    💬 0    📌 0

Mne AI extrémně zajímá jako někdo, kdo je schopný dělat kvalitativní analýzu srovnatelnou s lidským kodérem. Protože v humanitních a sociálních vědách... máme velmi dobře rozpracované teorie, jak dělat obsahovou analýzu, jenom to znamená, že máš 2-3 lidi v týmu, kteří podle (manuálu) dělají kódování

06.05.2025 17:52 — 👍 0    🔁 0    💬 1    📌 0
Preview
Když se mozek učí novým věcem, tak má několikrát vyšší spotřebu energie, než když si jen tak přemýšlí, říká akademik Josef Šlerka (277) Procento Miloše Čermáka · Episode

Je strašně málo tlustejch šachistů.

(Josef Šlerka v open.spotify.com/episode/5SBB...)

06.05.2025 17:51 — 👍 1    🔁 0    💬 1    📌 0
Post image

Ok, this is crazy. Geoguessing photos with AI:
* Gemini 2.5 Pro seems to be the best (crazy how fast it is!)
* OpenAI o4-mini-high (on a screenshot) - not bad!
* OpenAI o3 - very slow
* Grok - is guessing Lisbon, Portugal (=wrong)

02.05.2025 18:48 — 👍 2    🔁 0    💬 0    📌 1
Post image

I believe if anyone showed us GPT4.5 10 years ago, we would call it AGI.

(Standa Fort is killing it at #MLPrague2025)

30.04.2025 07:21 — 👍 1    🔁 0    💬 0    📌 0
Preview
Extending Context Window of Large Language Models via Positional Interpolation We present Position Interpolation (PI) that extends the context window sizes of RoPE-based pretrained LLMs such as LLaMA models to up to 32768 with minimal fine-tuning (within 1000 steps), while demon...

I had no idea that one of LLM innovations was invented for write better erotic stories. (arxiv.org/abs/2306.15595, enabling LLMs to work with longer texts)

What you learn at #mlprague2025!

30.04.2025 07:18 — 👍 0    🔁 0    💬 0    📌 0

Jon McLoone (Wolfram Research): People are asking AI chatbot wrong questions in the same way they were asking Google wrong questions 20 years ago. It is possible they will learn to do better.

#MLPrague2025

29.04.2025 11:56 — 👍 0    🔁 0    💬 0    📌 0
Preview
Low-background steel - Wikipedia

From questions to Jon McLoone's talk at ML Prague "Lies, Damn Lies and Gen AI"

Sunken WWII shipwrecks are valuable because they are source of low-background steel (en.wikipedia.org/wiki/Low-bac...). Maybe similarly, one day we will be seeking for pre-AI texts.

#MLPrague2025

29.04.2025 09:36 — 👍 2    🔁 0    💬 1    📌 0
Preview
Stručný návod boomera na home office ve Vietnamu. Proč to není jen pro mladé a jak na tom mohou firmy vydělat Už jste někdy snili o práci z pláže na hodně exotickém místě? A co tropický home office ve Vietnamu? Jaké to je pracovat z Hanoje nebo Ho Či Minova Města a přitom vychutnávat lahodné vietnamské jídlo?...

Stručný návod na home office ve Vietnamu
(jen si sem schovávám) 🇨🇿

vikend.hn.cz/c1-67713380-... 🔒

27.04.2025 14:29 — 👍 0    🔁 0    💬 0    📌 0
Post image

When I submitted my "Bullshit AI" talk to PyConAT, I had no idea Trump would end up giving me a perfect example. It is worth trillions of dollars. Literally.

www.linkedin.com/posts/bastia...

05.04.2025 20:54 — 👍 1    🔁 0    💬 0    📌 0
Post image

Autora neznám

05.04.2025 08:32 — 👍 1    🔁 0    💬 0    📌 0
Post image

However,...

03.04.2025 06:51 — 👍 0    🔁 0    💬 0    📌 0
Post image Post image Post image Post image

Ondřej Svoboda má na LinkedInu pěkné ukázky, že 4o-image zvládne i české styly

www.linkedin.com/posts/ondrej...

02.04.2025 18:43 — 👍 2    🔁 0    💬 1    📌 0
Post image

New models in MiniCzechBenchmark 🇨🇿

Gemini 2.5 Pro 🥇 (as expected), Gemma 12B a very pleasant surprise 🫢 (you can run it in ollama on your 💻), only marginal improvements for new DeepSeek V3, Mistral Small, openai gpt-4o

github.com/simecek/Mini...

01.04.2025 05:46 — 👍 1    🔁 0    💬 0    📌 0
Post image

Despite popular belief it is not true that current LLMs can solve math. olympiad problems.

(I have a set of Czech middle school problems Klokánek, most LLMs are slightly above random chance, best thinking at ~90%)

arxiv.org/abs/2503.219...

01.04.2025 05:35 — 👍 2    🔁 0    💬 1    📌 0
Post image

Draw me picture with two schemas explaining the difference between AlphaFold2 and ESMFold

(needed extra hint)

30.03.2025 22:08 — 👍 0    🔁 0    💬 0    📌 0
Post image

My photo is attached. Try to add glasses to it (I am about to buy a new glasses and need to pick the style)

30.03.2025 21:22 — 👍 0    🔁 0    💬 1    📌 0
Post image Post image

Draw me a little map of XYZ for my future visit. Not real map, just illustration. Include top places I should visit. Please, omit typical turist traps and concentrate on cheap free hipster punk things #gpt4o

30.03.2025 21:13 — 👍 0    🔁 0    💬 2    📌 0
Post image Post image

If you have ChatGPT Plus (paid $20 version), try new image generation introduced yesterday. Seriously crazy.

This is me as Simpsons character (no special model, just prompt)

openai.com/index/introd...

26.03.2025 10:35 — 👍 0    🔁 0    💬 0    📌 0
Preview
Image Generation & Editing - a Hugging Face Space by philschmid Generate and Edit images with Gemini 2.0

LLM image editing solved by Google (model: Gemini Flash 2.0 Image Generation Experimental). Feels like magic - it understands the image on pixel level.

Try it: huggingface.co/spaces/phils...

Or just use Google AI studio with this model
aistudio.google.com

17.03.2025 15:10 — 👍 1    🔁 0    💬 1    📌 0

The poem is from GPT4.5. Too expensive to be benchmarked (cca 700 CZK) but the first model to properly rhyme in Czech.

10.03.2025 10:45 — 👍 0    🔁 0    💬 0    📌 0
Post image Post image

MiniCzechBenchmark 🇨🇿 Update including QwQ-32B, Claude 3.7, Gemini 2.0, and DeepSeek-R1!

* Gemini 2.0 Flash is cost effective, fast and great
* 🇨🇳 open models to pay attention: V3, QwQ-32B, R1
* Mistral-Small-24B-Instruct-2501 = open alternative to haiku / gpt-4o-mini

github.com/simecek/Mini...

10.03.2025 09:31 — 👍 1    🔁 0    💬 1    📌 0
Preview
Hao AI Lab on X: "We built Gaming agents to run platformers and puzzle video games in real time. Check out our demos and try our repo yourself to customize your own gaming agent! 🎮 https://t.co/OMJUHsVuIi In addition to Super Mario Bros, we also support 2048, as well as Tetris. More games are https://t.co/nvrCvKmFzX" / X We built Gaming agents to run platformers and puzzle video games in real time. Check out our demos and try our repo yourself to customize your own gaming agent! 🎮 https://t.co/OMJUHsVuIi In addition to Super Mario Bros, we also support 2048, as well as Tetris. More games are https://t.co/nvrCvKmFzX

This is seriously crazy! Promting multimodal LLM to play games IN REAL TIME

x.com/haoailab/sta...

This is the repo github.com/lmgame-org/G...

A this is prompt responsible for tetris
github.com/lmgame-org/G...

GeminiFlash is fast enough to play SuperMario (on par with Sonnet 3.5)

02.03.2025 08:33 — 👍 0    🔁 0    💬 0    📌 0

Často Olšany v prvním průchodu zamítnou (protože ona to tak úplně není pražská čtvrť, že jo). Později se vrátí a řeknou si "Aha, tak to asi budou ty Olšany!" A to mi na tom právě přijde zajímavé.

28.02.2025 20:19 — 👍 0    🔁 0    💬 0    📌 0
Post image

Test LLM šolynou. Hlubší než si myslíte. Většina thinking modelů to dá (ale ne DeepSeek-R1, ani QWEN)

28.02.2025 20:17 — 👍 1    🔁 0    💬 1    📌 0

Actually, it seems they just pushed it out because they could not make it better. There is a lot of negativity on Twitter, GPT4.5 is expensive but worse on benchmarks than Claude 3.7, Grok3 or even DeepSeek. Not a good day for OpenAI, so they put engineers into spotlight and not Sama. Typical.

28.02.2025 19:22 — 👍 1    🔁 0    💬 0    📌 0

ChatGPT4.5 is here

www.youtube.com/watch?v=cfRY...

I love that they let the model to be introduced by engineers (with all the imperfections of normal people in front of the camera)

27.02.2025 20:41 — 👍 0    🔁 0    💬 1    📌 0

@psimecek is following 20 prominent accounts