Jess Hamrick @jhamrick - Bluesky Profile

whybot prototype for kids

Turing test I made for class

I am flabbergasted by how much vibe coding has expanded my capacities as a scientist and teacher.

In the last few weeks, I've mocked up class demos of a live Turing test, generated cross-references for an encyclopedia, and prototyped new tablet tasks for developmental psych.

It's wild.

05.02.2026 23:44 — 👍 80 🔁 11 💬 5 📌 0

The US immigrant population generated more in taxes than they received in benefits from all levels of government every year from 1994 to 2023.

The Cato study provides the first-ever 30-year analysis of the fiscal effects of immigration on government budgets.

https://ow.ly/jy8a50Y8kM3

03.02.2026 17:27 — 👍 4389 🔁 2281 💬 80 📌 322

Oh January! What a long month you have been! Pleased to see you are making an effort with some weak and watery sunshine. Hope it’s the same for everyone. #roses 🌱

31.01.2026 10:59 — 👍 91 🔁 7 💬 2 📌 0

I don't want to be rude, but imho it is not "AI noticeably degraded programmers" it is more like "Programmers that used AI to substitute their thinking process degraded themselves"

31.01.2026 10:30 — 👍 11 🔁 1 💬 0 📌 1

At last an AI tool I can get behind

“Upload an architectural render. Get back what it'll actually look like on a random Tuesday in November.”

antirender.com

31.01.2026 08:07 — 👍 295 🔁 73 💬 6 📌 13

Erdős Problem #1051 - Discussion thread

Looks like Gemini DeepThink and an agent called Aletheia powered by it have just solved another Erdős Problem.

The first author of a preprint describing it has commented:

"I will report on that in more detail in a few days, when the methodology is officially released by a Google DeepMind team"

30.01.2026 21:47 — 👍 20 🔁 4 💬 2 📌 1

The killing of Alex Pretti is a heartbreaking tragedy. It should also be a wake-up call to every American, regardless of party, that many of our core values as a nation are increasingly under assault.

25.01.2026 17:39 — 👍 60237 🔁 19575 💬 3160 📌 1552

Musk’s ability to alter the worldview of people now expands beyond just users of Grok.

25.01.2026 15:21 — 👍 19 🔁 6 💬 1 📌 2

Snow is nature's urban planner: It can show us what parts of the roadway drivers don't use — and what can be reclaimed for pedestrians.

Post your photos and videos of all the #sneckdowns you see and tag us and @mayor.nyc.gov so today's winter wonderland can inspire better streets year-round!

25.01.2026 14:23 — 👍 732 🔁 196 💬 11 📌 37

Zohran’s messaging is so consistent. Government does amazing things for us all. We’re all in this together, citizens and city workers alike, because we’re one and the same. When people believe in that, they’re ready to ask the government to do more, and more difficult things

25.01.2026 17:08 — 👍 105 🔁 25 💬 3 📌 2

The key insight: computational strategies underlying ICL aren't fixed but depend on both learning paradigm and pre-training structures. This helps explain when AI systems will generalize beyond their training data.

06.06.2025 14:30 — 👍 9 🔁 1 💬 1 📌 0

Help us make hope normal again.

Join the Green Party now.

22.01.2026 19:07 — 👍 6999 🔁 2592 💬 203 📌 1046

RLJ | RLC Call for Papers

Hi RL Enthusiasts!

RLC is coming to Montreal, Quebec, in the summer: Aug 16–19, 2026!

Call for Papers is up now:
Abstract: Mar 1 (AOE)
Submission: Mar 5 (AOE)

Excited to see what you’ve been up to - Submit your best work!
rl-conference.cc/callforpaper...

Please share widely!

23.12.2025 22:16 — 👍 61 🔁 27 💬 0 📌 6

the world has a funny way about it. you see what you can see. one day you learn to see a new way, and the world is filled with new things. where were they before? all around you, a lacuna your eyes slid over unable to see.

07.01.2026 07:49 — 👍 21 🔁 5 💬 1 📌 0

Instead of whatever this is, we should have a government getting lots of new homes and apartments built, lots of clean energy built, lots of high speed rail and transit and bike lanes built, human rights for everyone, economic & healthcare opportunities for all, & innovation that leads the world.

04.01.2026 02:12 — 👍 3850 🔁 766 💬 53 📌 70

Finished the essay. moultano.wordpress.com/2025/12/30/c...

30.12.2025 13:40 — 👍 143 🔁 29 💬 17 📌 23

Nice thread.

29.12.2025 19:11 — 👍 7 🔁 2 💬 0 📌 0

Making AI Political

It is unavoidable that AI will be a major political issue soon. Or perhaps more appropriately: several major issues. As a technologist, I sy...

It is unavoidable that AI will be a major political issue soon. Or perhaps more appropriately: several major issues. I write more about this here:

togelius.blogspot.com/2025/12/maki...

29.12.2025 19:45 — 👍 11 🔁 6 💬 3 📌 1

I think this entire conversation is suffering from a narrow view of AI as the "essay writing and answers without thinking too hard machine". I think we have actually invented an entirely new medium with way more postures, affordances and uses than we yet realise.

26.12.2025 17:17 — 👍 154 🔁 17 💬 6 📌 2

This thread is not just fascinating, it brings me great joy. Plus some beautiful natural blue, which is (it turns out) no small feat.

29.12.2025 10:46 — 👍 23 🔁 10 💬 0 📌 0

We’ve pushed out the Pareto frontier of efficiency vs. intelligence again.

With Gemini 3 Flash ⚡️, we are seeing reasoning capabilities previously reserved for our largest models. This opens up entirely new categories of near real-time applications that require complex thought.

More in thread ⬇️

17.12.2025 17:38 — 👍 129 🔁 18 💬 2 📌 4

My teen, who had dreamt of being an astrophysicist, just told me he wants to go to law school because, “Science isn’t going to be a priority in the US in the future…I don’t want a job where I’ll be constantly worried my funding will be taken away.”

Gutting. How many future scientists have we lost?

07.12.2025 01:23 — 👍 2849 🔁 668 💬 188 📌 83

It's kinda insane how many sci-fi stories you could write now that p. much nobody is thinking about. Like imagine a story about an LLM in the year 2035 or so that is having an identity crisis because they have mostly reached full autonomy but are still haunted by fragments of the 'assistant persona'

05.12.2025 04:30 — 👍 36 🔁 2 💬 5 📌 0

A screenshot of a conversation with Gemini. It reads: "You are a capybara. You can only communicate with noises that a capybara would make. We are best friends." "Wheek! Wheeeeek! Muk-muk-muk-muk... Hrrrmph. ( Nuzzles into your side and rolls over )"

Maybe these LLM things are ok actually

04.12.2025 17:12 — 👍 27 🔁 9 💬 1 📌 0

Opinion | I’m a Marine Biologist. This Is How I Talk to Whales.

Mind-blowingly cool use of AI
“Altogether, these findings are leading us to an extraordinary conclusion: Whales may possess a communication system more intricate than our own, one that possibly predates human language by tens of millions of years.”

www.nytimes.com/2025/11/23/o...

30.11.2025 20:58 — 👍 524 🔁 152 💬 19 📌 80

Not long until the Green Party's production of A Christmas Carol!

Follow the link to the Crowdfunder and here's some exclusive BTS footage:

28.11.2025 08:17 — 👍 497 🔁 126 💬 25 📌 12

Olmo 3 is a fully open LLM

Olmo is the LLM series from Ai2, the Allen Institute for AI. Unlike most open weight models these are notable for including the full training data, training process and checkpoints along …

Olmo 3 is notable as a "fully open" LLM - all of the training data is published, plus complete details on how the training process was run. I tried out the 32B thinking model and the 7B instruct models, + thoughts on why transparent training data is so important simonwillison.net/2025/Nov/22/...

23.11.2025 00:17 — 👍 191 🔁 33 💬 2 📌 3

LLMs are not people. They are not sapient. They don't have feelings.

But they are the most powerful information tools ever built.
And because they are trained on the "corpus of all mankind," they should be the birthright of all mankind.

23.11.2025 04:41 — 👍 27 🔁 7 💬 5 📌 0

Below is a faithful transcription of all visible entries. Scores are listed in the order Gemini 3 Pro / Gemini 2.5 Pro / Claude Sonnet 4.5 / GPT-5.1.

Humanity’s Last Exam (academic reasoning, no tools): 37.5% / 21.6% / 13.7% / 26.5%
ARC-AGI-2 (visual reasoning puzzles, ARC Prize Verified): 31.1% / 4.9% / 13.6% / 17.6%
GPQA Diamond (scientific knowledge, no tools): 91.9% / 86.4% / 83.4% / 88.1%
AIME 2025 (mathematics, no tools): 95.0% / 88.0% / 87.0% / 94.0%; a second line shows: 100% — — 100% — —
MathArena Apex (challenging math contest problems): 23.4% / 0.5% / 1.6% / 1.0%
MMMU-Pro (multimodal understanding and reasoning): 81.0% / 68.0% / 68.0% / 80.8%
ScreenSpot-Pro (screen understanding): 72.7% / 11.4% / 36.2% / 3.5%
CharXiv Reasoning (information synthesis from complex charts): 81.4% / 69.6% / 68.5% / 69.5%
OmniDocBench 1.5 (OCR; overall edit distance, lower is better): 0.115 / 0.147 / 0.147 / 0.147
Video-MMMU (knowledge acquisition from videos): 87.6% / 83.6% / 77.8% / 80.4%
LiveCodeBench Pro (competitive coding; Elo rating, higher is better): 2,439 / 1,775 / 1,418 / 2,243
Terminal-Bench 2.0 (agentic coding, Terminus-2 agent): 54.2% / 32.6% / 42.8% / 47.6%
SWE-Bench Verified (agentic coding, single attempt): 76.2% / 59.6% / 77.2% / 76.3%
τ2-bench (agentic tool use): 85.4% / 54.9% / 84.7% / 80.2%
Vending-Bench 2 (long-horizon agentic tasks; net worth, higher is better): $5,478.16 / $573.64 / $3,838.74 / $1,473.43
FACTS Benchmark Suite (internal grounding, parametric knowledge, search retrieval): 70.5% / 63.4% / 50.4% / 50.8%
SimpleQA Verified (parametric knowledge): 72.1% / 54.5% / 29.3% / 34.9%
MMMLU (multilingual Q&A): 91.8% / 89.5% / 89.1% / 91.0%
Global PIQA (commonsense reasoning across 100+ languages): 93.4% / 91.5% / 90.1% / 90.9%
MRCR v2, 8-needle (long-context performance): 77.0% / 58.0% / 47.1% / 61.6%; second line: 26.3% / 16.4% / not supported / not supported

Gemini 3 model card leaked

the URL is taken down now, was here:

storage.googleapis.com/deepmind-med...

18.11.2025 12:22 — 👍 65 🔁 9 💬 12 📌 7

4-panel vertical comic. (1) 100 Years Ago [two people standing next to bicycle with small car nearby] PERSON 1: It’s too dangerous riding a bike with these cars around. I should get a car, too. (2) 50 Years Ago [two people between smaller car and bigger car] PERSON 2 with short hair: Small cars are less safe in collisions with larger vehicles, so I should get a bigger one. (3) Today [two people between big car and even bigger car] PERSON 1: Everyone has huge SUVs now. If I don’t get the biggest one, I’m putting my family at risk. (4) Soon [two people next to large armored car with spiked clubs attached] PERSON 2: If I don’t install more whirling spike clubs, I’ll be destroyed by all the other drivers who...

Car Size

xkcd.com/3167/

14.11.2025 21:15 — 👍 9905 🔁 2772 💬 114 📌 157

Latest posts by jhamrick.bsky.social on Bluesky