Peter Bull

@peter.drivendata.org

Co-founder DrivenData. Celebrating a decade of data for good. ML challenges | https://www.drivendata.org/ Data projects | https://drivendata.co/ Open source | https://github.com/pjbull

63 Followers  |  109 Following  |  37 Posts  |  Joined: 25.10.2023  |  1.5172

Latest posts by peter.drivendata.org on Bluesky

Enthusiastic to build on this generation of earth observation foundation embeddings like DeepMind's AlphaEarth (and more)! We already see some promising crop type (cereals vs. orchards) results and are exploring other use cases in climate resilience. deepmind.google/discover/blo...

08.08.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
File Browser - marimo The next generation of Python notebooks

Very cool to see that marimo supports our cloudpathlib library for their file browser UI! Browse your S3, GCS, Azure buckets from your notebooks! docs.marimo.io/api/inputs/f...

01.08.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

✨ 📦 ✨ Just released a new Cookiecutter Data Science version with support for pixi and poetry as environment managers! Some of our top requested features ever. Upgrade and check it out now.

cookiecutter-data-science.drivendata.org

25.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

Now getting organic inbound for www.zambacloud.com, our wildlife imagery processing platform, from ChatGPT! 😲

18.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

Just in case you thought speech-to-text worked for children, the third column is what Whisper does. Somehow in the third example it accesses my inner monologue... I guess that's why we're excited about our upcoming challenge! kidsasr.drivendata.org

16.07.2025 22:27 - 👍 1    🔁 0    💬 0    📌 0

How are people managing code review for their AI coding agents? I do a first glance and it is obviously bad (e.g., didn't refactor repeated code), and now I've got half a dozen AI diffs for things that aren't good enough cluttering up my todo list with things to respond to....

14.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
Time is On My Side: Dynamics of Talk-Time Sharing in Video-chat Conversations An intrinsic aspect of every conversation is the way talk-time is shared between multiple speakers. Conversations can be balanced, with each speaker claiming a similar amount of talk-time, or…

New research based on the CANDOR corpus shows that people enjoy conversations where they alternate longer turns better than short turns or one person dominating. Cool!

arxiv.org/html/2506.20...

11.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

The best shortcut to how many experienced software engineers feel about AI is listening to the Primeagen's takes. Balanced perspectives on what's actually new, determinism, security, system complexity, what's promising, and what's not www.youtube.com/watch?v=vDWa...

09.07.2025 22:27 - 👍 0    🔁 0    💬 0    📌 0
Maldito ChatGPT Camilo · Maldito ChatGPT · Song · 2025

"Damn ChatGPT" your new summer jam about using ChatGPT as a therapist open.spotify.com/track/4umq06... (edited)

07.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
How Long Contexts Fail Taking care of your context is the key to building successful agents. Just because there's a 1 million token context window doesn't mean you should fill it.

Great article on the challenges of only surfacing the right info to LLMs and editing down what is not needed. If you've used a coding copilot or agent, you've seen this first hand many times. Output iterations are often polluted with code that came before.

www.dbreunig.com/2025/06/22/h...

04.07.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

BioCLIP2 looks like a stellar improvement! I'm excited to think about integrating it into Zamba for open-ended classification tasks run at scale on camera trap imagery. Definitely the potential to dramatically improve CT image utility. imageomics.github.io/bioclip-2/

30.06.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
Inside the AI Prompts DOGE Used to "Munch" Contracts Related to Veterans' Health Experts who reviewed the code for ProPublica found numerous and troubling flaws in the system, providing a disturbing glimpse into how the Trump administration is allowing artificial intelligence to…

"Munchable" is GenZ cringe. www.propublica.org/article/insi...

27.06.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

We've built so many low-fidelity prototypes in our HCD work. IMO vibecoding changes the feel of those prototypes, but doesn't change the process. Ask any designer: they'll tell you high-fidelity first iterations are often more distracting to clients than helpful.

www.semafor.com/article/06/0...

25.06.2025 22:27 - 👍 0    🔁 0    💬 0    📌 0

Check out this LLM circuit trace for the text: "The statement 'this statement is false' is." It goes through a logical contradictions node, but still outputs either "true" or "false" with the highest probabilities... www.anthropic.com/research/ope...

23.06.2025 18:36 - 👍 1    🔁 0    💬 0    📌 0

A new preprint shows anonymization techniques for voices make transcription accuracy substantially worse for children versus adults. This is going to be a big challenge as we work on ASR for educational settings where we emphatically need both privacy and accuracy. arxiv.org/pdf/2506.00100

20.06.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
30 minutes with a stranger Watch hundreds of strangers talk for 30 minutes, and track how their moods change

😍 Incredible data storytelling about the power of conversation and human connection. Worth a read for good vibes! Based on the CANDOR corpus that we worked on. pudding.cool/2025/06/hell...

13.06.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0

The gap between LLM prototype and production strikes again... in the worst possible place. www.propublica.org/article/trum...

06.06.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
The Sentinel-3 optical image shows a dense, orange plume of Saharan sand over approximately 150 000 sq km of the eastern Atlantic Ocean. The small islands of Cabo Verde peek out from beneath the clouds in the top left corner.

📷 This week's @esaearth.esa.int #EarthFromSpace is a #Copernicus Sentinel-3 visible image of a thick plume of orange dust from the Sahara Desert over approximately 150 000 sq km of the eastern Atlantic Ocean on 7 May 2025 🧪🌍

www.esa.int/ESA_Multimed...

06.06.2025 10:44 - 👍 301    🔁 40    💬 8    📌 5

Very cool to see the multimodal conversation CANDOR dataset that we worked on used for a new paper on conversational agents! Gets agent feedback/training loops closer to the 7-38-55 rule than text only arxiv.org/abs/2505.15922

30.05.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
Zamba Cloud

You don't need to be a coder to use AI for wildlife research! 💻➡️🚫 With Zamba Cloud's new image support, simply upload photos, get species IDs, and even train custom models, all without writing a single line of code. www.zambacloud.com #WildlifeResearch

29.05.2025 11:31 - 👍 1    🔁 2    💬 0    📌 0

A nurse shark takes a leisurely stroll through Coral City against a backdrop of thriving staghorn, elkhorn, brain, finger, and star corals #nurseshark #sharksofcoralcity #shark #leisurelystroll #elkhorn #staghorn #braincoral #coral #coralcitycamera #miami #portmiami #biscaynebay #coralcity

29.05.2025 16:13 - 👍 4885    🔁 511    💬 42    📌 23

What about the fact-checked news content?

29.05.2025 22:03 - 👍 0    🔁 0    💬 0    📌 0
Geospatial Foundational Disappointments TLDR; After 10²¹ FLOPs and 500 B patches, IBM's TerraMind beats a supervised U-Net by just +2 mIoU on PANGAEA; losing on 5/9 tasks, most other GFMs do worse.

Foundation models for geospatial aren't there yet. This piece argues that, unlike with language, the information density of labeled data and the pretraining tasks aren't relevant enough. Worth a read if you do any geo AI. christopherren.substack.com/p/geospatial...

28.05.2025 22:27 - 👍 0    🔁 0    💬 0    📌 0

People keep plugging AI "Co-Scientists," so what happens when you ask them to do an important task like finding errors in papers?

We built SPOT, a dataset of STEM manuscripts across 10 fields annotated with real errors to find out.

(tl;dr not even close to usable) #NLProc

arxiv.org/abs/2505.11855

23.05.2025 16:21 - 👍 120    🔁 31    💬 4    📌 2
Microsoft is opening its on-device AI models up to web apps in Edge Edge continues to compete with Chrome.

This is a huge announcement for privacy-focused developers that don't want to do API calls, but the caniuse.com for built-in LLMs across browsers is going to be a shitshow. www.theverge.com/news/669528/...

23.05.2025 18:36 - 👍 0    🔁 0    💬 0    📌 0
Nation Can't Believe It On Harvard's Side

22.05.2025 20:35 - 👍 16384    🔁 2002    💬 150    📌 119
Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google's first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. …

I got access to Gemini Diffusion, Google's first diffusion LLM, and the thing is absurdly fast - it ran at 857 tokens/second and built me a prototype chat interface in just a couple of seconds, video here: simonwillison.net/2025/May/21/...

21.05.2025 21:45 - 👍 137    🔁 18    💬 3    📌 2
How changing 'localhost' to '127.0.0.1' sped up my test suite by 1,800% I would like to share a (somewhat recent) anecdote on how a one-line code change improved my understanding of software engineering.

PSA to test suites with HTTP servers on Windows: localhost can be 100x slower than 127.0.0.1! 🤯 I hit the issue on a new cloudpathlib feature for HTTP that's coming soon github.com/drivendataor... Nearly impossible to debug. More background here medium.com/hackernoon/h...

21.05.2025 22:27 - 👍 0    🔁 0    💬 0    📌 0
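
A commonly cited cause of a localhost vs. 127.0.0.1 gap is that the name `localhost` can resolve to the IPv6 address `::1` before `127.0.0.1`, so a client may first try an address the test server is not listening on. The post above does not say that this was the cause here, so treat that as an assumption. A minimal sketch for checking what each name resolves to on a given machine (`resolved_addresses` is a hypothetical helper for illustration, not part of cloudpathlib):

```python
import socket


def resolved_addresses(host, port=80):
    """Return unique (family, address) pairs for `host`, in resolver order.

    Hypothetical helper for illustration only. Returns an empty list if
    the name does not resolve on this machine.
    """
    try:
        infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror:
        return []
    seen = []
    for family, _socktype, _proto, _canonname, sockaddr in infos:
        # family is normally an enum such as AF_INET or AF_INET6
        entry = (getattr(family, "name", str(family)), sockaddr[0])
        if entry not in seen:
            seen.append(entry)
    return seen


if __name__ == "__main__":
    for host in ("localhost", "127.0.0.1"):
        print(host, "->", resolved_addresses(host))
```

If `localhost` lists an `AF_INET6` entry first while the server under test binds only to `127.0.0.1`, pointing the test client at the literal IPv4 address avoids the extra connection attempt.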

I say this a lot, but the narrative that AI use is going to collapse due to data limits or costs or environmental factors or regulation or a "hype bubble" popping or whatever is not a useful position for critics.

Capability development may slow (it hasn't done so yet), but AI use isn't going away.

20.05.2025 15:53 - 👍 86    🔁 12    💬 5    📌 2
Nation & World - The Seattle Times My Account The T. Rex may have been a lot smarter than you thought Jan. 9, 2023 at 7:36 am | Updated Jan. 9, 2023 at 7:36 am By DINO GRANDONI The Washington Post

typical corrupt science

17.05.2025 22:20 - 👍 15904    🔁 2379    💬 165    📌 107

@peter.drivendata.org is following 20 prominent accounts