Find out more about the AI newsroom workflow course at its awful sales-y site, and feel free to shoot me any questions you might have!
littlecolumns.com/courses/ai-n...
@dangerscarf.bsky.social
Find out more about the AI newsroom workflow course at its awful sales-y site, and feel free to shoot me any questions you might have!
littlecolumns.com/courses/ai-n...
The course itself is six weeks long, and while it does cost money (which is crazy strange for me!), there are steep geographic pricing discounts and coupon codes for close readers of the course site.
30.10.2025 20:47 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0It's maybe like 35% a tech course, and a lot of the theory is stuff that seems simple once you've heard it: see what goes wrong, fix it, track it. That's it!
Yes, we'll learn automation tools like n8n/ActivePieces and eval suites like Opik/Arize Phoenix, buuut they're just one part
This course is going to solve every step of those crises. How do you...
- set up an AI pipeline?
- measure if it's working?
- iterate and improve it?
- make sure you're solving a reader/reporter problem instead of just playing tech games?
It isn't magic! It's easy!!!!
I'm running a six-week course in November on building and evaluating AI newsroom workflows!
It's targeted at people who don't know where to start, or who build little prototypes and end up stumped about making them production-ready.
littlecolumns.com/courses/ai-n...
a three-column table with the middle column highlighted
three columns being restructured into a vertical flow
tables being selected irrespective of their columns
the eventual pandas df
Natural PDF v0.1.13 out โ a handful of useful changes but my favorite is๐ผpage restructuring support!
Grab sections and "flow" them together vertically or horizontally, making multi-column extraction infinitely easier than 24 hours ago.
Details at jsoma.github.io/natural-pdf/...
it looks like someone has been going very hard on scans
ONE MORE DAY OF ACCEPTING BAD PDF SUBMISSIONS
you could have won EVERY CATEGORY
14.05.2025 20:21 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Woke up to ton of new non-English BAD PDF CONTEST submissions: ๐ฅ Serbian! Romanian! Chinese! ๐ฅ
Mostly not scans, though, so I predict they'll easy-peasy to extract the info from. I want to have to train a custom OCR model!!! Someone submit a big scanned non-English PDF!!!
i know you all are hiding worse scans from me
11.05.2025 13:33 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0screenshot of a spreadsheet with very tiny text
i love this giant-pdf-with-tiny-text submission, we need a smallest font size category
08.05.2025 15:24 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0I am running a contest. It is about bad pdfs.
It can make you independently wealthy (for immeasurably small measures of independent wealth)
badpdfs.com
Live colab demo/walkthrough here: colab.research.google.com/github/jsoma...
03.04.2025 16:16 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0a screenshot of natural PDF documentation
New release of ๐ Natural PDF ๐
A million and one table extraction/document layout/Q&A/quality of life improvements for all your PDF-processing needs
jsoma.github.io/natural-pdf/
the law clinic repping this student, CLEAR, is based out of CUNY.....once again the public city university absolutely flounces the ivy league when it comes to having a backbone and standing on actual principles
24.03.2025 23:16 โ ๐ 1575 ๐ 364 ๐ฌ 15 ๐ 22Thank you โ if only we could get a fix for the bug that prevents it from working 100%!
14.03.2025 03:16 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0