Ben Brumfield's Avatar

Ben Brumfield

@benwbrum.bsky.social

Open Source #DigitalHumanities software engineer. Founder of FromThePage.com, a platform for collaborative #manuscript #transcription to engage the public in #archives and create digital scholarly editions.

1,333 Followers  |  1,718 Following  |  147 Posts  |  Joined: 22.02.2024  |  2.5255

Latest posts by benwbrum.bsky.social on Bluesky

We'll try to write up some documentation once the feature settles down; there are still some more changes we want to make, possibly before tomorrow.

(The webinar will be recorded and sent to anyone who signs up, though.)

09.12.2025 14:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Introducing Gemini 3.0 Support in FromThePage - FromThePage Blog When Ben sent meΒ Mark Humphries’ report on testing a new, unreleased Gemini model, I got scared. And excited. Mark is a historian and digital humanist who’s gone deep on analyzing AI tools for textual...

We wrote this two weeks ago, but it's sadly outdated since it doesn't include any of the accuracy statistics, the reasoning display, or the tuning we've done to the prompt since then:

content.fromthepage.com/introducing-...

I'll try to find a link to a live page.

09.12.2025 14:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"From a transcribers point of view I will be very unlikely now to continue devoting time to working on straightforward handwritten documents without an AI draft as a starting point ."

She plans to attend our webinar this Thursday and might be willing to talk about her experience.

09.12.2025 14:43 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

An update: after a week of testing, the same volunteer is now very happy:
"I have been very pleased to use the AI facility in transcribing Nicholas Piper Log books for Whitby Literary & Philosophical Society."
...

09.12.2025 14:43 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Introducing Gemini 3.0 Support in FromThePage (December 11, 2025) - FromThePage Blog

Webinar link: content.fromthepage.com/dec-2025-web...

09.12.2025 14:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"From a transcribers point of view I will be very unlikely now to continue devoting time to working on straightforward handwritten documents without an AI draft as a starting point ."

She plans to attend our webinar this Thursday and might be willing to talk about her experience.

09.12.2025 14:43 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

An update: after a week of testing, the same volunteer is now very happy:
"I have been very pleased to use the AI facility in transcribing Nicholas Piper Log books for Whitby Literary & Philosophical Society."
...

09.12.2025 14:43 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Screenshot showing "3,008,092 Transcribed Pages"

Screenshot showing "3,008,092 Transcribed Pages"

It looks like we passed another milestone on FromThePage last week:

08.12.2025 14:28 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

But it really feels like libraries & archives as a field suddenly just went from "we aren't generally attempting to do automated handwriting recognition because it's at the edge of what's possible" to "oh boy now we have another doable but labor-intensive collections enhancement task on the backlog"

03.12.2025 15:03 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I was doing some experiments this morning as we fine-tune our prompts and am astonished to say that while I've seen many errors produced by Gemini, none of them were the seductively plausible hallucinations that have made me regard MMLLMs as potentially poisonous.

03.12.2025 15:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We're working on a pipeline for extracting structured data from industrial drawings into structured data fields. More in January, but so far we're very impressed with LLMs for this purpose.

02.12.2025 17:12 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Wow.

26.11.2025 23:17 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

People who like to transcribe tend to be hands-on types, or puzzle solvers, or people who read between the lines. Transcribing is a way of thinking. It is not for every project, but for some projects, it can be crucial.

26.11.2025 15:40 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

As someone with FtP projects, I think there are still several good reasons for people to choose transcribing. The goal is not necessarily to record the words, but to find meaning. Sometimes that comes from reading the words, other times it comes from closely reading each mark.+

26.11.2025 15:38 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

I have never felt that AI is a threat to transcription projects. Transcription is such a fulfilling experience and the words run through you in ways that reading alone can never do. Meeting up with scribes of the past will always be thrilling.

26.11.2025 14:09 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

the projects that are successful.

That said, I don't relish saying, "Stop doing [thing you enjoy] and instead do [totally different thing]."

26.11.2025 13:44 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It's not yet clear that we'll need to do so, since we still need to do our own evaluation of Gemini's capabilities and weaknesses. Our hope is that it will open up collections that were unsuitable for crowdsourcing (or at institutions with no public mandate/volunteer expertise) without replacing(+)

26.11.2025 13:44 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

There may be some non-engagement follow-on benefits as well. I have had two volunteers write that transcribing helps their anxiety, which I can certainly see.

25.11.2025 21:18 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

"I have very much enjoyed the opportunity of working on historical material over the past 5 years but today feel dismayed that I may now be wasting my time in continuing."

25.11.2025 20:58 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Sure enough, we got this email from a volunteer on Friday after we announced Gemini integration:

"As a long-term transcriber on From the page I am now wondering about the implications of Gemini 3 (AI) for me - I am feeling particularly discouraged today.

25.11.2025 20:58 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

Last week, we introduced support for #Gemini3 into our collaborative #transcription platform FromThePage. We are still experimenting with prompts and outputs, but we think that this may be the first #LLM we've seen that does not introduce seductively plausible errors into historic documents.

24.11.2025 13:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Introducing Gemini 3.0 Support in FromThePage - FromThePage Blog When Ben sent meΒ Mark Humphries’ report on testing a new, unreleased Gemini model, I got scared. And excited. Mark is a historian and digital humanist who’s gone deep on analyzing AI tools for textual...

Introducing #Gemini 3 Support in #FromThePage content.fromthepage.com/introducing-...

We're still developing capabilities and guardrails now, but plan to present it all at a webinar December 11.

24.11.2025 13:54 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

One library and one scholarly edition are testing it out so far, and we hope to learn from them as well as from our own experiments.

We'll be writing more about our experiments and findings about accuracy, plausibility, and user experience over the next week or so.

19.11.2025 15:48 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Yesterday, Google released Gemini 3, which has gotten really interesting reviews from Mark Humphries: generativehistory.substack.com/p/the-sugar-...

Also yesterday, we shipped an integration between FromThePage and Gemini, allowing transcribers the option of starting with an AI draft.

19.11.2025 15:48 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Is That Transcription Really Human? - FromThePage Blog Last month, someone asked this question on theΒ Genealogy and AIΒ Facebook group:If volunteers use AI to transcribe documents, is that OK? I have strong opinions, but want to explain them. First off, th...

Should a volunteer use #AI to help them transcribe pages for a #crowdsourcing project? That question got me thinking about why, exactly, my answer is "no" and what kinds of purposes different transcriptions may be used for.

content.fromthepage.com/can-voluntee...

23.10.2025 17:25 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
The Legacy of India’s Quest to Sterilize Millions of Men Breaking the Cycle: Part 1 In 1976, men across India were drastically changing their behavior. Some were abandoning the beds inside their homes to sleep in fields; others were skipping major festivals...

This one made a big impression on me: pulitzercenter.org/stories/lega...

However this might be a bit closer to the time-frame you're looking for: www.theatlantic.com/magazine/arc...

15.09.2025 16:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I'd love to hear how and whether other developers in #digitalhumanities or libraries and archives have done similar experiments and evaluations.

For now, we're exercising a lot more discipline with these tools to keep from wasting our time on shiny new things.

15.09.2025 15:53 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

3. Pitching in on specific tasks that we don't have enough skills to do as well ourselves, like translating messages for our recurring "Werk/Arbeit;obra/trabajo" problem or tweaking our UI for A11Y issues (which it does well with).

15.09.2025 15:51 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

2. Helping refactor or isolate our legacy test suite. Asking an agent to isolate a single test at a time might finally get us out of dependency hell in our test suite.

15.09.2025 15:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

1. Fixing small user-reported/developer-noticed problems so that we don't have to interrupt developers in other effort. This lets us fix bugs on a managers schedule instead of a maker's schedule (cf. Paul Graham) . (Sara and I spend most of our days on a managers schedule, unfortunately)

15.09.2025 15:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@benwbrum is following 20 prominent accounts