Matteo Di Cristofaro @matteodic

wow, sounds super interesting, going to get it asap

17.01.2026 12:41 — 👍 1 🔁 0 💬 0 📌 0

The cover of Critical Infrastructure Studies and Digital Humanities with the top words in yellow and the bottom in red

Yay! The hard copies of Critical Infrastructure Studies and Digital Humanities arrived today. It was a pleasure to write a chapter in this about pirate infrastructures and shadow libraries.

17.01.2026 10:30 — 👍 82 🔁 9 💬 2 📌 0

Network diagram with princess emojis all Over it

the way you collect data determines what kinda data you collect. Who knew? (Many people). So a survey of our archive is a survey of the data we can reach — it’s most often the history of people who were wealthy and connected enough to be remembered, and it’s pretty white and anglophone

03.01.2026 13:29 — 👍 14 🔁 2 💬 1 📌 2

New entries to my ZineBakery.com online catalog of free tech+culture+justice zines: DH (free maker software; bodies+datafied city; queering feminist geography; edtech surveillance; digipress); Fritton's Day-Glo Ink "Primer"; @prisonculture.bsky.social's Arrested at the Library; & Virginia Trans Joy+

15.11.2025 22:56 — 👍 38 🔁 18 💬 1 📌 0

Coding Rave Music Part ???? YouTube video by Switch Angel

"crashing is not failure; crashing means we were exploring territories that were unexpected by the program."

I love this perspective.
Thanks @switchangel.bsky.social

www.youtube.com/live/mLozqDn...

05.11.2025 14:28 — 👍 1 🔁 0 💬 0 📌 0

I see so much of this in academic funding calls ‘we are looking for projects that explore how AI can help to solve … hunger, violence against women and children, poverty, etc.’ But there’s no space in there to say: ‘um, what if AI is not the right tool for this’

16.09.2025 05:55 — 👍 78 🔁 24 💬 3 📌 0

ToolFindr - Lightweight Explorer for Discovering Research Tools in Digital Humanities [AI Summary]: ToolFindr is an open, community-curated platform for discovering Digital Humanities research tools, built on the Tool Registry Framework and integrating data from Wikidata and the...

📖 ToolFindr - Lightweight Explorer for Discovering Research Tools in Digital Humanities

12.09.2025 16:13 — 👍 2 🔁 1 💬 0 📌 0

What are embeddings? A deep-dive into machine learning embeddings.

I have recently found "What are embeddings" by @vickiboykis.com, and I think it should become a #corpuslinguistics and #digitalhumanities must-read starting book. Plus it's free under CC by-nc-sa!

vickiboykis.com/what_are_emb...

02.09.2025 11:45 — 👍 6 🔁 0 💬 1 📌 0

Policy Directions on Encrypted Messaging and Extreme Speech | TechPolicy.Press Encryption, disinformation, and democracy: rethinking policy for messaging apps with rights-based safeguards.

Extreme speech thrives in encrypted spaces, but killing encryption won’t stop it, says a group of researchers who have studied the problem from multiple angles. We need context-driven governance, not backdoors, they say.

22.08.2025 19:00 — 👍 5 🔁 3 💬 0 📌 0

wow, sounds super, thanks!

14.08.2025 17:18 — 👍 0 🔁 0 💬 0 📌 0

How Wikipedia is fighting AI slop content Wikipedians are wading through the muck.

“Wikipedia editors have had to deal with an onslaught of AI-generated content filled with false information and phony citations. Already, the community of Wikipedia volunteers has mobilized to fight back against AI slop”

www.theverge.com/report/75681...

10.08.2025 16:33 — 👍 153 🔁 58 💬 3 📌 4

5nxd redeemed, thanks

09.08.2025 14:22 — 👍 0 🔁 0 💬 0 📌 0

8s25 redeemed,thanks a lot

09.08.2025 14:13 — 👍 1 🔁 0 💬 0 📌 0

I love this!

08.08.2025 07:43 — 👍 5 🔁 1 💬 0 📌 0

Screenshot of the app showing a page from a book + different views of existing and new ocr.

Many VLM-based OCR models have been released recently. Are they useful for libraries and archives?

I made a quick Space to compare VLM OCR with "traditional" OCR using 11k Scottish exam papers from @natlibscot.bsky.social

huggingface.co/spaces/davanstrien/ocr-time-capsule

01.08.2025 15:09 — 👍 47 🔁 15 💬 4 📌 1

rvya redeemed, thanks

26.07.2025 17:10 — 👍 0 🔁 0 💬 0 📌 0

b83m redeemed, thanks a lot

26.07.2025 17:08 — 👍 1 🔁 0 💬 0 📌 0

fsdc redeemed, thanks!

26.07.2025 17:07 — 👍 1 🔁 0 💬 0 📌 0

4-panel comic. (1) [Person 1 with ponytail flanked by person with short hair and another person speaking into microphone at podium] PERSON 1: In the early 2010s, researchers found that many major scientific results couldn’t be reproduced. (2) PERSON 1: Over a decade into the replication crisis, we wanted to see if today’s studies have become more robust. (3) PERSON 1: Unfortunately, our replication analysis has found exactly the same problems that those 2010s researchers did. (4) [newspaper with image of speakers from previous panels] Headline: Replication Crisis Solved

Replication Crisis

xkcd.com/3117/

21.07.2025 23:54 — 👍 4884 🔁 659 💬 28 📌 31

How social media destroys democratic discourse, explained in 6 easy figures Where we all went wrong

I believe it is worth interrogating the fundamental forces re-shaping our information spheres away from liberal democracy towards myth, manipulation and magical thinking empowering autocracy and nihilism.

Here’s how it all falls apart—a 🧵 in 6 figures ⬇️
www.protagonist-science.com/p/how-social...

11.07.2025 14:18 — 👍 26 🔁 17 💬 2 📌 5

Regulating AI Isn’t Enough. Let’s Dismantle the Logic That Put It in Schools. AI in schools isn’t progress — it’s a sign of how far we’ve strayed from the purpose of education.

Stuffing ai into everything “isn’t just a forecast, it’s a libidinal fantasy — a capitalist dream of replacing relationships with code and scalable software, while public institutions are gutted in the name of ‘innovation.’”

06.07.2025 14:30 — 👍 177 🔁 51 💬 3 📌 6

Companies That Tried to Save Money With AI Are Now Spending a Fortune Hiring People to Fix Its Mistakes Companies that rushed to replace human labor with AI are now shelling out to have IRL workers to fix the technology's screwups.

🤷🏿‍♂️

06.07.2025 12:06 — 👍 2147 🔁 658 💬 68 📌 354

"The problem with AI isn't that it can do your job. It can't. The problem with AI is that your MBA-brained boss's boss doesn't know how your job works and thinks AI can do your job at fractions of a penny on the dollar, and hears the siren song of 'maximize shareholder value'."

MBA-brain is real.

03.07.2025 06:57 — 👍 7622 🔁 3172 💬 139 📌 185

Data interference: emojis, homoglyphs, and issues of data fidelity in corpora and their results Tokenisation - "the process of splitting text into atomic parts" (Brezina & Timperley, 2017: 1) - is a crucial step for corpus linguistics, as it provides the basis for any applicable quantitative met...

Is 😵‍💫 one token or two?
To a human, it's one. To a corpus tool, it’s often split (😵 + 💫).
And 𝙊𝙉𝙇𝙄𝙉𝙀 ≠ online.
This preprint shows how emojis & homoglyphs challenge tokenisation and distort linguistic evidence.
🔍 arxiv.org/abs/2507.01764

#Emoji #Homoglyphs #CorpusLinguistics #AcademicSky #LangSky

03.07.2025 07:32 — 👍 12 🔁 1 💬 1 📌 0

wow, many thanks!

02.07.2025 14:11 — 👍 1 🔁 0 💬 0 📌 0

arXiv user login

Fellow academics, can anyone help with obtaining an #endorsement on arXiv?
I have a preprint I'd like to upload to Computer Science > Computation and Language (cs.CL), but need someone to endorse my account.
Here's the endorsement link: arxiv.org/auth/endorse...

#corpuslinguistics #linguistics

02.07.2025 14:06 — 👍 0 🔁 0 💬 1 📌 0

3jkl redeemed, thanks

24.06.2025 07:08 — 👍 1 🔁 0 💬 1 📌 0

y4ha claimed, thanks!

24.06.2025 07:06 — 👍 1 🔁 0 💬 1 📌 0

Memes can serve as strong indicators of coming mass violence A new study finds that surges in visual propaganda—like memes and doctored images—often precede political violence. By combining AI with expert analysis, researchers tracked manipulated content leading up to Russia’s invasion of Ukraine, revealing early warning signs of instability.

Memes can serve as strong indicators of coming mass violence

15.06.2025 18:22 — 👍 2 🔁 1 💬 0 📌 1

Finally, a Replacement for BERT: Introducing ModernBERT We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Finally, a Replacement for BERT (Blog about ModernBert)

huggingface.co/blog/modernb...

08.06.2025 10:17 — 👍 1 🔁 1 💬 0 📌 0

Matteo Di Cristofaro

Latest posts by matteodic.bsky.social on Bluesky

@matteodic is following 20 prominent accounts