@matteodic.bsky.social
Researcher in Corpus Linguistics and Digital Humanities @ UniMoRe. Corpus and Cognitive Linguist, Python & R user. Overall nerd (posts not representative of employers). Website: https://infogrep.it Online materials: https://catlism.github.io
wow, sounds super interesting, going to get it asap
17.01.2026 12:41
The cover of Critical Infrastructure Studies and Digital Humanities with the top words in yellow and the bottom in red
Yay! The hard copies of Critical Infrastructure Studies and Digital Humanities arrived today. It was a pleasure to write a chapter in this about pirate infrastructures and shadow libraries.
17.01.2026 10:30
Network diagram with princess emojis all over it
the way you collect data determines what kinda data you collect. Who knew? (Many people). So a survey of our archive is a survey of the data we can reach - it's most often the history of people who were wealthy and connected enough to be remembered, and it's pretty white and anglophone
03.01.2026 13:29
New entries to my ZineBakery.com online catalog of free tech+culture+justice zines: DH (free maker software; bodies+datafied city; queering feminist geography; edtech surveillance; digipress); Fritton's Day-Glo Ink "Primer"; @prisonculture.bsky.social's Arrested at the Library; & Virginia Trans Joy+
15.11.2025 22:56
"crashing is not failure; crashing means we were exploring territories that were unexpected by the program."
I love this perspective.
Thanks @switchangel.bsky.social
www.youtube.com/live/mLozqDn...
I see so much of this in academic funding calls "we are looking for projects that explore how AI can help to solve … hunger, violence against women and children, poverty, etc." But there's no space in there to say: "um, what if AI is not the right tool for this"
16.09.2025 05:55
ToolFindr - Lightweight Explorer for Discovering Research Tools in Digital Humanities
12.09.2025 16:13
I have recently found "What are embeddings" by @vickiboykis.com, and I think it should become a #corpuslinguistics and #digitalhumanities must-read starting book. Plus it's free under CC BY-NC-SA!
vickiboykis.com/what_are_emb...
Extreme speech thrives in encrypted spaces, but killing encryption won't stop it, says a group of researchers who have studied the problem from multiple angles. We need context-driven governance, not backdoors, they say.
22.08.2025 19:00
wow, sounds super, thanks!
14.08.2025 17:18
"Wikipedia editors have had to deal with an onslaught of AI-generated content filled with false information and phony citations. Already, the community of Wikipedia volunteers has mobilized to fight back against AI slop"
www.theverge.com/report/75681...
5nxd redeemed, thanks
09.08.2025 14:22
8s25 redeemed, thanks a lot
09.08.2025 14:13
I love this!
08.08.2025 07:43
Screenshot of the app showing a page from a book + different views of existing and new OCR.
Many VLM-based OCR models have been released recently. Are they useful for libraries and archives?
I made a quick Space to compare VLM OCR with "traditional" OCR using 11k Scottish exam papers from @natlibscot.bsky.social
huggingface.co/spaces/davanstrien/ocr-time-capsule
rvya redeemed, thanks
26.07.2025 17:10
b83m redeemed, thanks a lot
26.07.2025 17:08
fsdc redeemed, thanks!
26.07.2025 17:07
4-panel comic. (1) [Person 1 with ponytail flanked by person with short hair and another person speaking into microphone at podium] PERSON 1: In the early 2010s, researchers found that many major scientific results couldn't be reproduced. (2) PERSON 1: Over a decade into the replication crisis, we wanted to see if today's studies have become more robust. (3) PERSON 1: Unfortunately, our replication analysis has found exactly the same problems that those 2010s researchers did. (4) [newspaper with image of speakers from previous panels] Headline: Replication Crisis Solved
Replication Crisis
xkcd.com/3117/
I believe it is worth interrogating the fundamental forces re-shaping our information spheres away from liberal democracy towards myth, manipulation and magical thinking empowering autocracy and nihilism.
Here's how it all falls apart: a 🧵 in 6 figures ⬇️
www.protagonist-science.com/p/how-social...
Stuffing AI into everything "isn't just a forecast, it's a libidinal fantasy - a capitalist dream of replacing relationships with code and scalable software, while public institutions are gutted in the name of 'innovation.'"
06.07.2025 14:30
"The problem with AI isn't that it can do your job. It can't. The problem with AI is that your MBA-brained boss's boss doesn't know how your job works and thinks AI can do your job at fractions of a penny on the dollar, and hears the siren song of 'maximize shareholder value'."
MBA-brain is real.
Is 😵‍💫 one token or two?
To a human, it's one. To a corpus tool, it's often split (😵 + 💫).
And 𝐨𝐧𝐥𝐢𝐧𝐞 ≠ online.
This preprint shows how emojis & homoglyphs challenge tokenisation and distort linguistic evidence.
arxiv.org/abs/2507.01764
#Emoji #Homoglyphs #CorpusLinguistics #AcademicSky #LangSky
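A minimal Python sketch of the splitting problem described in the post above. It is not taken from the preprint: the function names are hypothetical, the naive splitter only stands in for what some corpus tools may do, and the styled word is assumed to use mathematical bold letters.

```python
import unicodedata

ZWJ = "\u200d"  # ZERO WIDTH JOINER, the glue inside many emoji sequences

# Face with spiral eyes = dizzy face + ZWJ + dizzy symbol
FACE = "\U0001F635\u200d\U0001F4AB"
# "online" in mathematical bold letters (assumed styling, for illustration)
STYLED = "\U0001D428\U0001D427\U0001D425\U0001D422\U0001D427\U0001D41E"


def split_code_points(text):
    """Naive tokeniser: one token per code point, joiners discarded."""
    return [ch for ch in text if ch != ZWJ]


def join_zwj_sequences(text):
    """Keep code points linked by a ZWJ together as a single token."""
    tokens = []
    glue = False  # True when the previous character was a joiner
    for ch in text:
        if ch == ZWJ:
            if tokens:
                tokens[-1] += ch
                glue = True
            continue
        if glue:
            tokens[-1] += ch
            glue = False
        else:
            tokens.append(ch)
    return tokens


def fold_styled_letters(text):
    """NFKC maps compatibility variants (e.g. maths bold) to plain letters."""
    return unicodedata.normalize("NFKC", text)


print(len(split_code_points(FACE)))    # number of tokens with naive splitting
print(len(join_zwj_sequences(FACE)))   # number of tokens with ZWJ-aware joining
print(fold_styled_letters(STYLED) == "online")
```

NFKC only folds compatibility variants; look-alikes from other scripts (for example Cyrillic "о") would need a separate confusables mapping, and full grapheme segmentation involves more rules than ZWJ joining alone.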
wow, many thanks!
02.07.2025 14:11
Fellow academics, can anyone help with obtaining an #endorsement on arXiv?
I have a preprint I'd like to upload to Computer Science > Computation and Language (cs.CL), but need someone to endorse my account.
Here's the endorsement link: arxiv.org/auth/endorse...
#corpuslinguistics #linguistics
3jkl redeemed, thanks
24.06.2025 07:08
y4ha claimed, thanks!
24.06.2025 07:06
Memes can serve as strong indicators of coming mass violence
15.06.2025 18:22
Finally, a Replacement for BERT (Blog about ModernBERT)
huggingface.co/blog/modernb...