A broader vision of open research is needed to include the arts, humanities and social sciences.
@jenniad.bsky.social @mirandab-oa.bsky.social @samuelmoore.org & @stephenpinfield.bsky.social for @lseimpactblog.bsky.social 👇
This is a dramatic update to the NIH Data Management and Sharing Plan template, effective for applications submitted for due dates on or after May 25, 2026.
grants.nih.gov/grants/guide...
Props to @scott.hanselman.com for naming the elephant in the room in big tech.
AI coding tools boost senior engineers but create challenges for juniors who lack context to judge outputs. Cutting junior hiring for near-term gains weakens the future pipeline. We’ll need apprenticeship-style paths.
Happy Twin Peaks day to all who celebrate!
What do LLMs see?
I wrote a lil' tool that extracts the attention matrices out of open models and creates this typing visual, with each token's opacity changing according to its average attention score as the prompt progresses. Dimmer words are considered less important to the model.
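A rough sketch of the idea, not the tool's actual code: it assumes you already have one causal attention matrix averaged over layers and heads, where `attn[i][j]` is how much token `i` attends to earlier token `j`. The function name `opacity_frames`, the `floor` parameter, and the toy matrix are all made up for illustration.

```python
def opacity_frames(attn, floor=0.15):
    """Return, for each step t, an opacity in [floor, 1] for tokens 0..t.

    attn[i][j] is the attention token i pays to token j (j <= i).
    This layout is an assumption, not the original tool's format.
    """
    frames = []
    for t in range(len(attn)):
        # Average attention each visible token j has received from tokens j..t.
        scores = [
            sum(attn[i][j] for i in range(j, t + 1)) / (t - j + 1)
            for j in range(t + 1)
        ]
        peak = max(scores) or 1.0  # avoid dividing by zero on an all-zero column
        # Scale so the most-attended token is fully opaque and the rest dim.
        frames.append([floor + (1 - floor) * s / peak for s in scores])
    return frames


# Toy 3-token attention matrix (hypothetical values; each row sums to 1).
toy_attn = [
    [1.0, 0.0, 0.0],
    [0.7, 0.3, 0.0],
    [0.6, 0.1, 0.3],
]
frames = opacity_frames(toy_attn)
for step, frame in enumerate(frames):
    print(step, [round(o, 2) for o in frame])
```

Each frame would drive one step of the typing animation: render tokens 0..t with the listed opacities, then move on to the next token.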
Key to efficient learning is realizing how we ACTUALLY learn, not just what FEELS like learning. I wrote a Claude Skill for some friends to help them think about this and they've liked it -- see Principles for some directions you could explore
github.com/DrCatHicks/l...
RDA-US is launching a funded program for US-based professionals working in/with research infrastructure. Great professional development and networking opportunity...and great way for newcomers to engage with the Research Data Alliance (RDA)! rda-us.org/announcing-t...
"Whenever I worry about where the Internet is headed, I remember that this example of the collective generosity and goodness of people still exists." anildash.com/2026/01/15/w...
“Pay-to-crawl refers to emerging technical systems used by websites to automate compensation for when their digital content—such as text, images, and structured data—is accessed by machines.” @creativecommons.bsky.social
Found the coolest website that takes random found cassette tapes people submit and digitizes them. I’m listening to an NYC hip hop station from 1994: intertapes.net
First full moon of the year.
"Does the open science movement—the push to make research outputs such as articles, data, and software free to read and reuse—produce the benefits its supporters claim, such as accelerating discovery and promoting science literacy? The answer is a qualified yes."
Here's my enormous round-up of everything we learned about LLMs in 2025 - the third in my annual series of reviews of the past twelve months
simonwillison.net/2025/Dec/31/...
This year it's divided into 26 sections! This is the table of contents:
Ok a translation is up.
No matter what your take on these rules, it's hard not to admire China trying to get ahead of these pressing issues: data protection, dependency, labeling and reminders that you are talking to a machine, etc.
Plenty to unpack here.
www.chinalawtranslate.com/en/chatbot-m...
In a new blog post, I contrast two flavors of empiricism: the one practiced in the social sciences and the one practiced in ML/CS.
I argue that we need both, given that CS is increasingly about "claims," and not just constructing artifacts.
doomscrollingbabel.manoel.xyz/p/the-empiri...
Paste any IIIF manifest → model classifies every page locally → see where illustrations appear.
Part of small-models-for-glam: small, efficient models for cultural heritage work.
Not everything needs GPT-4!
Try it: huggingface.co/spaces/small-models-for-glam/iiif-illustration-detector
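For anyone curious about the first step, here is a minimal sketch of pulling one image URL per page out of a manifest that has already been parsed into a dict. It is not the Space's actual code. The helper name `page_image_urls` and the example.org URLs are hypothetical, and it only covers the common IIIF Presentation 2 and 3 layouts (no `Choice` bodies). The classification step is left out.

```python
def page_image_urls(manifest):
    """Return the first image URL of each canvas in a IIIF manifest dict."""
    urls = []
    if "sequences" in manifest:  # Presentation API 2.x layout
        for canvas in manifest["sequences"][0].get("canvases", []):
            for image in canvas.get("images", [])[:1]:
                urls.append(image["resource"]["@id"])
    else:  # Presentation API 3.0 layout
        for canvas in manifest.get("items", []):
            for page in canvas.get("items", [])[:1]:
                for anno in page.get("items", [])[:1]:
                    urls.append(anno["body"]["id"])
    return urls


# Toy manifests with made-up URLs, trimmed to the fields the helper reads.
v2_manifest = {
    "sequences": [{"canvases": [
        {"images": [{"resource": {"@id": "https://example.org/p1.jpg"}}]},
        {"images": [{"resource": {"@id": "https://example.org/p2.jpg"}}]},
    ]}]
}
v3_manifest = {
    "items": [
        {"items": [{"items": [{"body": {"id": "https://example.org/p1.jpg"}}]}]},
    ]
}
print(page_image_urls(v2_manifest))
print(page_image_urls(v3_manifest))
```

Each returned URL could then be fetched and passed to whatever page classifier you run locally.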
‘Sam’s the biggest Cooke in town,’ New York City, 1964
If you are interested in the crawl-to-referral stats I mentioned, here's the Cloudflare blog post - blog.cloudflare.com/ai-search-cr...
Data is from the Cloudflare AI insights tool - radar.cloudflare.com/ai-insights#...
Slides and code from my #ff2025 talk, "Reasoning with Small Language Models (SLM) for Trustworthy Generative AI (GenAI)."
Slides:
docs.google.com/presentation...
Code:
github.com/jasonclark/a...
The kind folks at The Walrus published this feature over the weekend! Thank you! #skaterlibrarian thewalrusca.substack.com/p/meet-the-l...
“The Software Paper fills a gap for the computational and digital humanities communities...” Thank you to research software engineer extraordinaire @suttonkoeser.bsky.social for leading this initiative for Computational Humanities Research journal. Please share!
Thinking of Anne-Wil Harzing's 12 guidelines for good academic referencing & how Generative AI engines, even RAG/Deep Research, often break many/most of them (1) harzing.com/blog/2016/04...
For academic writers, editors, and publishers looking for a systematic overview of emerging GenAI policies in an academic field, please see Yin and Chapelle's (2025) excellent paper which provides this for the field of applied linguistics: www.sciencedirect.com/science/arti...
Interesting. The evolution of conflict in literature / society
(by Instagram user: @grantdraws)
www.instagram.com/p/DBOlpAuRj9c/
The Best Album of 1989 Round 5 Match #124
#3 De La Soul, 3 FEET HIGH AND RISING
vs.
#11 Fugazi, 13 SONGS
forms.gle/R6JGyPyp7DQ8...
The Responsible AI in Libraries and Archives team has released our interactive Viewfinder toolkit.
Toolkit (interactive website): www.lib.montana.edu/responsible-...
Project info: www.lib.montana.edu/responsible-...
#DLFForum #DLF2025
Reasoning with Small Language Models (SLM) to Create Trustworthy GenAI
Slides: doi.org/10.5281/zeno...
System Prompt: gist.github.com/jasonclark/1...
Model Context Protocol server:
gist.github.com/jasonclark/4...
#DLFForum #DLF2025
The Responsible AI in Libraries and Archives team has released Viewfinder: A toolkit for values-driven AI in libraries and archives that was created by librarians and tech ethicists at four universities.
Print-at-home PDF: osf.io/yue9s
Interactive website: www.lib.montana.edu/responsible-...