Peter Nadel's Avatar

Peter Nadel

@pnadelofficial.bsky.social

Digital Humanities NLP Specialist at Tufts University, working on Digital Philology and Augmented Intelligence.

73 Followers  |  392 Following  |  4 Posts  |  Joined: 04.10.2023  |  1.5093

Latest posts by pnadelofficial.bsky.social on Bluesky

Peter Nadel presents his streamlit application at the front of the room. The interface looks like a simple search engine. He is searching the Library of Congress Rosa Parks crowdsourced transcription dataset for the term “segregation.” There are 25 pages with 10 results per page. A result shows metadata from the spreadsheet including the transcription campaign, document title, link to the facsimile, and transcription text with search term in bold.

Peter Nadel presents his streamlit application at the front of the room. The interface looks like a simple search engine. He is searching the Library of Congress Rosa Parks crowdsourced transcription dataset for the term “segregation.” There are 25 pages with 10 results per page. A result shows metadata from the spreadsheet including the transcription campaign, document title, link to the facsimile, and transcription text with search term in bold.

For our third #lovedata25 workshop @pnadelofficial.bsky.social taught about building your own search engines with NLP. He demonstrates his streamlit app where you can load your own data and customize your search.

It just so happens that the LC’s crowdsourced transcriptions are perfectly formatted.

13.02.2025 18:17 — 👍 5    🔁 2    💬 1    📌 0
Love Data Week 2025 | Tisch Libraryart-gallery-11.svgDirection2mapsArtboard 10Education-library-school-study-universityArtboard 98 What is Love Data Week? "Love Data Week is an international celebration of data, taking place every year during the week of Valentine's day. Universities, nonprofit organizations, government agencies,...

Our Love Data line-up:
@kaylendwyer.bsky.social - Handwriting Text Recognition w/ @transkribus.bsky.social
@akijas.bsky.social - TEI LeafWriter @pnadelofficial.bsky.social - building a search engine
Finally, @douglassday.bsky.social transcribe-a-thon.

We'll be using Douglass Day data all week!

28.01.2025 20:25 — 👍 3    🔁 2    💬 1    📌 0

I have been saying for years that 98% of the computation in LLMs is wasted, it’s just that figuring out what matters was harder than writing checks

27.01.2025 21:23 — 👍 38    🔁 2    💬 2    📌 0
screenshot of the top of the tutorial that says "Fine-tune ColPali for Multimodal RAG"

screenshot of the top of the tutorial that says "Fine-tune ColPali for Multimodal RAG"

ColPali is landed at @hf.co transformers and I have just shipped a very lean fine-tuning tutorial in smol-vision 🤠💗

QLoRA fine-tuning with 4-bit with bsz of 4 can be done with 32 GB VRAM and is very fast! ✨
github.com/merveenoyan/...

20.12.2024 15:53 — 👍 46    🔁 7    💬 0    📌 0
Post image

A lot has happened in DH at Tufts and the DH MA has become very exciting.

17.12.2024 19:51 — 👍 22    🔁 8    💬 1    📌 0

disgraceful

30.12.2023 16:33 — 👍 1    🔁 0    💬 0    📌 0

Nico youtu.be/LUtmAWP1ndQ?...

17.11.2023 01:39 — 👍 1    🔁 0    💬 0    📌 0

Yea

25.10.2023 20:08 — 👍 2    🔁 0    💬 1    📌 0
Post image

Got some open source LLMs running on the Tufts cluster, so you know I had to make a Latin finetune of Llama2-7b. Really impressive results for only 6-7 hours of training.

07.10.2023 21:51 — 👍 2    🔁 0    💬 0    📌 0

@pnadelofficial is following 20 prominent accounts