Peter Nadel presents his Streamlit application at the front of the room. The interface looks like a simple search engine. He is searching the Library of Congress Rosa Parks crowdsourced transcription dataset for the term “segregation.” There are 25 pages with 10 results per page. Each result shows metadata from the spreadsheet, including the transcription campaign, document title, link to the facsimile, and transcription text with the search term in bold.
For our third #lovedata25 workshop, @pnadelofficial.bsky.social taught about building your own search engines with NLP. He demonstrated his Streamlit app, where you can load your own data and customize your search.
It just so happens that the LC’s crowdsourced transcriptions are perfectly formatted.
13.02.2025 18:17 — 👍 5 🔁 2 💬 1 📌 0
I have been saying for years that 98% of the computation in LLMs is wasted, it’s just that figuring out what matters was harder than writing checks
27.01.2025 21:23 — 👍 38 🔁 2 💬 2 📌 0
screenshot of the top of the tutorial that says "Fine-tune ColPali for Multimodal RAG"
ColPali has landed in @hf.co transformers and I have just shipped a very lean fine-tuning tutorial in smol-vision 🤠💗
4-bit QLoRA fine-tuning with a batch size of 4 can be done with 32 GB VRAM and is very fast! ✨
github.com/merveenoyan/...
20.12.2024 15:53 — 👍 46 🔁 7 💬 0 📌 0
A lot has happened in DH at Tufts and the DH MA has become very exciting.
17.12.2024 19:51 — 👍 22 🔁 8 💬 1 📌 0
disgraceful
30.12.2023 16:33 — 👍 1 🔁 0 💬 0 📌 0
Nico youtu.be/LUtmAWP1ndQ?...
17.11.2023 01:39 — 👍 1 🔁 0 💬 0 📌 0
Yea
25.10.2023 20:08 — 👍 2 🔁 0 💬 1 📌 0
Got some open source LLMs running on the Tufts cluster, so you know I had to make a Latin finetune of Llama2-7b. Really impressive results for only 6-7 hours of training.
07.10.2023 21:51 — 👍 2 🔁 0 💬 0 📌 0
Postdoc at Uppsala University Computational Linguistics with Joakim Nivre
PhD from LMU Munich, prev. UT Austin, Princeton, @ltiatcmu.bsky.social, Cambridge
computational linguistics, construction grammar, morphosyntax
leonieweissweiler.github.io
https://najoung.kim
language
Director, MIT Computational Psycholinguistics Lab. President, Cognitive Science Society. Chair of the MIT Faculty. Open access & open science advocate. He.
Lab webpage: http://cpl.mit.edu/
Personal webpage: https://www.mit.edu/~rplevy
Asst Prof at Johns Hopkins Cognitive Science • Director of the Group for Language and Intelligence (GLINT) ✨• Interested in all things language, cognition, and AI
jennhu.github.io
I make colorless green GPUs sleep brrriously. Computational phonology, morphology, language change models, speech/language technologies (especially for people with disabilities).
Assistant professor at Yale Linguistics. Studying computational linguistics, cognitive science, and AI. He/him.
Stanford Professor of Linguistics and, by courtesy, of Computer Science, and member of @stanfordnlp.bsky.social and The Stanford AI Lab. He/Him/His. https://web.stanford.edu/~cgpotts/
Research Scientist at Ai2, PhD in NLP 🤖 UofA. Ex GoogleDeepMind, MSFTResearch, MilaQuebec
https://nouhadziri.github.io/
Postdoc @rug.nl with Arianna Bisazza.
Interested in NLP, interpretability, syntax, language acquisition and typology.
Faculty fellow at NYU CDS. Previously: PhD @ BIU NLP.
NLP, Linguistics, Cognitive Science, AI, ML, etc.
Job currently: Research Scientist (NYC)
Job formerly: NYU Linguistics, MSU Linguistics
(jolly good) Fellow at the Kempner Institute @kempnerinstitute.bsky.social, incoming assistant professor at UBC Linguistics (and by courtesy CS, Sept 2025). PhD @stanfordnlp.bsky.social with the lovely @jurafsky.bsky.social
isabelpapad.com
Assistant Professor of Computational Linguistics @ Georgetown; formerly postdoc @ ETH Zurich; PhD @ Harvard Linguistics, affiliated with MIT Brain & Cog Sci. Language, Computers, Cognition.
Interpretable Deep Networks. http://baulab.info/ @davidbau
Language and thought in brains and in machines. Assistant Prof @ Georgia Tech Psychology. Previously a postdoc @ MIT Quest for Intelligence, PhD @ MIT Brain and Cognitive Sciences. She/her
https://www.language-intelligence-thought.net
I study language using tools from cognitive science and neuroscience. I also like snuggles.
Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI
🥇 LLMs together (co-created model merging, BabyLM, textArena.ai)
🥈 Spreading science over hype in #ML & #NLP
Proud shareLM💬 Donor
@IBMResearch & @MIT_CSAIL
PhD student at UC San Diego.
He/him/his.
https://tylerachang.github.io/