For anybody in the mid-atlantic region, the annual conference MASC is looking for a host next year. It's a great chance for your university to meet other researchers (and potential collaborators) in our region!
16.12.2024 21:53 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Could you give an example of the input/output you're looking for on which function call (encode, tokenize, etc)? And maybe which tokenizer it's inheriting from ๐
(looks like maybe the OPT models inherit from a GPT2Tokenizer?)
26.11.2024 19:32 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
an compilation of adorable dog photos referencing a Simpson's meme ("Do it for her")
Happy to talk about any of these topics and more!
I will also likely end up talking a lot about my pride and joy (my dog).
20.11.2024 00:03 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
GitHub - rewicks/ctxpro: Data and annotation toolkit for finding translation ambiguities in bitext
Data and annotation toolkit for finding translation ambiguities in bitext - rewicks/ctxpro
And if you think sentence-level machine translation is good-enough, I encourage you to run your systems on our evaluation data (ctxpro, an extension to ContraPro and other similar evaluation datasets)
github.com/rewicks/ctxpro
20.11.2024 00:03 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
jhu-clsp/paradocs ยท Datasets at Hugging Face
Weโre on a journey to advance and democratize artificial intelligence through open source and open science.
Most recently I've released the ParaDocs dataset which reconstructs document annotations on large, parallel machine translation datasets. Contextual information is integral to machine translation, but often overlooked!
Data: huggingface.co/datasets/jhu...
20.11.2024 00:03 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Since we're all new here, an introduction:
I'm a final-year PhD student at Johns Hopkins University (in @jhuclsp.bsky.social working with Philipp Koehn and Matt Post.
I'm largely interested in the creation and processing of high-quality, multilingual datasets for both training and evaluation.
20.11.2024 00:03 โ ๐ 19 ๐ 2 ๐ฌ 2 ๐ 0
CLSP
Join the conversation
Putting together a JHU Center for Language and Speech Processing starter pack!
Please reply or DM me if you're doing research at CLSP and would like to be added - I'm still trying to find out which of us are on here so far.
go.bsky.app/JtWKca2
19.11.2024 15:37 โ ๐ 22 ๐ 9 ๐ฌ 1 ๐ 1
Cool work by @jhuclsp colleagues Rafael Rivera Soto and Nick Andrews on how AI-generated text carries unique stylistic fingerprints, enabling the detection and identification of specific language models.
Based on ICLR paper: arxiv.org/pdf/2401.06712
hub.jhu.edu/2024/11/18/a...
19.11.2024 18:17 โ ๐ 15 ๐ 4 ๐ฌ 0 ๐ 0
PhD student at Johns Hopkins CLSP (@jhuclsp.bsky.social).
Researching natural and formal language processing.
williamjurayj.com
PhD student at JHU
https://aleemkhan62.github.io
PhD student at JHU. @Databricks MosaicML, Microsoft Semantic Machines/Translate, Georgia Tech. I like datasets!
https://marcmarone.com/
PhD student at Aalborg University, mostly working on NLP and linguistic typology
Second year CS PhD student @notredame.bsky.social | Intern: Amazon | Prev: @jhuclsp.bsky.social
https://yining610.github.io/
I make colorless green GPUs sleep brrriously. Computational phonology, morphology, language change models, speech/language technologies (especially for people with disabilities).
professor for natural language processing, head of
BamNLP @bamnlp.de
๐ Duisburg, Stuttgart, Bamberg
#NLProc #emotion #sentiment #factchecking #argumentmining #informationextraction #bionlp
Research Saves Lives
Johns Hopkins Whiting School of Engineering
Baltimore, Maryland
https://engineering.jhu.edu
Assistant professor translation technology & AI at Universitรฉ de Montreal (@umontreal.bsky.social)
Associated academic member of Mila - Quebec Artificial Intelligence Institute (@mila-quebec.bsky.social)
CS PhD Student in Italy ๐ฎ๐น
Working towards making Deep Learning at-the-edge feasible.
๐ฑ GitHub: https://github.com/matteorisso
๐ Scholar: https://scholar.google.com/citations?user=ltE9im8AAAAJ&hl=en
๐ธ๏ธ๐๏ธ matteorisso.github.io
PhD student @jhuclsp.bsky.social
AI Researcher | AI for Sign Language
Investigator@NLM; machine learning for health. Views my own
https://www.nlm.nih.gov/research/researchstaff/WeissJeremy.html
Intern @Google, Ph.D. Student @Cornell_CS.
Interested in machine learning, LLM, brain, and healthcare.
abehrouz.github.io
Computational linguist at Roku Voice
PhD in Medical Genetics / Genomics / Paediatric Cancer Research / Drug Safety / 2018 Norwood Fair 2nd place chocolate chip cookies
Research Fellow, AMSE | Senior Member, IEEE | Associate Editor, IEEE Transactions on Technology and Society | Cofounder ACCELERATION & ADAPTATION | Ph.D. in Economics, UW-Madison | B.S. in Electrical Engineering, UnB
๐ http://pedrohalbuquerque.net
๐ชธNLP researcher, AI scientist at Deccan AI. Core interests: Computational Social Sciences, Conversational AI, Ai safety and Multilinguality
machine learning researcher @Apple | PhD from @CoML_ENS | speech, ml and cognition.