New Paper Day! For ACL 2025 Findings:
You should **drop dropout** when you are training your LMs AND MLMs!
02.06.2025 01:22 โ ๐ 11 ๐ 4 ๐ฌ 1 ๐ 0
Paper: arxiv.org/abs/2405.16039
Code: github.com/robertcsorda...
12.12.2024 22:47 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Come visit our poster "MoEUT: Mixture-of-Experts Universal Transformers" on Friday at 4:30 in East Exhibit Hall A-C #1907 on #NeurIPS2024. With Kazuki Irie, Jรผrgen Schmidhuber, Christopher Potts and @chrmanning.bsky.social.
12.12.2024 22:46 โ ๐ 14 ๐ 5 ๐ฌ 1 ๐ 0
Join Slido: Enter #code to vote and ask questions
Participate in a live poll, quiz or Q&A. No login required.
๐Excited for #neurips2024 and our "System 2 Reasoning at Scale" workshop. We have an excited lineup of speakers who will answer your most burning questions about AI and reasoning ๐
๐ฅGot spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...
03.12.2024 17:43 โ ๐ 4 ๐ 3 ๐ฌ 1 ๐ 1
News and events from the Faculty of Informatics of the Universitร della Svizzera italiana (USI) - Lugano, Switzerland
Computer history. Reverse-engineering old chips. Restored Apollo Guidance Computer, Alto. Ex-Google, Sun, Msft. So-called boffin.
Vintage electronics from youtube/@curiousmarc and www.curiousmarc.com
Working on creativity, curiosity and interestingness. PhD @ IDSIA with Jรผrgen Schmidhuber in Lugano, Switzerland. Classical pianist.
https://vincentherrmann.github.io
Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian.
Anon feedback: https://admonymous.co/giffmana
๐ Zรผrich, Suisse ๐ http://lucasb.eyer.be
Dimensionality Diabolist, Seeker of Optima
Senior Staff Research Scientist, Google DeepMind
Affiliated Lecturer, University of Cambridge
Associate, Clare Hall
GDL Scholar, ELLIS @ellis.eu
๐ท๐ธ๐ฒ๐ช๐ง๐ฆ
proud mediterrenean ๐งฟ open-sourceress at hugging face ๐ค multimodality, zero-shot vision, vision language models, transformers
PhD in ML/AI | Researching Efficient ML/AI (vision & language) ๐ & Interpretability | @SapienzaRoma @EdinburghNLP | https://alessiodevoto.github.io/ | ex @NVIDIA
Secular Bayesian.
Professor of Machine Learning at Cambridge Computer Lab
Talent aficionado at http://airetreat.org
Alum of Twitter, Magic Pony and Balderton Capital
Associate Professor of Machine Learning, University of Oxford;
OATML Group Leader;
Director of Research at the UK government's AI Safety Institute (formerly UK Taskforce on Frontier AI)
ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://amzn.to/4fqvn0D) & reasoning (https://mng.bz/Nwr7).
Also blogging about AI research at magazine.sebastianraschka.com.
Researcher in ML/NLP at the University of Edinburgh (faculty at Informatics and EdinburghNLP), Co-Founder/CTO at www.miniml.ai, ELLIS (@ELLIS.eu) Scholar, Generative AI Lab (GAIL, https://gail.ed.ac.uk/) Fellow -- www.neuralnoise.com, he/they
Assistant professor at Yale Linguistics. Studying computational linguistics, cognitive science, and AI. He/him.
Researching planning, reasoning, and RL in LLMs @ Reflection AI. Previously: Google DeepMind, UC Berkeley, MIT. I post about: AI ๐ค, flowers ๐ท, parenting ๐ถ, public transit ๐. She/her.
http://www.jesshamrick.com
This is the account for the NLP community at Imperial College London! Looking forward to sharing our NLP research with you ๐
Natural Language Processing research community at the University of Colorado Boulder.
www.colorado.edu/research/bouldernlp