Our new work on scaling laws covers compute, model size, and number of samples. It rests on an extremely fine-grained analysis of online SGD, built up over the last 8 years of understanding SGD on simple toy models (tensors, single-index models, multi-index models).
05.05.2025 17:08
Welcome to the Bluesky account for Stand Up for Science 2025!
Keep an eye on this space for updates, event information, and ways to get involved. We can't wait to see everyone #standupforscience2025 on March 7th, both in DC and locations nationwide!
#scienceforall #sciencenotsilence
12.02.2025 17:04
Duck in Vancouver! Mott32
11.12.2024 03:23
“On a log-log plot, my grandmother fits on a straight line.”
-Physicist Fritz Houtermans
There's a lot of truth to this. Log-log plots are often abused and can be very misleading.
1/5
03.12.2024 04:41
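One way to see the point about log-log plots is the sketch below. It fits a straight line to a pure exponential, which is not a power law, on log-log axes over a single factor of two in x. The helper `loglog_fit` and the sample range 10 to 20 are illustrative assumptions, not something from the thread.

```python
import math

def loglog_fit(xs, ys):
    """Least-squares line through (log x, log y).

    Returns (slope, r_squared). The helper name is illustrative only.
    """
    u = [math.log(x) for x in xs]
    v = [math.log(y) for y in ys]
    n = len(u)
    mean_u = sum(u) / n
    mean_v = sum(v) / n
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    slope = s_uv / s_uu
    r_squared = s_uv * s_uv / (s_uu * s_vv)
    return slope, r_squared

# Assumed example data: y = exp(x), sampled on x in [10, 20].
xs = [10 + 0.5 * i for i in range(21)]
ys = [math.exp(x) for x in xs]

slope, r2 = loglog_fit(xs, ys)
print(f"apparent power-law exponent: {slope:.2f}, R^2: {r2:.4f}")
```

Over such a narrow range the fit looks nearly perfect, even though the local slope d(log y)/d(log x) = x doubles from one end of the range to the other.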
Just put together a starter pack for Deep Learning Theory. Let me know if you'd like to be included or suggest someone to add to the list!
go.bsky.app/2qnppia
22.11.2024 21:35
Lool
27.11.2024 20:58
Zihan Zhang (tinyurl.com/4nks7f9b) is a postdoc with Yuxin Chen, Simon Du, and me.
27.11.2024 20:54
What's known about the 1.27 lower bound? Is it a guess, or is there a reason ppl believe it's fundamental?
27.11.2024 17:50
Send your COLT open problems to Zihan; with high probability he will solve them!
27.11.2024 14:33
What's the point of @perplexity_ai given ChatGPT also does search?
25.11.2024 01:06
Yo add me to your starter packs!
24.11.2024 16:23
Spread of innovation in a small world network.
Assume that the nodes of a social network can choose between two alternative technologies: B and X.
A node using B receives a benefit with respect to X, but there is a benefit to using the same tech as the majority of your neighbors.
Assume everyone uses X at time t=0. Will they switch to B?
23.11.2024 22:48
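The setup in the post above can be sketched as a small simulation. Everything specific here is an assumption rather than the post's own model: the payoff of B is taken to be `a` plus the fraction of neighbors on B, the payoff of X is the fraction of neighbors on X, all nodes update at once by best response, and the network is a Watts-Strogatz-style ring with rewiring probability `p`. The post starts with everyone on X; the optional `seeds` argument (a few initial B adopters) is a hypothetical addition so that something can happen when `a` is small.

```python
import random

def small_world(n, k, p, rng):
    """Ring lattice with k nearest neighbours per node, each lattice
    edge rewired with probability p (Watts-Strogatz style)."""
    nbrs = [set() for _ in range(n)]
    for i in range(n):
        for j in range(1, k // 2 + 1):
            nbrs[i].add((i + j) % n)
            nbrs[(i + j) % n].add(i)
    for i in range(n):
        for j in range(1, k // 2 + 1):
            old = (i + j) % n
            if rng.random() < p and old in nbrs[i]:
                # Candidate endpoints: no self-loops, no duplicate edges.
                choices = [v for v in range(n) if v != i and v not in nbrs[i]]
                if choices:
                    new = rng.choice(choices)
                    nbrs[i].discard(old)
                    nbrs[old].discard(i)
                    nbrs[i].add(new)
                    nbrs[new].add(i)
    return nbrs

def simulate(n=60, k=4, p=0.0, a=0.2, seeds=(), steps=40, rng_seed=0):
    """Return the fraction of nodes using B after synchronous
    best-response updates. Assumed payoffs: B earns a + frac_b,
    X earns 1 - frac_b, where frac_b is the share of neighbours on B."""
    rng = random.Random(rng_seed)
    nbrs = small_world(n, k, p, rng)
    uses_b = [False] * n          # everyone uses X at t=0 ...
    for s in seeds:               # ... except hypothetical early adopters
        uses_b[s] = True
    for _ in range(steps):
        new_state = []
        for i in range(n):
            deg = len(nbrs[i])
            frac_b = sum(uses_b[j] for j in nbrs[i]) / deg if deg else 0.0
            new_state.append(a + frac_b > 1.0 - frac_b)
        if new_state == uses_b:   # fixed point reached
            break
        uses_b = new_state
    return sum(uses_b) / n

if __name__ == "__main__":
    print("no adopters:     ", simulate())
    print("one adopter:     ", simulate(seeds=(0,)))
    print("block of four:   ", simulate(seeds=(0, 1, 2, 3)))
    print("rewired, p=0.1:  ", simulate(p=0.1, seeds=(0, 1, 2, 3)))
```

Under these assumptions nobody leaves X from an all-X start unless `a` exceeds 1. On the unrewired ring with `a = 0.2`, a lone adopter reverts to X, while a contiguous block of four adopters spreads B to the whole ring. Changing `p` lets you explore how shortcuts affect the spread.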
Takes too much clicking...
23.11.2024 19:18
How do I bulk follow people?
23.11.2024 19:10
phd student @ princeton · deep learning theory
eshaannichani.com
Historian: White Flight; New Suburban History; Fog of War; One Nation Under God; Fault Lines; Voter Suppression; Myth America. CAMPAIGN TRAILS: campaign-trails.ghost.io
NLP/Machine Translation/NLG/Deep Learning
Researcher at NICT, Japan
Adjunct Faculty at IIT Madras
Visiting Professor at IIT Bombay
Ex Kyoto University
prajdabre.github.io
Associate Professor in Computer Science at the University of Maryland. Human-Centered Natural Language Processing & Machine Translation
Researcher at Charles University | multilingual natural language processing, machine translation
PhD student in linguistics at the University of Kansas. Morphosyntax, variation, change, revitalization, and a whole lot of food. https://theycallmezeal.me he/him
(Sworn) Translator (ella/she). Translation teacher at Universitat Rovira i Virgili. PhD student; diss. on translation teacher's competence in Spain.
sarahorcas@gmail.com
Researcher at Cohere | Multilingual LLM evaluation
PhD, CDT in NLP, University of Edinburgh. Prev: IIT Madras | University of Mumbai. She/her.
SNSF Professor at University of Zurich. #NLP / #ML.
http://www.cl.uzh.ch/sennrich
Prof. @ Karlsruhe Institute for Technology, NLP
CTO of the MITRA project @BAIR, UC Berkeley.
Research in ancient Asian low resource languages, especially text reuse, machine translation, semantic similarity search.
Buddhist studies MA, now PhD in computational linguistics @Duesseldorf university.
Dublin. 29. 🏳️‍🌈. He/Him. Brazilian. Partnered. Instructional Designer. PhD researcher in AI - QA for machine translation. Keratoconus is my nemesis.
This is a personal account.
Insta is @johnrihawf.
Associate Professor of Translation and Human-Centred AI @LeidenHumanities (NL). Loves metaphor, stylistics and (machine) translation. PI of NWO-Vidi project "Metaphors in Machine Translation: Reactions, Responses, Repercussions" (2025-2030).
Postdoc at @hitz-zentroa.bsky.social / internship @IKER zentroa (UMR5478)
Participatory research
Human-Centered NLP
Machine Translation
(eu) kontu pertsonala:
https://mastodon.eus/@XabierSoto
PhD student at the University of Trento and @fbk-mt.bsky.social, working on gender-inclusive machine translation (he/him)
Applied Scientist Intern at Amazon
apierg.github.io
#NLP #NLProc #MT
Assistant Professor of Mathematics at
Oxford College of Emory University. Father & Husband. Wannabe Cook. I like to teach/do/spread math as I can. (He/His)
Assistant professor (of mathematics) at the University of Toronto. Algebraic geometry, number theory, forever distracted and confused, etc. He/him.
The liberated (and disabled) mathematician. Solidarity and accessible vibes only. Black + AuDHD + Not Open to Being Mistreated. she/they β women. 🏳️‍🌈
Artist & math nerd. She/her. Actually autistic π²
Aspiring polyglot:
🇹🇭🇨🇳🇭🇰🇱🇦🇰🇭🇩🇪
I used to be a tattooer until I took an arrow to the knee
https://linktr.ee/Acid_Lich?utm_source=linkt