
Thom Lake

@thomlake.bsky.social

Principal Scientist at Indeed. PhD Student at UT Austin. AI, Deep Learning, PGMs, and NLP.

703 Followers  |  410 Following  |  6 Posts  |  Joined: 13.11.2024

Latest posts by thomlake.bsky.social on Bluesky


I'm at #Neurips2024 this week!

My work (arxiv.org/abs/2406.17692) w/ @gregdnlp.bsky.social & @eunsol.bsky.social exploring the connection between LLM alignment and response pluralism will be at pluralistic-alignment.github.io Saturday. Drop by to learn more!

11.12.2024 17:39 — 👍 28  🔁 6  💬 0  📌 0

Due to the split between the input statements and the query, the resulting model isn't a generic sequence processor like RNNs or transformers. However, if you were to process a sequence by treating each element in turn as the query, you'd get something that looks a lot like a transformer.

02.12.2024 16:37 — 👍 6  🔁 0  💬 0  📌 0
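A minimal NumPy sketch of the point in the post above (toy shapes; learned projections and separate output embeddings are omitted): if every position takes a turn as the query over all of the statements, the memory read collapses into plain single-head self-attention.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def memory_read(memories, query):
    # One memory-network-style read: a single query attends over all statements.
    probs = softmax(memories @ query)        # (n_statements,)
    return probs @ memories                  # weighted sum of the memories

def self_attention(x):
    # Treat each element of the sequence as the query in turn:
    # this is unprojected, single-head self-attention.
    return np.stack([memory_read(x, x[t]) for t in range(len(x))])

x = np.random.randn(5, 16)                   # 5 statements, 16-dim encodings
print(self_attention(x).shape)               # (5, 16)
```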

MemNets first encode each input sentence/statement independently, with a position embedding. These are the "memories". Then you encode the query and apply cross-attention between it and the memories. Rinse and repeat for some fixed depth. No for-loop over time here.

02.12.2024 16:35 — 👍 6  🔁 0  💬 1  📌 0
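A rough PyTorch sketch of the read pattern described above, under simplifying assumptions (one shared embedding table and a residual state update, whereas the paper uses separate input/output embeddings; the names and sizes here are made up). The `hops` loop is the "rinse and repeat" depth; nothing iterates over sequence positions.

```python
import torch
import torch.nn.functional as F

def encode_statements(statements, word_emb, pos_emb):
    # statements: (n_statements, n_words) token ids.
    # Position-weighted bag of words per statement, encoded independently.
    return (word_emb[statements] * pos_emb).sum(dim=1)            # (n_statements, d)

def memnet(statements, query, word_emb, pos_emb, hops=3):
    memories = encode_statements(statements, word_emb, pos_emb)   # the "memories"
    u = word_emb[query].sum(dim=0)                                # query encoding, (d,)
    for _ in range(hops):                                         # fixed depth, no loop over time
        attn = F.softmax(memories @ u, dim=0)                     # cross-attention scores
        o = attn @ memories                                       # read from memory
        u = u + o                                                 # update the query state
    return u

# Toy usage with hypothetical sizes.
vocab, d, n_words = 50, 32, 6
word_emb = torch.randn(vocab, d)
pos_emb = torch.randn(n_words, d)
statements = torch.randint(0, vocab, (4, n_words))
query = torch.randint(0, vocab, (n_words,))
print(memnet(statements, query, word_emb, pos_emb).shape)         # torch.Size([32])
```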

The recurrence there refers to depth-wise weight tying (see Section 2.2):

> Layer-wise (RNN-like): the input and output embeddings are the same across different layers

02.12.2024 15:49 — 👍 0  🔁 0  💬 1  📌 0
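A toy NumPy sketch of the layer-wise tying that quote describes (the statement encodings are random stand-ins; `A`, `C`, and `H` follow the Section 2.2 notation): the same embedding matrices are reused at every hop and only the controller state changes, which is why it reads like an RNN unrolled over depth rather than time.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)
d, n = 16, 5                      # toy embedding size and number of statements
A = rng.normal(size=(d, d))       # input-memory embedding, shared across hops
C = rng.normal(size=(d, d))       # output-memory embedding, shared across hops
H = rng.normal(size=(d, d))       # linear map on the controller state
x = rng.normal(size=(n, d))       # pre-encoded statements (random stand-ins)
u = rng.normal(size=d)            # initial query / controller state

for _ in range(3):                # fixed number of hops, same weights every time
    m, c = x @ A, x @ C           # same A and C at every layer = layer-wise tying
    p = softmax(m @ u)            # attention over memories
    o = p @ c                     # read vector
    u = H @ u + o                 # RNN-like state update across depth

print(u.shape)                    # (16,)
```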
Preview
End-To-End Memory Networks: We introduce a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network (Weston et al., 2015) but unlike the model in that wo...

Memory networks came earlier, were attention-only, and had position embeddings, but were not word/token level: arxiv.org/abs/1503.08895

They were later elaborated with the key-value distinction, which is, AFAIK, where this terminology arises: arxiv.org/abs/1606.03126

02.12.2024 06:32 — 👍 1  🔁 0  💬 1  📌 0
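A minimal sketch of the key-value read that second paper introduces (NumPy, toy shapes): the query is matched against the keys, but the readout is a weighted sum over a separate set of values, the same split that transformer attention uses.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def key_value_read(query, keys, values):
    # Key-value memory read: address by keys, return a mix of values.
    probs = softmax(keys @ query)          # match the query against the keys
    return probs @ values                  # weighted sum of the values

rng = np.random.default_rng(0)
keys = rng.normal(size=(8, 32))            # e.g. encoded questions or window centers
values = rng.normal(size=(8, 32))          # e.g. encoded answers or window words
query = rng.normal(size=32)
print(key_value_read(query, keys, values).shape)   # (32,)
```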
A scatter plot comparing language models by performance (y-axis, measured in average performance on 10 benchmarks) versus training computational cost (x-axis, in approximate FLOPs). The plot shows OLMo 2 models (marked with stars) achieving Pareto-optimal efficiency among open models, with OLMo-2-13B and OLMo-2-7B sitting at the performance frontier relative to other open models like DCLM, Llama 3.1, StableLM 2, and Qwen 2.5. The x-axis ranges from 4x10^22 to 2x10^24 FLOPs, while the y-axis ranges from 35 to 70 benchmark points.


Excited to share OLMo 2!

🐟 7B and 13B weights, trained up to 4-5T tokens, fully open data, code, etc
🐠 better architecture and recipe for training stability
🐑 staged training, with new data mix Dolmino added during annealing
🦈 state-of-the-art OLMo 2 Instruct models

#nlp #mlsky

links below 👇

26.11.2024 20:59 — 👍 68  🔁 12  💬 1  📌 1

👋

25.11.2024 14:44 — 👍 0  🔁 0  💬 0  📌 0
Preview
NLP at UT Austin: Join the conversation

A starter pack for the NLP and Computational Linguistics researchers at UT Austin!
go.bsky.app/75g9JLT

22.11.2024 17:18 — 👍 22  🔁 7  💬 0  📌 0
