Eric W. Tramel

@fujikanaeda.bsky.social

Research Scientist, Engineer, & Builder in ML & AI (learning flavors: Generative, Privacy preserving, federated, unsupervised, Bayesian). Ex: Unlearn.ai, Amazon Alexa, Owkin, INRIA, ENS.

30 Followers  |  236 Following  |  4 Posts  |  Joined: 25.11.2024  |  1.5295

Latest posts by fujikanaeda.bsky.social on Bluesky

What are the power dynamics involved with the amount of abuse I have to heap on small LMs to get them to stay on prompt?

Large models you can just ask nicely. Qwen and below you have to threaten with bodily harm.

04.12.2024 23:08 | 👍 0    🔁 0    💬 0    📌 0

The story so far:

- Sane ML in exile from X
- welcome to bsky we are happy to have you!
- let's help build better bsky tools using our knowledge and skills!

HOW DARE YOU! WE WILL BURN YOU TO THE GROUND!

28.11.2024 06:42 | 👍 0    🔁 0    💬 0    📌 0

FL was a field that caught its own tail, in terms of hype cycle. Now it is just something entirely detached from reality.

Thankfully, while incredibly hyped, the LLM/transformer field has actually delivered, or delivered enough to outpace itself.

25.11.2024 22:14 | 👍 1    🔁 0    💬 1    📌 0

But federated learning is synchronous training owing to its need for data privacy, which induces huge sync overhead for SMPC. Not sure FL is the right lens to look through.

Why not go back to EA-SGD, Hogwild, etc. and build back up from there? Async breaks the bottleneck and grads can be quantized.

25.11.2024 12:47 | 👍 1    🔁 0    💬 1    📌 0

@fujikanaeda is following 19 prominent accounts