What are the power dynamics involved with the amount of abuse I have to heap on small LMs to get them to stay on prompt?
Large models you can just ask nicely. Qwen and below you have to threaten with bodily harm.
04.12.2024 23:08
The story so far:
- Sane ML in exile from X
- welcome to bsky we are happy to have you!
- let's help build better bsky tools using our knowledge and skills!
HOW DARE YOU! WE WILL BURN YOU TO THE GROUND!
28.11.2024 06:42
FL was a field that caught its own tail, in terms of hype cycle. Now it is just something entirely detached from reality.
Thankfully, while incredibly hyped, the LLM/transformer field has actually delivered, or at least delivered enough to outpace itself.
25.11.2024 22:14
But federated learning is synchronous training owing to its need for data privacy, which induces huge sync overhead for SMPC. Not sure FL is the right lens to look through.
Why not go back to EA-SGD, Hogwild, etc. and build back up from there? Async breaks the bottleneck and grads can be quant
25.11.2024 12:47
The AI community building the future!
AI @ OpenAI, Tesla, Stanford
a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor.
see more at https://kyunghyuncho.me/
Professor, UW Biology / Santa Fe Institute
I study how information flows in biology, science, and society.
Book: *Calling Bullshit*, http://tinyurl.com/fdcuvd7b
LLM course: https://thebullshitmachines.com
Corvids: https://tinyurl.com/mr2n5ymk
he/him
Research & code: Research director @inria
►Data, Health, & Computer science
►Python coder, (co)founder of scikit-learn, joblib, & @probabl.bsky.social
►Sometimes does art photography
►Physics PhD
new arXiv preprints mentioning "differential privacy" or "differentially private" in the title/abstract/metadata
- unrelated quantum/FL papers
+ updates from https://differentialprivacy.org
[Under construction.]
building the future
research at midjourney, deepmind. slinging ai hot takes 🔥 at artfintel.com
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence.
Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Independent AI researcher, creator of datasette.io and llm.datasette.io, building open source tools for data journalism, writing about a lot of stuff at https://simonwillison.net/
AI safety at Anthropic, on leave from a faculty job at NYU.
Views not employers'.
I think you should join Giving What We Can.
cims.nyu.edu/~sbowman
Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain. Personal account.
What would we need to understand in order to design an amazing future? Ex DeepMind, OpenAI
Human being. Trying to do good. CEO @ Encultured AI. AI Researcher @ UC Berkeley. Listed bday is approximate ;)