This is some great work!! I personally feel that one of the bottlenecks with memorization evals is having access to the gigantic training data. Super cool to see we can still run reliable evals without having access to the training data!
23.03.2025 23:24
On the subject of people traveling to the US right about now: Fuck, if I were from another country and I didn't need to come, I sure as hell wouldn't, the current administration has made it clear traveling here isn't safe, and why not take them at their word, there's a whole world to visit
16.03.2025 17:53
Image of Bill Nye, with text at the top of the image that says Speaker Announcement and at the bottom that says Stand Up for Science, March 7th, 2025 and a link to standupforscience2025.org.
STAND UP FOR SCIENCE SPEAKER ANNOUNCEMENT!
Bill Nye will be speaking at Stand Up for Science in D.C. on March 7th!
More speaker announcements coming soon, stay tuned!
06.03.2025 00:04
PhD AI agents for more than most PhD humans cost.
Look forward to the sales figure on this, cc @edzitron.com
05.03.2025 16:34
The Simons Institute is now on BlueSky!
Follow them: @simonsinstitute.bsky.social #TCSSky
28.02.2025 23:14
Francis Collins, the NIH Director for 12 years, who led the Human Genome Project and other NIH efforts for 32 years, resigned today. Key words from his resignation letter:
www.nytimes.com/2025/03/01/u...
01.03.2025 18:07
7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Efficient | Notion
A replication of DeepSeek-R1 training on small models with limited data
an open source 7B replication of R1-zero and R1
notable: they claim they developed in parallel and that most of their experiments were performed *prior to* the release of R1 and they came to the same conclusions
hkust-nlp.notion.site/simplerl-rea...
25.01.2025 16:33
YouTube video by Anthropic
Alignment faking in large language models
Amazing discussion from Anthropic, "Alignment faking in LLMs"
www.youtube.com/watch?v=9eXV...
19.12.2024 02:48
At the NeurIPS optimization workshop. In my opinion, the “most creative poster design” award should go to these folks:
15.12.2024 18:49
YouTube video by Lex Clips
Yann LeCun is a controversial visionary | Aravind Srinivas and Lex Fridman
Aravind Srinivas (Perplexity) tells Lex all the ways in which I was right against the prevalent ideas of the time: DL, ConvNets, energy-based models, SSL, the limitation of RL, and now the limitations of auto-regressive generative models including LLMs.
Thanks Aravind!
youtu.be/mnGUfkMt9fE?...
15.12.2024 22:28
YouTube video by The Royal Institution
The End of the Universe - with Geraint Lewis
It's Wednesday here in Australia, and so it's Hump Day. To cheer you up, here's a lecture I gave at the RI in London on the Future History of the Universe. #cosmology #physics #scicomm
youtu.be/IF4UhElRUFg?...
19.11.2024 19:40
Supervisor of the year is an understatement! Was so fortunate you supervised my PhD, thank you!!!
28.11.2024 03:29
Just created the Starter Pack for Optimization Researchers to help you on your journey into optimization!
Did I miss anyone? Tag them or let me know what to add!
go.bsky.app/VjpyyRw
23.11.2024 23:59
When you're the kid on the block with the latest greatest RL code lol, thanks to @vwxyzjn
28.11.2024 00:32
The closing date for this position is 20 December 2024 and a direct link is here: usyd.wd3.myworkdayjobs.com/en-US/USYD_E...
27.11.2024 03:08
thank you for sharing!
27.11.2024 04:58
I have not seen a starter pack for the study of brain rhythms. So, here's a start.
go.bsky.app/A6zgHeE
26.11.2024 17:52
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?
Several recent works seek to develop foundation models specifically for medical applications, adapting general-purpose large language models (LLMs) and vision-language models (VLMs) via continued pret...
Medically adapted foundation models (think Med-*) turn out to be more hot air than hot stuff. Correcting for fatal flaws in evaluation, the current crop are no better on balance than generic foundation models, even on the very tasks for which benefits are claimed.
arxiv.org/abs/2411.04118
26.11.2024 18:12
I hope this platform brings back the old Twitter flavour, the social interactions and interesting science conversations. Feels like clean air in my limited time here.
27.11.2024 04:24
yeah, never really managed to understand (and as you said, didn't care!) how Mastodon worked.
27.11.2024 04:09
n/n
We've got two implementations:
PTAViT3D: Runs S2 or S1 independently.
PTAViT3D-CA: Uses cross-attention to fuse S2 & S1 data.
With a modification (causality) you can use them for forecasting too, but more about that in an upcoming paper ;).
#AI #DeepLearning #Geospatial
27.11.2024 04:07
Handling Cloud Contamination
Clouds? No problem! Our model extracts field boundaries from clouded Sentinel-2 images and switches to SAR Sentinel-1 for dense cloud coverage.
#EarthObservation #RemoteSensing
27.11.2024 04:05
The world's leading venue for collaborative research in theoretical computer science. Follow us at http://YouTube.com/SimonsInstitute.
Digital Diary | Live the moment, love my cats, create a life I love
Upcoming postdoc at ETH Zurich. Working on neuroscience, AI, and robotics. Also into physics, ALife, complexity, filmmaking, philosophy, and space exploration.
Strengthening the Eastern European ML community and improving diversity in the field.
facebook.com/EEMLcommunity
EEML Organizer, ML researcher
Professor at NYU; Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
http://yann.lecun.com
Knowing how to build quality software gives you freedom.
I write stories, advice, and experiences.
https://xurxodev.com/libros
https://xurxodev.com/estudio-comunidad-xurxodev/
#CS Associate Prof York University, #ComputerVision Scientist Samsung #AI, VectorInst Faculty Affiliate, TPAMI AE, ELLIS4Europe Member, #CVPR2025 Publicity Chair on X
Toronto 🇨🇦 | csprofkgd.github.io
Joined Nov 2024
Software Engineer | Software minimalist/retro
AI tinkerer | Building tech communities
🇪🇺 UK 🇬🇧🇩🇪🇻🇪 | Check bio: axelgarciak.com/bio
Anaconda Founder&Head of AI; created the PyData movement, PyScript, Bokeh, Datashader; Fellow @ Python Software Foundation; Center for Humane Tech
Game~B; Physics, Cybernetics, Memetics. A student of the human condition.
Memento mori
Research Associate at The Brown University School of Public Health focusing on pandemics and health policy. Formerly of Middlebury.
Founder of Our World in Data
Professor at the University of Oxford
Data to understand global problems and research to make progress against them.
Cosmologist & Galactic Archaeologist at @Sydney_Uni. Proud Silurian, Australian immigrant, & author of books about the strange universe we find ourselves in!
Professor @UCLA, Research Scientist @ByteDance | Recent work: SPIN, SPPO, DPLM 1/2, GPM, MARS | Opinions are my own
Senior Research Scientist at Google DeepMind. I ∈ Optimization ∩ Machine Learning. Fan of IronMaiden. Here to discuss research
I work on AI at OpenAI.
Former VP AI and Distinguished Scientist at Microsoft.