There is a restaurant that serves Chinese & Italian food, located in the New Delhi metro area in India, named 'Chao Bella'.
Here's their Insta page:
www.instagram.com/chaobellacpo...
@saurabhprasad-2.bsky.social
Software Architect & Sr. Fullstack Engineer / Dev - Java, JS, Python, ML/AI. Other interests: CompSci, Tech, Startups, Academia, Research, Electronics, Mathematics, Physics, Space, Science, AR/VR, Robotics, Guitar, Piano, Sci-fi, Languages
Here's why "alignment research" when it comes to LLMs is a big mess, as I see it.
Claude is not a real guy. Claude is a character in the stories that an LLM has been programmed to write. Just to give it a distinct name, let's call the LLM "the Shoggoth".
Happy new year, Bluesky peeps! 🥳
Wish you all a very happy, healthy, and prosperous 2025!
@camachocollados.bsky.social and I taught ML for #nlp last semester.
Here is a list of resources that we shared with the students (sorted by theme) in this blogpost: github.com/nedjmaou/CMT...
[Not all students were familiar with coding so it also includes resources for beginners.]
LLMs faking alignment during training. #mlsky
thezvi.substack.com/p/ais-will-i...
#NeurIPS2024 wrapped up last week. I put together a curated reading list for #DeepRL and #reinforcementlearning work (it reflects my own interests).
Talks and workshops:
third-crowd-c77.notion.site/NeurIPS2024-...
Curated reading list
fracturedplane.notion.site/NeurIPS2024-...
#Holidayreading
Interviewer: Can you explain this gap in your resume?
Physicist: I can only tell you about my momentum at that time, not the position
One of the most bizarre + complex talks at NeurIPS [1] was given by my fellow Yorkshireman, the inimitable Prof Karl Friston [2], explaining active inference to a room full of #AI people who are not really neuroscientists. This was interesting to me because... (1/n)
22.12.2024 06:57
Transformer LMs get pretty far by acting like ngram models, so why do they learn syntax? A new paper by sunnytqin.bsky.social, me, and @dmelis.bsky.social illuminates grammar learning in a whirlwind tour of generalization, grokking, training dynamics, memorization, and random variation. #mlsky #nlp
20.12.2024 17:55
🧪 Emory leverages #DigitaTwin tech to guide GLP-1 prescriptions for weight and diabetes care. An AI model simulates clinical decisions, providing expert-level support for patient suitability. 🩺 #MLSky
20.12.2024 15:25
The ever elusive quantum computer moved a few crucial steps toward practical applications this year.
www.quantamagazine.org/the-year-in-...
Read about this year's biggest moments in biology.
www.quantamagazine.org/the-year-in-...
This year, researchers detected a hint of a signal that, if real, could upend and deepen our understanding of the fundamental laws of the universe. Read our annual review of the biggest developments in physics:
www.quantamagazine.org/the-year-in-...
Here are the biggest breakthroughs that happened in mathematics this year.
www.quantamagazine.org/the-year-in-...
I'll get straight to the point.
We trained 2 new models. Like BERT, but modern. ModernBERT.
Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.
It's much faster, more accurate, longer context, and more useful. 🧵
Great blog post (by a 15-author team!) on their release of ModernBERT, the continuing relevance of encoder-only models, and how they relate to, say, GPT-4/llama. Accessible enough that I might use this as an undergrad reading.
19.12.2024 19:11
Picture of a plaque assay (purple field with clear holes) showing gorgeous poliovirus plaques
My replies are perpetually full of anti-vaxxers these days telling me about polio vaccines.
Not shockingly, most of what they are saying is wrong. Luckily, I trained with Vincent Racaniello & he taught me a few things about poliovirus.
So let's discuss the king of the Picornaviridae
Artificial intelligence is going to improve productivity. That will also create more wealth. How do you keep AI-enhanced productivity from only benefiting the wealthy? Raj Reddy, one of the pioneers of AI research, has ideas. www.quantamagazine.org/the-ai-pione...
04.12.2024 15:06
It took four days from submission to publication, and nearly five years from publication to retraction. After campaigning by many, many scientists, and an investigation by Elsevier, an infamous paper on hydroxychloroquine as a COVID-19 treatment has been retracted. 🧪
www.science.org/content/arti...
Ilya Sutskever full talk "Sequence to sequence learning with neural networks: what a decade" at NeurIPS 2024 in Vancouver, Canada.
www.youtube.com/watch?v=1yvB...
Want all NeurIPS/ICML/ICLR papers in one single .bib file? Here you go!
short blog post: fabian-sp.github.io/posts/2024/1...
bib files: github.com/fabian-sp/ml-bib
Fun fact related to this thread:
Do you know what brought the ReLU activation function back into vogue in AI?
It was this paper from Yoshua's group, which was motivated by the observation that ReLU is a better match to real neurons' IO-functions!
proceedings.mlr.press/v15/glorot11a
🧠 #NeuroAI
1/ Okay, one thing that has been revealed to me from the replies to this is that many people don't know (or refuse to recognize) the following fact:
The units in ANNs are actually not a terrible approximation of how real neurons work!
A tiny 🧵.
🧠 #NeuroAI #MLSky
Noether's razor: Our NeurIPS 2024 paper connects ML symmetries to conserved quantities through a seminal result in mathematical physics: Noether's theorem. We can learn neural network symmetries from data by learning associated conservation laws. Learn more. 1/16 🧵
06.12.2024 13:42
How can we empower scientific discovery in millions of nature photos?
Introducing INQUIRE: A benchmark testing if AI vision-language models can help scientists find biodiversity patterns, from disease symptoms to rare behaviors, hidden in vast image collections.
Thread 🧵
Gukesh Dommaraju becomes youngest world chess champion after horrific Ding Liren blunder
12.12.2024 22:24
Have you ever wondered how training dynamics differ between LLMs and Vision models? We explore this and close the gap between VMs and LLMs in our #NeurIPS2024 paper "TrAct: Making First-layer Pre-Activations Trainable".
Paper link: arxiv.org/abs/2410.23970
Video link: youtu.be/ZjTAjjxbkRY
🧵
It's never too late to start learning a new instrument
12.12.2024 17:34
That's so cool!
I'm a software architect & engineer/developer, but I enjoy playing the guitar & piano as a hobby!
Could you please add me?
12.12.2024 09:27