Saurabh Prasad's Avatar

Saurabh Prasad

@saurabhprasad-2.bsky.social

πŸ’™ Software Architect & Sr. Fullstack Engineer / Dev - Java, JS, Python, ML/AI Other interests: CompSci, Tech, Startups, Academia, Research, Electronics, Mathematics, Physics, Space, Science, AR/VR, Robotics, Guitar, Piano, Sci-fi, Languages

1,929 Followers  |  9,112 Following  |  29 Posts  |  Joined: 24.11.2024  |  2.2285

Latest posts by saurabhprasad-2.bsky.social on Bluesky

There is a restaurant that serves Chinese & Italian food, located in the New Delhi metro area in India, named 'Chao Bella'. πŸ˜„

Here's their Insta page:
www.instagram.com/chaobellacpo...

08.01.2025 08:36 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Here's why "alignment research" when it comes to LLMs is a big mess, as I see it.

Claude is not a real guy. Claude is a character in the stories that an LLM has been programmed to write. Just to give it a distinct name, let's call the LLM "the Shoggoth".

19.12.2024 23:15 β€” πŸ‘ 288    πŸ” 79    πŸ’¬ 9    πŸ“Œ 36


Happy new year, Bluesky peeps! πŸŽ‰πŸ₯³

Wish you all a very happy, healthy, and prosperous 2025! πŸ’

01.01.2025 08:06 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - nedjmaou/CMT122-2425-Resources: Resources for CMT122 students (2024-2025). Resources for CMT122 students (2024-2025). Contribute to nedjmaou/CMT122-2425-Resources development by creating an account on GitHub.

@camachocollados.bsky.social and I taught ML for #nlp last semester.
Here is a list of resources that we shared with the students (sorted by theme) in this blogpost: github.com/nedjmaou/CMT...
[Not all students were familiar with coding so it also includes resources for beginners.]

24.12.2024 11:24 β€” πŸ‘ 27    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Preview
AIs Will Increasingly Fake Alignment This post goes over the important and excellent new paper from Anthropic and Redwood Research, with Ryan Greenblatt as lead author, Alignment Faking in Large Language Models.

LLMs faking alignment during training. #mlsky

thezvi.substack.com/p/ais-will-i...

24.12.2024 18:04 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
NeurIPS2024 Related RL papers | Notion Deep RL papers

#NeurIPS2024 wrapped up last week. I put together a curated reading list for #DeepRL and #reinforcementlearning work. (represents my interests).

Talks and workshops:
third-crowd-c77.notion.site/NeurIPS2024-...

Curated reading list
fracturedplane.notion.site/NeurIPS2024-...

#Holidayreading

23.12.2024 19:38 β€” πŸ‘ 70    πŸ” 15    πŸ’¬ 0    πŸ“Œ 0

Interviewer: Can you explain this gap in your resume?

Physicist: I can only tell you about my momentum at that time, not the position

22.12.2024 23:37 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

One of the most bizarre + complex talks at NeurIPS [1] was given by my fellow Yorkshireman, the inimitable Prof Karl Friston [2], explaining active inference to a room full of #AI people who are not really neuroscientists. This was interesting to me because... (1/n)

22.12.2024 06:57 β€” πŸ‘ 32    πŸ” 6    πŸ’¬ 3    πŸ“Œ 0
Preview
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization Language models (LMs), like other neural networks, often favor shortcut heuristics based on surface-level patterns. Although LMs behave like n-gram models early in training, they must eventually learn...

Transformer LMs get pretty far by acting like ngram models, so why do they learn syntax? A new paper by sunnytqin.bsky.social, me, and @dmelis.bsky.social illuminates grammar learning in a whirlwind tour of generalization, grokking, training dynamics, memorization, and random variation. #mlsky #nlp

20.12.2024 17:55 β€” πŸ‘ 136    πŸ” 32    πŸ’¬ 5    πŸ“Œ 1
Preview
Supporting GLP-1 prescribing with digital twin technology | TechTarget Learn how digital twin technology could support clinical decision-making for providers prescribing GLP-1s.

πŸ§ͺ Emory leverages #DigitaTwin tech to guide GLP-1 prescriptions for weight and diabetes care. An AI model simulates clinical decisions, providing expert-level support for patient suitability. πŸ©ΊπŸ’» #MLSky

20.12.2024 15:25 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Preview
The Year in Computer Science | Quanta Magazine Researchers got a better look at chatbots’ thoughts, amateurs learned just how complicated simple systems can be, and codes became expert self-fixers.

The ever elusive quantum computer moved a few crucial steps toward practical applications this year.
www.quantamagazine.org/the-year-in-...

19.12.2024 16:04 β€” πŸ‘ 63    πŸ” 23    πŸ’¬ 0    πŸ“Œ 3
Preview
The Year in Biology | Quanta Magazine Biologists used artificial intelligence to make discoveries about molecules and the brain, and overturned long-held assumptions about the immune system and RNA.

Read about this year’s biggest moments in biology.
www.quantamagazine.org/the-year-in-...

18.12.2024 15:34 β€” πŸ‘ 58    πŸ” 22    πŸ’¬ 1    πŸ“Œ 4
Preview
The Year in Physics | Quanta Magazine Physicists discovered strange supersolids, constructed new kinds of superconductors, and continued to make the case that the cosmos is far weirder than anyone suspected.

This year, researchers detected a hint of a signal that, if real, could upend and deepen our understanding of the fundamental laws of the universe. Read our annual review of the biggest developments in physics:
www.quantamagazine.org/the-year-in-...

17.12.2024 16:00 β€” πŸ‘ 51    πŸ” 17    πŸ’¬ 1    πŸ“Œ 1
Preview
The Year in Math | Quanta Magazine Landmark results in geometry and number theory marked an exciting year for mathematics, at a time when advances in artificial intelligence are starting to transform the subject’s future.

Here are the biggest breakthroughs that happened in mathematics this year.
www.quantamagazine.org/the-year-in-...

16.12.2024 18:00 β€” πŸ‘ 75    πŸ” 25    πŸ’¬ 2    πŸ“Œ 2
Post image

I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧡

19.12.2024 16:45 β€” πŸ‘ 628    πŸ” 148    πŸ’¬ 19    πŸ“Œ 34
Preview
Finally, a Replacement for BERT: Introducing ModernBERT We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Great blog post (by a 15-author team!) on their release of ModernBERT, the continuing relevance of encoder-only models, and how they relate to, say, GPT-4/llama. Accessible enough that I might use this as an undergrad reading.

19.12.2024 19:11 β€” πŸ‘ 76    πŸ” 19    πŸ’¬ 1    πŸ“Œ 2
Picture of a plaque assay (purple field with clear holes) showing gorgeous poliovirus plaques

Picture of a plaque assay (purple field with clear holes) showing gorgeous poliovirus plaques

My replies are perpetually full of anti-vaxxers these days telling me about polio vaccines.

Not shockingly, most of what they are saying is wrong. Luckily, I trained with Vincent Racaniello & he taught me a few things about poliovirus.

So let’s discuss the king of the PicornaviridaeπŸ‘‡πŸ»

18.12.2024 13:07 β€” πŸ‘ 700    πŸ” 263    πŸ’¬ 36    πŸ“Œ 63
Preview
The AI Pioneer With Provocative Plans for Humanity | Quanta Magazine While some fret about technology’s social impacts, Raj Reddy still believes in the power of artificial intelligence to improve lives.

Artificial intelligence is going to improve productivity. That will also create more wealth. How do you keep AI-enhanced productivity from only benefiting the wealthy? Raj Reddy, one of the pioneers of AI research, has ideas. www.quantamagazine.org/the-ai-pione...

04.12.2024 15:06 β€” πŸ‘ 37    πŸ” 14    πŸ’¬ 7    πŸ“Œ 5
Preview
Infamous paper that popularized unproven COVID-19 treatment finally retracted Study on hydroxychloroquine by Didier Raoult and colleagues gets pulled on ethical and scientific grounds

It took four days from submission to publication, and nearly five years from publication to retraction. After campaigning by many, many scientists, and an investigation by Elsevier, an infamous paper on hydroxychloroquine as a COVID-19 treatment has been retracted. πŸ§ͺ
www.science.org/content/arti...

17.12.2024 17:31 β€” πŸ‘ 1233    πŸ” 516    πŸ’¬ 38    πŸ“Œ 65
Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"
YouTube video by seremot Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

Ilya Sutskever full talk "Sequence to sequence learning with neural networks: what a decade" at NeurIPS 2024 in Vancouver, Canada.

www.youtube.com/watch?v=1yvB...

15.12.2024 21:16 β€” πŸ‘ 49    πŸ” 8    πŸ’¬ 0    πŸ“Œ 1
A Bibliography Database for Machine Learning Getting the correct bibtex entry for a conference paper (e.g. published at NeurIPS, ICML, ICLR) is annoyingly hard: if you search for the title, you will often find a link to arxiv or to the pdf file,...

Want all NeurIPS/ICML/ICLR papers in one single .bib file? Here you go!

πŸ—žοΈ short blog post: fabian-sp.github.io/posts/2024/1...

πŸ“‡ bib files: github.com/fabian-sp/ml-bib

17.12.2024 10:42 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Fun fact related to this thread:

Do you know what brought the ReLU activation function back into vogue in AI?

It was this paper from Yoshua's group, which was motivated by the observation that ReLU is a better match to real neurons' IO-functions!

proceedings.mlr.press/v15/glorot11a

πŸ§ πŸ“ˆ #NeuroAI

17.12.2024 17:35 β€” πŸ‘ 65    πŸ” 16    πŸ’¬ 4    πŸ“Œ 0

1/ Okay, one thing that has been revealed to me from the replies to this is that many people don't know (or refuse to recognize) the following fact:

The unts in ANN are actually not a terrible approximation of how real neurons work!

A tiny 🧡.

πŸ§ πŸ“ˆ #NeuroAI #MLSky

16.12.2024 20:03 β€” πŸ‘ 153    πŸ” 39    πŸ’¬ 21    πŸ“Œ 17
Post image

🌟Noether's razor⭐️ Our NeurIPS 2024 paper connects ML symmetries to conserved quantities through a seminal result in mathematical physics: Noether's theorem. We can learn neural network symmetries from data by learning associated conservation laws. Learn moreπŸ‘‡. 1/16🧡

06.12.2024 13:42 β€” πŸ‘ 100    πŸ” 13    πŸ’¬ 3    πŸ“Œ 2
Post image

🎯 How can we empower scientific discovery in millions of nature photos?

Introducing INQUIRE: A benchmark testing if AI vision-language models can help scientists find biodiversity patterns- from disease symptoms to rare behaviors- hidden in vast image collections.

ThreadπŸ‘‡πŸ§΅

06.12.2024 20:28 β€” πŸ‘ 88    πŸ” 33    πŸ’¬ 3    πŸ“Œ 3
Preview
Gukesh Dommaraju becomes youngest world chess champion after horrific Ding Liren blunder * Indian teenager becomes 18th world chess champion * Modi hails β€˜historic and exemplary’ achievement * Move-by-move report of Game 14 – as it happened * Play through 22 famous world championship games Indian teenager Gukesh Dommaraju capped a…

Gukesh Dommaraju becomes youngest world chess champion after horrific Ding Liren blunder

12.12.2024 22:24 β€” πŸ‘ 149    πŸ” 20    πŸ’¬ 7    πŸ“Œ 6
Video thumbnail

Have you ever wondered how training dynamics differ between LLMs πŸ–‹οΈ and Vision πŸ‘οΈ models? We explore this and close the gap between VMs and LLMs in our #NeurIPS2024 paper "TrAct: Making First-layer Pre-Activations Trainable".
Paper link πŸ“œ: arxiv.org/abs/2410.23970
Video link πŸŽ₯: youtu.be/ZjTAjjxbkRY
🧡

04.12.2024 18:39 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

It's never too late to start learning a new instrument πŸ˜„

12.12.2024 17:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

That's so cool!

I'm a software architect & engineer/developer, but I enjoy playing the guitar & piano as a hobby! 🎸🀘🏼😊

12.12.2024 14:50 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Could you please add me πŸ‘‹πŸ˜Š

12.12.2024 09:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@saurabhprasad-2 is following 20 prominent accounts