sankalp (dejavucoder)'s Avatar

sankalp (dejavucoder)

@dejavucoder.bsky.social

into applied ai + product engg interested in all things ai and distributed systems

99 Followers  |  54 Following  |  10 Posts  |  Joined: 26.11.2024  |  1.5378

Latest posts by dejavucoder.bsky.social on Bluesky

Preview
Alex L. Zhang | A Meticulous Guide to Advances in Deep Learning Efficiency over the Years A very long and thorough guide how deep learning algorithms, hardware, libraries, compilers, and more have become more efficient.

bookmarking here to read this soon
alexzhang13.github.io/blog/2024/ef...

09.01.2025 20:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
The state of post-training in 2025 A re-record of my NeurIPS tutorial on language modeling (plus some added content).

The state of post-training in 2025: a tutorial on modern post-training
A re-record of my NeurIPS tutorial on language modeling (plus some added content on the high level state of things)
Blog + extra context: https://buff.ly/424VvLm
YouTube: https://buff.ly/40808l5
Slides: https://buff.ly/404jGa9

08.01.2025 15:38 β€” πŸ‘ 80    πŸ” 17    πŸ’¬ 4    πŸ“Œ 0
Preview
The Evolution of AI-assisted coding features and developer interaction patterns Yes, I agree that's a fancy title. There have been several developments over the last 7 years in the AI-assisted coding arena. We have gone from simple autoc...

new blog post

Evolution of AI-assited coding features and developer interaction patterns. I go through the history of progression of ai-assisted coding features, talk about how we interact with them and a Gears analogy control vs speed tradeoff

sankalp.bearblog.dev/evolution-of...

21.12.2024 19:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Post image

First slide deck for NeurIPS is done -- an overview of how I view post-training for applications.
A higher level summary on the key decisions along the way of scoping a problem, choosing a base model, optimization algorithm, etc. (+some thoughts on OpenAI's RL Finetuning).

https://buff.ly/3ZpY5IR

09.12.2024 19:04 β€” πŸ‘ 34    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

agent orchestrator more like agent pimp

04.12.2024 17:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

will check this out for synthetic data creation and evals

04.12.2024 17:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
OpenAI's o1 using "search" was a PSYOP How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought

New post! OpenAI's o1 using "search" was a PSYOP.
How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought.

A fun one trying to communicate intuitions for what large scale RL training does to LLMs. Much more to explore here in 2025!

04.12.2024 15:33 β€” πŸ‘ 38    πŸ” 6    πŸ’¬ 3    πŸ“Œ 0

Wow, this is such a useful resource of industry LLM applications! And filtering via search/tags is so responsive. I was thinking of compiling something like this over the holidays (ala applied-ml) but thanks to @strickvl.bsky.social I can spend the time reading instead β™₯️

zenml.io/llmops-datab...

03.12.2024 01:54 β€” πŸ‘ 50    πŸ” 4    πŸ’¬ 3    πŸ“Œ 0

lmao

28.11.2024 09:47 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

this is kinda nice

26.11.2024 14:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

same haha. planning to spend some time on bluesky to check ai discussions and meet mutuals who are more active here

26.11.2024 14:49 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

hello sir

26.11.2024 14:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

we are planning to read this blog blog.dottxt.co

26.11.2024 14:43 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

hello world

26.11.2024 14:41 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@dejavucoder is following 17 prominent accounts