rohit's Avatar

rohit

@rnair.bsky.social

agi whisperer

1,177 Followers  |  1,191 Following  |  175 Posts  |  Joined: 11.04.2023  |  1.6065

Latest posts by rnair.bsky.social on Bluesky

claude 3.7 performance on Pokemon

claude 3.7 performance on Pokemon

of all the benchmarks being sent around for Claude 3.7 this is the one i'm paying the most attention to. they're cheating a little bit by giving it the oldest, original pokemon game (red/blue) which is more than 20 years old and will have plenty of info online to learn from.

24.02.2025 19:39 β€” πŸ‘ 30    πŸ” 3    πŸ’¬ 3    πŸ“Œ 2

sheesh! what a day for AI. QwQ-Max, sonnet-3.7 AND open source of FlashMLA

24.02.2025 21:31 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

the only interview question i need to ask is:

how many sqlite databases do you have on your machine and what do you use them for?

if you're able to answer with a specific number, you're not ready yet unless you have a custom cron job cleaning up unused sqlite files.

25.02.2025 02:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
a screen of various ai assistants hinting at the user's digital hoarding proclivities

a screen of various ai assistants hinting at the user's digital hoarding proclivities

i have a hoarding problem

25.02.2025 02:27 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

opening discord though on drop days, that's a whole other beast.

25.02.2025 02:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

i don't check social media for 8h and people are already playing with claude 3.7 sonnet

25.02.2025 02:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

edtech companies who've amassed a bank of their own content in the last decade are sitting on a goldmine.

15.02.2025 19:29 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

saving all of my generated deepseek r1's reasoning traces to a USB drive so when the time comes i can send it back to my 12 year old self to prevent him from bashing his head against the desk for being stuck on the 2nd problem of an AMC 12.

15.02.2025 19:22 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

good thing i'm intentionally curating everything to discreetly bend the ai agents to my will.

17.01.2025 05:39 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - deepseek-ai/DeepSeek-V3 Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.

DeepSeek, a LLM trained for a fraction of the cost of GPT-Xx models, in 2 months for 6 million, on limited GPUs due to export restrictions, and competing head to head. This is crazy.

It's not the AI part I'm excited about, it's the level of efficiency. github.com/deepseek-ai/...

31.12.2024 17:07 β€” πŸ‘ 271    πŸ” 37    πŸ’¬ 10    πŸ“Œ 10

engineering blogs and white papers from the heyday of pre-AI tech are a treasure trove of insights and decision making processes without the burden of validating AI slop.

don't discount them merely on their being of outdated.

31.12.2024 18:06 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

plus if you do decide to pause the 9-5 lifestyle to become an AI consultant, you'll have a solid foundation and not have to start from scratch.

31.12.2024 17:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

if you're a cs student aspiring to become a software engineer in 2025, make sure to hone adjacent skills like writing, marketing, & sales.

in the age of agents, establishing human connection & building an audience organically will make set you apart from the rest.

31.12.2024 17:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Gonna think more thoughts in 2025

31.12.2024 17:04 β€” πŸ‘ 88    πŸ” 10    πŸ’¬ 8    πŸ“Œ 1
thumbnail that says introducing smolagents

thumbnail that says introducing smolagents

supercharge your LLM apps with smolagents πŸ”₯

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by @hf.co to make the LLM write code, do analysis and automate boring stuff! huggingface.co/blog/smolage...

31.12.2024 15:32 β€” πŸ‘ 88    πŸ” 17    πŸ’¬ 2    πŸ“Œ 3

the intersection between hardcore biotech, precision therapeutics, & AI is an interesting one.

i'm particularly bullish about the work being done with organoid intelligence - mainly because of more energy efficient computing as well as possible contributions to neurodegenerative disease research

13.12.2024 04:21 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

every PR is another step towards the dream of having a universal near zero latency jarvis+second brain ai mesh

seemed like a gargantuan task back when i discovered obsidian/roam in college but current tech & capabilities make it easier to bootstrap a decent representation of this.

13.12.2024 04:03 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
smol.ai News and Hackathons for AI Engineers!

the smol.ai newsletter is truly a godsend (thanks @swyx.io)

with all of the model releases this week, neurips, and discord chats popping off, having a single place to start from really helps

highly recommend.

13.12.2024 03:53 β€” πŸ‘ 18    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0


when you've been on the internet as long as i have, you would understand that everything can be used for anything and leaves a trail.

the difference today is a lot more security theatre & forced transparency

it's why i've always written stuff envisioning that they'd be immortalized by a super AGI

27.11.2024 03:27 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

annoy an ML engineer with these simple phrases:

"cosine distance"
"L2 similarity"
"but did you ship it?"

26.11.2024 22:38 β€” πŸ‘ 56    πŸ” 3    πŸ’¬ 5    πŸ“Œ 1
Post image Post image Post image Post image

My deep learning course at the University of Geneva is available on-line. 1000+ slides, ~20h of screen-casts. Full of examples in PyTorch.

fleuret.org/dlc/

And my "Little Book of Deep Learning" is available as a phone-formatted pdf (nearing 700k downloads!)

fleuret.org/lbdl/

26.11.2024 06:15 β€” πŸ‘ 1265    πŸ” 254    πŸ’¬ 50    πŸ“Œ 17

wanna become a prolific open source contributor?

just try using open source llm agent frameworks with rough asynchronous programming patterns in prod

26.11.2024 01:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@georgehotz.bsky.social is here! Bluesky is going to be so much fun.

24.11.2024 04:41 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

deploying llm apps on modal labs at night is a much better experience than my daytime woes with terraform, cloud build, k8s, and docker

24.11.2024 04:42 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

lol nah I think i'll stick to modal

24.11.2024 04:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
a cartoon of superman flying through the air with a red cape . ALT: a cartoon of superman flying through the air with a red cape .
24.11.2024 04:23 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

distributed systems man. why do we do this to ourselves

24.11.2024 04:05 β€” πŸ‘ 609    πŸ” 57    πŸ’¬ 31    πŸ“Œ 6

the good thing about this new generation of voice assistants is that everyone can have their own hunter s thompson like narrators for the most mundane parts of their daily routines

23.11.2024 18:57 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
rohit's mind palace - a menagerie of tech, books, posters, holograms and chessboards

rohit's mind palace - a menagerie of tech, books, posters, holograms and chessboards

i finally asked chatgpt to generate an image given what it knew about me.

spot on

23.11.2024 18:50 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Currently OpenAI o1 has sparked a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mat...

Alibaba has their own version on GPT-o1. This might be the best description of β€œo1-type”systems so far arxiv.org/abs/2411.14405

22.11.2024 12:18 β€” πŸ‘ 268    πŸ” 38    πŸ’¬ 9    πŸ“Œ 2

@rnair is following 20 prominent accounts