Ivan Rubachev

@puhsu.bsky.social

ML Researcher at research.yandex.com | Working on DL for Tabular Data

286 Followers  |  1,208 Following  |  17 Posts  |  Joined: 08.02.2024

Latest posts by puhsu.bsky.social on Bluesky

this?

03.02.2026 23:54 - 👍 2    🔁 0    💬 0    📌 0

How hard can it be to build a browser from scratch for three platforms anyways?

Apparently 20K lines of code and ~70 hours from first commit to last.

emsh.cat/one-human-on...

#llm #llms #ai #codex #openai

27.01.2026 13:26 - 👍 37    🔁 9    💬 4    📌 2
The Importance of Diversity
I read Dario’s The Adolescence of Technology and it’s scary. It assumes the perspective of a top-down ruler, that someone can and will get to control AI. This is taken as a given. Machines of Loving G...

I liked the response better geohot.github.io//blog/jekyll...

27.01.2026 18:28 - 👍 5    🔁 0    💬 1    📌 0

if you’re going to use AI in your workflow, you have to get extremely good at self-discipline/focus because AI will literally tempt you to pursue every tiny whim/idea that enters your brain and thus will absolutely destroy you and your work if left unchecked

slow down before it’s too late

21.01.2026 16:38 - 👍 101    🔁 12    💬 6    📌 2
Rust's standard library on the GPU
GPU code can now use Rust's standard library. We share the implementation approach and what this unlocks for GPU programming.

We are excited to announce that we can successfully use Rust's standard library from the GPU. This has never been done before.

www.vectorware.com/blog/rust-st...

Supporting Rust's standard library enables existing Rust code to work on the GPU and makes GPU programming feel normal.

20.01.2026 15:39 - 👍 247    🔁 57    💬 4    📌 6
Knowing Less, Producing More
On AI, Craftsmanship, and the Slow Erosion of Productive Friction.

www.d12frosted.io/posts/2025-1...

19.01.2026 21:14 - 👍 1    🔁 0    💬 0    📌 0
A Social Filesystem - overreacted
Formats over apps.

formats over apps

18.01.2026 07:05 - 👍 735    🔁 175    💬 57    📌 76
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
In-context learning allows models like transformers to adapt to new tasks from a few examples without updating their weights, a desirable trait for reinforcement learning (RL). However, existing in-co...

Explicitly adding induction heads helps. Some gains in NLP, seemingly bigger in RL algorithm distillation arxiv.org/abs/2411.01958

04.12.2024 17:58 - 👍 1    🔁 0    💬 1    📌 0

⚡️

03.12.2024 12:51 - 👍 0    🔁 0    💬 0    📌 0
Day 1 - Advent of Code 2024

I just completed "Historian Hysteria" - Day 1 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/1 (in zig btw)

01.12.2024 15:56 - 👍 1    🔁 1    💬 1    📌 0

Yep, just need to find the code. I can share

29.11.2024 08:23 - 👍 1    🔁 0    💬 1    📌 0

Yeah. I've experimented a bit with the existing code. It generalized to some of our specific problems in tabular DL (even though the meta-train was mostly from language and vision tasks). Curious what you mean by "actually worked" here: no edge cases and failures, or just easy to use technically?

29.11.2024 07:37 - 👍 0    🔁 0    💬 1    📌 0
Post image

#MLsky

29.11.2024 05:32 - 👍 69    🔁 6    💬 0    📌 1

The rejects were horribly misinformed, self-contradictory, but extremely confident. PSGD, SOAP and friends are taking over regardless of academia.

28.11.2024 20:17 - 👍 0    🔁 2    💬 0    📌 0
VeLO: Training Versatile Learned Optimizers by Scaling Up
While deep learning models have replaced hand-designed features across many domains, these models are still trained with hand-designed optimizers. In this work, we leverage the same scaling approach b...

VeLO was something else, I’m a fan arxiv.org/abs/2211.09760

29.11.2024 05:18 - 👍 5    🔁 0    💬 3    📌 0
Post image

Thank you @bsky.app team for correcting the mistake. Glad to be back!

28.11.2024 20:00 - 👍 304    🔁 24    💬 39    📌 32

Did you know that 99% of email today is spam? Your inbox isn’t 99% spam because AI is used to filter it.

The same 99% will happen here too, but if AI researchers continue to get perma-banned for making available the datasets needed to filter it, it’s going to make this platform unusable.

28.11.2024 18:12 - 👍 512    🔁 64    💬 42    📌 25

@trl-research.bsky.social

26.11.2024 18:42 - 👍 2    🔁 1    💬 0    📌 0
How AutoML Creates New Opportunities for Europe - Frank Hutter // CyberValley Podcast #5
YouTube video by Cyber Valley

Tabular DL and AutoML podcast just dropped. For sure watching this

youtu.be/3qpQ-sMRafE

26.11.2024 18:42 - 👍 11    🔁 2    💬 1    📌 0
Post image

Hello to all #ICLR reviewers on #MLsky

25.11.2024 04:47 - 👍 27    🔁 4    💬 0    📌 2

bsky.app/profile/hame...

25.11.2024 04:08 - 👍 4    🔁 1    💬 0    📌 0

But keep the numbers in appendix or code pls

So annoying when the only info is in visual form with unclear axes etc. I agree that it’s much better for presentation, but when digging in, I often need raw metrics.

24.11.2024 09:10 - 👍 4    🔁 0    💬 1    📌 0
GitHub - Bossett/bsky-feeds
Contribute to Bossett/bsky-feeds development by creating an account on GitHub.

…extent of customisability?

If I understand correctly, we can do a lot with custom feeds.

Some examples here github.com/Bossett/bsky...

18.11.2024 20:19 - 👍 0    🔁 0    💬 0    📌 0
Custom Feeds | Bluesky
Custom feeds, or feed generators, are services that provide custom algorithms to users through the AT Protocol. This allows users to choose their own timelines, whether it's an algorithmic For You pag...

Wow. Didn’t know we can create custom algorithmic feeds here. This is cool! What are your favourites, what’s the extent of

(context: docs.bsky.app/docs/starter...)

18.11.2024 20:17 - 👍 1    🔁 0    💬 1    📌 0
Paper screenshot and Figure 1 (c) with cumulative ablations for components of RealMLP-TD.

Can deep learning finally compete with boosted trees on tabular data? 🌲
In our NeurIPS 2024 paper, we introduce RealMLP, a NN with improvements in all areas and meta-learned default parameters.
Some insights about RealMLP and other models on large benchmarks (>200 datasets): 🧵

18.11.2024 14:15 - 👍 60    🔁 9    💬 1    📌 7
Table Representation Learning researchers
Join the conversation

WIP starterpack w researchers on Table Representation Learning (TRL): all things related to representation learning and generative models for e.g. tables, DBs, spreadsheets!

I'll curate but DM/reply w handle+some info welcome! Also follow @trl-research.bsky.social for updates 🤗

go.bsky.app/4SNSMRj

18.11.2024 10:48 - 👍 24    🔁 8    💬 8    📌 1

@puhsu is following 20 prominent accounts