Rudolf Arseni Braun fasttosmile

Maybe it's good to play games because it's a way to get a verifiable reward signal for one's thoughts.

19.03.2025 18:21 — 👍 0 🔁 0 💬 0 📌 0

100% agree on the fly UIs is the future. It will still take a bit for our models to be good enough at generating dense data (i.e. not text) but it will happen.
Is tricky to imagine and reason about the downstream effects, wish I could get a glimpse of the future!

14.12.2024 23:26 — 👍 0 🔁 0 💬 0 📌 0

CTCLoss gradient is incorrect · Issue #52241 · pytorch/pytorch 🐛 Bug Hi, While working on some CTC extensions, I noticed that torch's CTCLoss was computing incorrect gradient. At least when using CPU (I have not tested on GPU yet). I observed this problem on b...

I just learned that Torch ctc_loss calculates the wrong gradient (but when there was log_softmax before, it does not matter).

For the grad ctc_loss w.r.t. log_probs, it calculates exp(log_probs) - y, but correct would be -y. Some workaround: github.com/pytorch/pyto...

PS: First Bluesky post.

26.11.2024 23:16 — 👍 10 🔁 2 💬 0 📌 0

Observing the responses here and on twitter made me reflect, realize and act. There is a real difference, and I describe it (as well as some of the non-differences) here:

gist.github.com/yoavg/9142e5...

23.11.2024 22:35 — 👍 12 🔁 4 💬 0 📌 1

Posts by Rudolf Arseni Braun (@fasttosmile.bsky.social)