
Rudolf Arseni Braun

@fasttosmile.bsky.social

Here for AI stuff. Currently ASR@AWS. Sometimes write on rudolfarseni.me

18 Followers  |  129 Following  |  2 Posts  |  Joined: 19.11.2024

Posts by Rudolf Arseni Braun (@fasttosmile.bsky.social)

Maybe it's good to play games because it's a way to get a verifiable reward signal for one's thoughts.

19.03.2025 18:21  |  👍 0    🔁 0    💬 0    📌 0

100% agree that on-the-fly UIs are the future. It will still take a while for our models to be good enough at generating dense data (i.e. not text), but it will happen.
It's tricky to imagine and reason about the downstream effects; wish I could get a glimpse of the future!

14.12.2024 23:26  |  👍 0    🔁 0    💬 0    📌 0

CTCLoss gradient is incorrect · Issue #52241 · pytorch/pytorch 🐛 Bug Hi, While working on some CTC extensions, I noticed that torch's CTCLoss was computing incorrect gradient. At least when using CPU (I have not tested on GPU yet). I observed this problem on b...

I just learned that Torch ctc_loss calculates the wrong gradient (though when log_softmax is applied beforehand, it does not matter).

For the gradient of ctc_loss w.r.t. log_probs, it calculates exp(log_probs) - y, but the correct value would be -y. A workaround: github.com/pytorch/pyto...
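A rough sketch of why a preceding log_softmax hides the difference, in plain Python with no torch dependency. It assumes y is a per-frame posterior over labels that sums to 1; the logits, the values of y, and the helper names are made up for illustration and are not taken from the linked issue.

```python
import math

def log_softmax(logits):
    # Numerically stable log-softmax over one frame.
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def backprop_log_softmax(log_probs, grad_out):
    # Chain rule through log_softmax:
    # grad_in[i] = grad_out[i] - softmax[i] * sum(grad_out)
    total = sum(grad_out)
    return [g - math.exp(lp) * total for lp, g in zip(log_probs, grad_out)]

# Hypothetical single frame with three labels.
logits = [0.5, -1.0, 2.0]
y = [0.2, 0.1, 0.7]  # assumed posteriors, sum to 1
log_probs = log_softmax(logits)

# The two candidate gradients w.r.t. log_probs described in the post.
grad_reported = [math.exp(lp) - yi for lp, yi in zip(log_probs, y)]
grad_expected = [-yi for yi in y]

# After backprop through log_softmax, both reduce to softmax - y.
grad_logits_reported = backprop_log_softmax(log_probs, grad_reported)
grad_logits_expected = backprop_log_softmax(log_probs, grad_expected)

max_diff = max(abs(a - b) for a, b in
               zip(grad_logits_reported, grad_logits_expected))
print("max difference w.r.t. logits:", max_diff)
```

The two gradients differ with respect to log_probs, but their difference is exp(log_probs), which the log_softmax backward pass cancels whenever y sums to 1 per frame.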

PS: First Bluesky post.

26.11.2024 23:16  |  👍 10    🔁 2    💬 0    📌 0

Observing the responses here and on twitter made me reflect, realize and act. There is a real difference, and I describe it (as well as some of the non-differences) here:

gist.github.com/yoavg/9142e5...

23.11.2024 22:35  |  👍 12    🔁 4    💬 0    📌 1