Tian and Karolina and team are at ICLR. Come say hi.
21.04.2025 13:00 · 👍 7 🔁 1 💬 0 📌 0
@roydanroy.bsky.social
Research Director, Founding Faculty, Canada CIFAR AI Chair @VectorInst. Full Prof @UofT - Statistics and Computer Sci. (x-appt) danroy.org I study assumption-free prediction and decision making under uncertainty, with inference emerging from optimality.
Curious. Didn't know Meta had a PPL team.
07.04.2025 01:13 · 👍 1 🔁 0 💬 0 📌 0
I like to think about non-reasoning model responses as vibes.
07.04.2025 00:50 · 👍 2 🔁 0 💬 0 📌 0
So who's read the 2027 article? What do you think?
07.04.2025 00:47 · 👍 1 🔁 0 💬 3 📌 0
Someone has suggested I check out bsky again. So I'm back looking around here. Notification list is kinda boring. So any good conversations going on? Perhaps about LLM/AI reasoning?
23.03.2025 21:10 · 👍 34 🔁 0 💬 8 📌 1
Of course.
23.03.2025 21:04 · 👍 0 🔁 0 💬 0 📌 0
Anyone else have the worry that a lot of LLM research is ... just bad psychology?
03.02.2025 03:26 · 👍 69 🔁 2 💬 10 📌 2
And, to achieve the results in this paper, what was the most challenging part? Why had previous attempts fallen short? What was your key new insight?
03.02.2025 03:25 · 👍 2 🔁 0 💬 2 📌 0
Very interesting. So, what was the biggest hole to fill, in terms of hypotheses?
02.02.2025 17:19 · 👍 3 🔁 0 💬 1 📌 0
Okay, so just a few* thoughts (*this got longer as I wrote ... long thread)
08.01.2025 12:40 · 👍 53 🔁 32 💬 1 📌 7
Acknowledgments.
08.01.2025 16:56 · 👍 7 🔁 0 💬 0 📌 0
I got to ski Revelstoke this winter break.
Couple observations: the price of receiving 600 cm of snow by Jan 8 is that it is constantly snowing. Saw almost no sun the whole time and the peak was often in whiteout conditions (though North Bowl was always clear...).
See image for more.
Multiple friends have likely lost their homes in Los Angeles. Can't imagine how disorienting this would be. They had only minutes to flee and grab belongings.
08.01.2025 16:55 · 👍 7 🔁 0 💬 0 📌 0
What are the key papers to read?
30.12.2024 20:36 · 👍 6 🔁 0 💬 1 📌 0
OK. Practical question times. How are you adjusting your research given progress in reasoning style models? Also how are you adjusting the way you work?
22.12.2024 07:39 · 👍 62 🔁 4 💬 12 📌 1
A $100,000,000 experiment is no longer "consequence" free. Ilya is saying "scaling is over", but this may simply be that the scaling "laws" (not laws) are no longer accurate. Also, those laws are tied to hyperparameter tunings.
15.12.2024 13:08 · 👍 3 🔁 0 💬 1 📌 0
Sure some were empirical. Some were not.
14.12.2024 21:27 · 👍 0 🔁 0 💬 0 📌 0
I'd say no in a sense. Xavier-He initialization was theoretical work. And that was absolutely critical.
14.12.2024 01:47 · 👍 2 🔁 0 💬 1 📌 0
Pretraining is not done. It's just that theorists haven't told the hackers how to do it better.
13.12.2024 22:52 · 👍 35 🔁 0 💬 3 📌 0
Annoying. If it could be automatic, sure.
13.12.2024 00:14 · 👍 0 🔁 0 💬 1 📌 0
I'd say wait then.
12.12.2024 22:37 · 👍 1 🔁 0 💬 0 📌 0
That's part of the spec. I don't think this is too problematic. The example they give is problems in NP, where there is a polynomial time checker (i.e., a polytime EV), but generating an instance that passes the checker is hard in the worst case.
12.12.2024 22:37 · 👍 1 🔁 0 💬 0 📌 0
Now that I've had a taste of X without post length limitations, I've got to say that it is quite annoying to have to fit tweets into 256 characters here on bsky. On X, when they get too long, they go below the fold, and so you're still incentivized to make it short. Can't we have that here?
12.12.2024 22:35 · 👍 24 🔁 1 💬 9 📌 2
Lottery ticket?
11.12.2024 22:18 · 👍 3 🔁 0 💬 1 📌 0
@gkdziugaite.bsky.social. Works at GDM and Mila. Influential, technical work.
11.12.2024 13:20 · 👍 3 🔁 0 💬 1 📌 0
OK
11.12.2024 13:17 · 👍 0 🔁 0 💬 1 📌 0
Many of these sound very problematic if you hope that the result would be accepted by the mathematical community. E.g. "The proof appears to use computational evidence (listing out cases) as a substitute for theoretical proof." It seems you're not meeting the usual standard.
11.12.2024 01:37 · 👍 0 🔁 0 💬 1 📌 0
Please ask Claude: what would likely be the chief criticisms of my argument above were I to submit it to a traditional mathematical journal?
11.12.2024 00:39 · 👍 1 🔁 0 💬 1 📌 0
I've now read this paper carefully if anyone wants to discuss it.
10.12.2024 22:50 · 👍 6 🔁 0 💬 4 📌 0
Great analogy.
06.12.2024 22:04 · 👍 0 🔁 0 💬 0 📌 0