@sharky6000.bsky.social while GitHub is kind of up and down, you might be interested in taking a look
I am already thinking about an awesome meta-game solving papers repository to track the progress...
@annoyingreposter.bsky.social
Sure, but while IL can work with IID data, RL -- I am not sure about that. That's why we need special assumptions about the data, which you highlighted in your post. That answers my initial question, xD
09.02.2026 18:12
what about a GitHub repo/blog where we read some papers, by hand, and try to understand their role in the context of modern science?
09.02.2026 17:52
for imitation learning (same as behavioural cloning), I guess, the data has to be as independent as possible (the Markov assumption isn't really a requirement). More or less the same as with other Monte Carlo methods :) The question is why temporal dependence might be pivotal for exploiting the data
09.02.2026 17:31
something about the Markov property of the RL data
09.02.2026 15:38
I have updated my tutorial on making Vision Language Action models. This tutorial starts with a basic Transformer and walks people through the steps to transform it into a full VLA that uses PaliGemma as the pretrained VLM. Links below.
09.02.2026 14:15
what about temporal dependencies in the data
09.02.2026 13:18
AAMAS papers have started to flood arXiv, good.
09.02.2026 08:13
PSRO with JOINT (!!!) experience best response
arxiv.org/abs/2602.06599
interesting that you paste an equation to "latex" it instead of just doing it in any markdown environment
that's why ram costs 1337$ per stick, I guess
I wrote about how I don't know math but still am somehow a successful computer scientist. I have strong feelings about this. But I also want to understand.
togelius.blogspot.com/2026/02/math...
The most important finding from this analysis! See the post for more details
08.02.2026 20:20
insulting AI?
08.02.2026 19:51
I enjoyed the book after I took decision theory during my bachelor's degree and took this Coursera course: www.coursera.org/learn/narrat...
I am bad at economics, but at least I've got a high-level overview of what Dr. Lanctot is describing in the thread, partially motivating me to model interactions...
Looks goofy but arxiv.org/abs/2602.01665
08.02.2026 17:14
#tutorial
08.02.2026 09:04
I found myself not having the time and, mainly, the desire to read. Quite sad, and I don't know what to do.
06.02.2026 20:29
In-context learning is the most accessible way for the general public to do and understand meta-learning, and to see why formalising it was genius.
06.02.2026 20:12
What a great week at the @aliceworkshop.org (Artificial Life, Intelligence, Complexity & Evolution) in Copenhagen.
Our multidisciplinary group worked intensely for the whole week and we got the 2nd prize!!
Thanks to the amazing organizers from the REAL lab and the jury.
@sharky6000.bsky.social might be interested!
balatro is like poker but rogue-like
en.wikipedia.org/wiki/Balatro
A cool benchmark
balatrobench.com
Talks from the World Models Workshop, happening at MILA in Montreal!
04.02.2026 07:20
@sharky6000.bsky.social can one submit a game to play with LLMs? It feels like there should be an org hosting the results and resources.
I am still tempted by civ/wow/colonisation-like things...
I am not sure about UCB in particular, but I like the perspective of this paper: arxiv.org/abs/2602.00966
I've the same thing on my mind, maybe we can unify it.
The 2026 IFAAMAS Influential Paper Award Committee has selected two winners for this year's award.
🔹Book Award
Rules of Encounter by Jeffrey S. Rosenschein & Gilad Zlotkin
🔹Collection of Papers Award
Influential works by Amy Greenwald, Keith Hall, @Junling Hu, Michael Wellman, and Amir Jafari.
It once took 13 years and $3 billion to sequence the human genome.
Now, we're using Google's AI tools to sequence animal genomes in just days to help save endangered species 🧬 (1/4)
goo.gle/4kgx18P
during this crazy session of the ALICE workshop, we found out that RL is not that deep...
02.02.2026 21:31
he definitely enjoys the process
02.02.2026 18:24