Would love to work with your team on this! Along with experience in agentic systems i also am experienced in RL. Let me know if there is any fit with your team: shoan-raj.github.io/uploads/Shoa...
20.02.2025 19:06 β π 0 π 0 π¬ 0 π 0@addledanorak.bsky.social
Undergrad student @ IIT delhi ML (and recently RL) enthusiast Love playing chess Phil Dunphy fanatic
Would love to work with your team on this! Along with experience in agentic systems i also am experienced in RL. Let me know if there is any fit with your team: shoan-raj.github.io/uploads/Shoa...
20.02.2025 19:06 β π 0 π 0 π¬ 0 π 0Congratulations Lucas! I've been wanting to learn more about MORL, any resources you recommend?
16.02.2025 18:11 β π 1 π 0 π¬ 1 π 0Hey man, just saw your projects and wanted to let you know that they seem amazing! Excited to see what you make with RL
14.01.2025 14:37 β π 2 π 0 π¬ 1 π 0This felt like a perfect read to get started with MARL, concise enough to finish it quickly but still having a good depth in the concepts
08.01.2025 07:48 β π 1 π 0 π¬ 0 π 0I have a draft of my introduction to cooperative multi-agent reinforcement learning on arxiv. Check it out and let me know any feedback you have. The plan is to polish and extend the material into a more comprehensive text with Frans Oliehoek.
arxiv.org/abs/2405.06161
"Code Structure" section explaining structure of the codebase (file by file explanations) in the readme.md of a repo
I wish more repos had this
29.12.2024 05:38 β π 1 π 0 π¬ 0 π 0In many benchmark environments, 10-20 lines of static instructions could leap past square 1 βnot perfect, but better than nothing. This makes me think RL excels at refining good systems into great ones rather than starting from a blank slate, which would explain its increasing usage in LLM alignment
28.12.2024 14:44 β π 3 π 0 π¬ 1 π 0Another point I'd like to add is to have a good file structure - having worked with multiple large codebases I'd say this is probably the most useful thing. Once you start doing this properly, modularity within code follows automatically making it much more readable
19.12.2024 11:50 β π 0 π 0 π¬ 0 π 0Campus cats are the best
18.12.2024 15:01 β π 1 π 0 π¬ 0 π 0Do undergrads count? Would love to be added :)
01.12.2024 06:08 β π 0 π 0 π¬ 0 π 0Figure 1: Learning from machine-unique knowledge. Shows that the goal of the paper is extracting new concepts from machines that humans don't know about yet.
Sharing this awesome paper which shows that:
1. You can extract concepts unknown to humans from superhuman agents
2. Those concepts can then seemingly be taught to experts via examples
arxiv.org/abs/2310.16410
The amount of new things I discovered just by scrolling on this app is insane
14.11.2024 17:51 β π 1 π 0 π¬ 0 π 0Lovely username btw
13.11.2024 20:28 β π 2 π 0 π¬ 0 π 0The best part of social media is paper recs so please start sharing them!
13.11.2024 04:46 β π 36 π 4 π¬ 1 π 1That makes sense, thankyou!
12.11.2024 12:53 β π 1 π 0 π¬ 0 π 0Do professors notice messages from students on these platforms?
12.11.2024 11:04 β π 1 π 0 π¬ 0 π 0Tbh I am a 2nd year undergraduate rn, but im not using Twitter/bluesky with the sole purpose of getting an internship, but yeah the audience is there.
PS: I've been cold emailing professors for RL research internships, any tips?