Shoan :)'s Avatar

Shoan :)

@addledanorak.bsky.social

Undergrad student @ IIT delhi ML (and recently RL) enthusiast Love playing chess Phil Dunphy fanatic

28 Followers  |  602 Following  |  14 Posts  |  Joined: 11.11.2024  |  1.6067

Latest posts by addledanorak.bsky.social on Bluesky


Would love to work with your team on this! Along with experience in agentic systems i also am experienced in RL. Let me know if there is any fit with your team: shoan-raj.github.io/uploads/Shoa...

20.02.2025 19:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congratulations Lucas! I've been wanting to learn more about MORL, any resources you recommend?

16.02.2025 18:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Hey man, just saw your projects and wanted to let you know that they seem amazing! Excited to see what you make with RL

14.01.2025 14:37 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This felt like a perfect read to get started with MARL, concise enough to finish it quickly but still having a good depth in the concepts

08.01.2025 07:48 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
A First Introduction to Cooperative Multi-Agent Reinforcement Learning Multi-agent reinforcement learning (MARL) has exploded in popularity in recent years. While numerous approaches have been developed, they can be broadly categorized into three main types: centralized ...

I have a draft of my introduction to cooperative multi-agent reinforcement learning on arxiv. Check it out and let me know any feedback you have. The plan is to polish and extend the material into a more comprehensive text with Frans Oliehoek.

arxiv.org/abs/2405.06161

07.01.2025 16:25 β€” πŸ‘ 78    πŸ” 19    πŸ’¬ 3    πŸ“Œ 3
"Code Structure" section explaining structure of the codebase (file by file explanations) in the readme.md of a repo

"Code Structure" section explaining structure of the codebase (file by file explanations) in the readme.md of a repo

I wish more repos had this

29.12.2024 05:38 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In many benchmark environments, 10-20 lines of static instructions could leap past square 1 β€”not perfect, but better than nothing. This makes me think RL excels at refining good systems into great ones rather than starting from a blank slate, which would explain its increasing usage in LLM alignment

28.12.2024 14:44 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Another point I'd like to add is to have a good file structure - having worked with multiple large codebases I'd say this is probably the most useful thing. Once you start doing this properly, modularity within code follows automatically making it much more readable

19.12.2024 11:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Campus cats are the best

18.12.2024 15:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Do undergrads count? Would love to be added :)

01.12.2024 06:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Figure 1: Learning from machine-unique knowledge. Shows that the goal of the paper is extracting new concepts from machines that humans don't know about yet.

Figure 1: Learning from machine-unique knowledge. Shows that the goal of the paper is extracting new concepts from machines that humans don't know about yet.

Sharing this awesome paper which shows that:
1. You can extract concepts unknown to humans from superhuman agents
2. Those concepts can then seemingly be taught to experts via examples
arxiv.org/abs/2310.16410

15.11.2024 21:23 β€” πŸ‘ 40    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

The amount of new things I discovered just by scrolling on this app is insane

14.11.2024 17:51 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Lovely username btw

13.11.2024 20:28 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The best part of social media is paper recs so please start sharing them!

13.11.2024 04:46 β€” πŸ‘ 36    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1

That makes sense, thankyou!

12.11.2024 12:53 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Do professors notice messages from students on these platforms?

12.11.2024 11:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Tbh I am a 2nd year undergraduate rn, but im not using Twitter/bluesky with the sole purpose of getting an internship, but yeah the audience is there.

PS: I've been cold emailing professors for RL research internships, any tips?

12.11.2024 10:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@addledanorak is following 19 prominent accounts