Shoan :) @addledanorak - Bluesky Profile

Shoan :)

@addledanorak.bsky.social

Undergrad student @ IIT delhi ML (and recently RL) enthusiast Love playing chess Phil Dunphy fanatic

28 Followers | 602 Following | 14 Posts | Joined: 11.11.2024 | 1.6067

Latest posts by addledanorak.bsky.social on Bluesky

Would love to work with your team on this! Along with experience in agentic systems i also am experienced in RL. Let me know if there is any fit with your team: shoan-raj.github.io/uploads/Shoa...

20.02.2025 19:06 — 👍 0 🔁 0 💬 0 📌 0

Congratulations Lucas! I've been wanting to learn more about MORL, any resources you recommend?

16.02.2025 18:11 — 👍 1 🔁 0 💬 1 📌 0

Hey man, just saw your projects and wanted to let you know that they seem amazing! Excited to see what you make with RL

14.01.2025 14:37 — 👍 2 🔁 0 💬 1 📌 0

This felt like a perfect read to get started with MARL, concise enough to finish it quickly but still having a good depth in the concepts

08.01.2025 07:48 — 👍 1 🔁 0 💬 0 📌 0

A First Introduction to Cooperative Multi-Agent Reinforcement Learning Multi-agent reinforcement learning (MARL) has exploded in popularity in recent years. While numerous approaches have been developed, they can be broadly categorized into three main types: centralized ...

I have a draft of my introduction to cooperative multi-agent reinforcement learning on arxiv. Check it out and let me know any feedback you have. The plan is to polish and extend the material into a more comprehensive text with Frans Oliehoek.

arxiv.org/abs/2405.06161

07.01.2025 16:25 — 👍 78 🔁 19 💬 3 📌 3

"Code Structure" section explaining structure of the codebase (file by file explanations) in the readme.md of a repo

I wish more repos had this

29.12.2024 05:38 — 👍 1 🔁 0 💬 0 📌 0

In many benchmark environments, 10-20 lines of static instructions could leap past square 1 —not perfect, but better than nothing. This makes me think RL excels at refining good systems into great ones rather than starting from a blank slate, which would explain its increasing usage in LLM alignment

28.12.2024 14:44 — 👍 3 🔁 0 💬 1 📌 0

Another point I'd like to add is to have a good file structure - having worked with multiple large codebases I'd say this is probably the most useful thing. Once you start doing this properly, modularity within code follows automatically making it much more readable

19.12.2024 11:50 — 👍 0 🔁 0 💬 0 📌 0

Campus cats are the best

18.12.2024 15:01 — 👍 1 🔁 0 💬 0 📌 0

Do undergrads count? Would love to be added :)

01.12.2024 06:08 — 👍 0 🔁 0 💬 0 📌 0

Figure 1: Learning from machine-unique knowledge. Shows that the goal of the paper is extracting new concepts from machines that humans don't know about yet.

Sharing this awesome paper which shows that:
1. You can extract concepts unknown to humans from superhuman agents
2. Those concepts can then seemingly be taught to experts via examples
arxiv.org/abs/2310.16410

15.11.2024 21:23 — 👍 40 🔁 3 💬 1 📌 0

The amount of new things I discovered just by scrolling on this app is insane

14.11.2024 17:51 — 👍 1 🔁 0 💬 0 📌 0

Lovely username btw

13.11.2024 20:28 — 👍 2 🔁 0 💬 0 📌 0

The best part of social media is paper recs so please start sharing them!

13.11.2024 04:46 — 👍 36 🔁 4 💬 1 📌 1

That makes sense, thankyou!

12.11.2024 12:53 — 👍 1 🔁 0 💬 0 📌 0

Do professors notice messages from students on these platforms?

12.11.2024 11:04 — 👍 1 🔁 0 💬 0 📌 0

Tbh I am a 2nd year undergraduate rn, but im not using Twitter/bluesky with the sole purpose of getting an internship, but yeah the audience is there.

PS: I've been cold emailing professors for RL research internships, any tips?

12.11.2024 10:52 — 👍 1 🔁 0 💬 1 📌 0

@addledanorak is following 19 prominent accounts

Akhil Arora
@akhilarora

Assistant Prof. @csaudk.bsky.social | Fellow @cphsodas.bsky.social Previous: @icepfl.bsky.social @americanexpress @Xerox @Intel Interests: 🥾🏔️🚴‍♂️🏋️‍♂️🎸 #NLProc #LLMs #AgenticAI #Causality #GraphML https://www.cs.au.dk/~clan/people/aarora

Greg Robison
@gregrobison

Chief AI Architect, CTO and Founder at F'inn (https://www.linkedin.com/in/gregrobison), writer (https://gregrobison.medium.com/) and photographer (https://www.gregrobison.com)

Mohit Iyyer
@miyyer

associate prof at UMD CS researching NLP & LLMs

Naoshige Uchida
@naoshigeuchida

Neuroscientist. Professor at Harvard University. Studies the neural mechanisms underlying decision-making and learning. Dopamine.

@poshakpathak

Longqi Yang
@ylongqi

Partner Applied Research Manager at Microsoft #AI #GenAI #LLM #FutureofWork

Peter Vamplew
@amp1874

Professor in IT @ Federation Uni. Multi-objective reinforcement learning. Human-aligned AI. Best known for the f*cking mailing list paper. Jambo & Bengals fan. https://t.co/UNoOrbGApz

Sébastien Darses
@sebdarses

Math Assoc. Prof. (on leave, Aix-Marseille, France) Interest: Prob / Stat / ANT. See: https://sites.google.com/view/sebastien-darses/research?authuser=0 Teaching Project (non-profit): https://highcolle.com/

Dan Clark
@danclark.org

🤔Making search that learns w/ English like a kid to $1M revenue. 💸Making automated trading systems (stocks/futures) Avid reader, into robotics, compression, AI 💼 Projects: oxo.ai, multiplayer.ai 🏡 About me: danclark.org 📍 San Francisco

Ezgi Korkmaz
@ezgikorkmaz

Machine Learning Researcher. PhD in Machine Learning. ✨Researching Reinforcement Learning. Been at @UCL @GoogleDeepmind @UCBerkeley @ucberkeleyofficial.bsky.social @ucl.ac.uk Website: https://ezgikorkmaz.github.io/

Tamsin
@tamsin-ai

Researching Al and Machine Learning in the Finance sector, conference speaker, co-author of The Al book

Eric Neustadter (e)
@thevowel

(semi?) retired tech architect/tech exec. ex-Pokémon. ex-Xbox. labrador at heart. retro games & computers. go ducks, kraken, seahawks, sounders! he/him.

Tim G. J. Rudner
@timrudner

Assistant Professor, University of Toronto. Junior Research Fellow, Trinity College, Cambridge. AI Fellow, Georgetown University. Probabilistic Machine Learning, AI Safety & AI Governance. Prev: Oxford, Yale, UC Berkeley, NYU. https://timrudner.com

Bram Grooten
@bramgrooten

PhD student at the TU Eindhoven. Researching deep learning, RL, sparse neural networks, robotics.

Tom Ringstrom
@noreward4u

Reward-Free Model-based Maximalist. High-dimensional Empowerment. Self-Preserving Autonomous Agents. Theories of intelligence grounded in compositional control.

Tom Dupuis
@tomdupuis

PhD student @ENSTAParis🇫🇷, TD Learning and deep RLing, representation matters MVA/CentraleSupélec alumni

Stone Tao
@stonet2000

PhDing @UCSanDiego @NVIDIA @hillbot_ai on scalable robot learning and embodied AI. Co-founded @LuxAIChallenge to build AI competitions. @NSF GRFP fellow http://stoneztao.com

Jorge Mendez-Mendez
@jmendezm

Embodied lifelong learning (compositionality, RL, TAMP, robotics). Assistant Professor at Stony Brook ECE. Postdoc at MIT CSAIL, PhD from GRASP lab at Penn. https://jorge-a-mendez.github.io

Raymond Chua
@raymondrchua

NeuroAI PhD Candidate at McGill / Mila. Loves: 🧠 🏕️ 🏔️ 🏊🏻‍♂️ 🚴🏻‍♂️ 🏃🏻‍♂️ 🎨📚☕ https://raymondchua.github.io