Taylor W. Killian @twkillian

CoCoMARL 2025 Stimulated by the increasing complexity of real-world problems, multi-agent reinforcement learning (MARL) has emerged as a promising approach for enabling autonomous agents to learn and adapt in dynam...

There has been a multi-agent workshop the past two years!

Here's last year's workshop website: sites.google.com/view/cocomar...

13.02.2026 22:20 — 👍 0 🔁 0 💬 0 📌 0

The workshops are maybe the best part of RLC. Bring us your workshops that could never happen anywhere else!

13.02.2026 22:07 — 👍 14 🔁 1 💬 0 📌 0

3 weeks to get your RLC papers in shape! And to get the word out to those who have yet to experience an RLC review process.

13.02.2026 04:14 — 👍 10 🔁 2 💬 0 📌 0

🚀 Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵👇

13.02.2026 19:28 — 👍 22 🔁 10 💬 1 📌 0

RLC 2026 Workshop Proposals Welcome to the OpenReview homepage for RLC 2026 Workshop Proposals

Please don't hesitate to reach out with questions to me directly or via email to workshops@rl-conference.cc

The OpenReview submission site can be found at:
openreview.net/group?id=rl-...

13.02.2026 21:50 — 👍 1 🔁 0 💬 0 📌 0

RLJ | RLC Call for Workshops

Workshops, held on the first day, are a primary feature of
@rl-conference.bsky.social and have set a delightfully inquisitive tone. Discussions started at both workshops I've been a part of in the past persisted through the week, with some still ongoing!

CfW here: rl-conference.cc/call_for_wor...

13.02.2026 21:50 — 👍 2 🔁 0 💬 1 📌 1

We're thrilled to share that the Call for Workshops for this year's @rl-conference.bsky.social is now live!

As Workshop co-chair (alongside the wonderful Raksha Kumaraswamy and @claireve.bsky.social) we are looking forward to seeing the proposals for workshops that we receive.

LINK IN NEXT POST

13.02.2026 21:50 — 👍 8 🔁 3 💬 1 📌 1

One paper accepted to ICML with one paper rejected as well. It’s called balance (and measured frustration) #icml2025

01.05.2025 14:00 — 👍 5 🔁 0 💬 0 📌 0

Shipping a copy of Kuhn's "Structure of Scientific Revolutions" to every ICML meta-reviewer

01.05.2025 12:59 — 👍 46 🔁 4 💬 2 📌 1

Great question! To be honest, no idea. Part of the research design is to identify the objectives we want to hit and these aren’t fully fleshed out just yet.

16.04.2025 18:38 — 👍 1 🔁 0 💬 0 📌 0

Multi-agent in terms of mixed autonomy type of environments (or training a LLM to gather information from other agents, human or artificial) is somewhat along the path of our future strategic directions for the US-SV office.

Large behavioral sims or specific applied focus areas are likely not.

16.04.2025 18:18 — 👍 1 🔁 0 💬 1 📌 0

MBZIFM

We're hiring at all levels for engineering and research roles with focus areas in CV, LLMs, and RL (broadly defined). We are firmly committed to open science and have extensive computational resources. Check us out, see you in Singapore! ifm.mbzuai.ac.ae

16.04.2025 17:40 — 👍 2 🔁 0 💬 1 📌 0

We're putting in final planning and preparation steps this week to be ready for #ICLR2025. I'm excited to be attending, in part to help represent @mbzuai.bsky.social and @llm360.bsky.social as well as recruit Research Scientists and Engineers to our new lab in the Bay Area! (see next for more info)

16.04.2025 17:40 — 👍 2 🔁 0 💬 1 📌 0

Thank you for the kind consideration. I am however no longer in Canada (left during the COVID pandemic to follow my advisor to MIT, and now find myself in the SF Bay Area and missing Massachusetts and Ontario a whole lot...)

27.03.2025 21:31 — 👍 4 🔁 0 💬 0 📌 0

I love this idea. Really and truly. Let's do this @icmlconf.bsky.social! (Maybe also something @rl-conference.bsky.social could look into?)

27.03.2025 02:33 — 👍 0 🔁 0 💬 0 📌 0

Wow! Congratulations to both BU and you!

27.03.2025 02:30 — 👍 1 🔁 0 💬 0 📌 0

I used to dream of days like this.

Mowed the lawn, trimmed some hedges, pruned a few trees to let more sun into our yard, picked the rest of our orange tree (we yielded 658 🍊this year!) had the kids help clean up the branches, etc. #suburbandadsaturdays

22.03.2025 20:33 — 👍 4 🔁 0 💬 0 📌 0

Real-world RL for the public good! Love to see it!

12.03.2025 04:20 — 👍 14 🔁 2 💬 0 📌 0

Real-World Deployment and Assessment of a Multi-Agent Reinforcement Learning-Based Variable Speed Limit Control System This article presents the first field deployment of a multi-agent reinforcement learning (MARL) based variable speed limit (VSL) control system on Interstate 24 (I-24) near Nashville, Tennessee. We de...

A year later of a multi-agent RL controlled variable speed limit system:
arxiv.org/abs/2503.01017
Fewer crashes, quicker responses, quicker warnings

12.03.2025 02:14 — 👍 39 🔁 4 💬 2 📌 1

Reviewing a paper right now and I'm having lots of "I wish that I'd written this!" feelings. Great sign for the authors tbh.

12.03.2025 04:18 — 👍 9 🔁 0 💬 1 📌 0

Was about to ask if this was how you were doing your ICML reviewing ;)

12.03.2025 04:17 — 👍 0 🔁 0 💬 0 📌 0

Congratulations Andy and Rich! You've given RL yet another place in the history books. I can't imagine how you're feeling right now, but it must be amazing to reflect on how far the ideas have come, especially after all the care and dedication you gave them.

awards.acm.org/about/2024-t...

05.03.2025 21:20 — 👍 9 🔁 1 💬 0 📌 0

Think of LLM Applications as POMDPs — Not Agents · TensorZero Think of LLM Applications as POMDPs — Not Agents

As I've started down the RL+LLM rabbit hole (for reals this time), this blog post by TensorZero is absolutely compelling. It's nice to see clear thinking conceptual framing that at once open the doors to new research but also leverages existing insights:
www.tensorzero.com/blog/think-o...

04.03.2025 22:29 — 👍 20 🔁 4 💬 2 📌 0

a man with a beard is saying he can 't keep getting away with it ! ALT: a man with a beard is saying he can 't keep getting away with it !

Struggled to find a paper to recommend to authors that would strengthen their claims while writing a review. Used @GeminiApp and @youdotcom to no success. Reluctantly pulled up @ChatGPTapp, nailed it in the first returned result. #HesitantAboutLLMsButTryingToLearn

03.03.2025 05:19 — 👍 0 🔁 0 💬 0 📌 0

"Scaling laws" tries to piggyback on physics where the scaling laws are resultant from statistical limits. To call them "laws" rather than "empirical scaling curves" is real iffy

28.02.2025 18:41 — 👍 40 🔁 1 💬 2 📌 0

I’ve got a lot of thoughts about the current market as an ML researcher, not sure I’ll share them in total.

But… it was more difficult than anticipated and I felt hopeless a lot of the time. If you’re feeling similarly; know you’re not worthless, it just may take more time. 🤗

24.02.2025 23:22 — 👍 2 🔁 0 💬 0 📌 0

Some additional information (and job postings) can be found here: mbzuai.ac.ae/institute-of-f…

Let me know if you want to learn more about IFM, why I chose this opportunity, or chat about open access research.

24.02.2025 23:22 — 👍 1 🔁 0 💬 1 📌 0

Day 1 with @mbzuai.bsky.social as we start building out a new non-profit, open research lab in the Bay Area. Lots to learn and my head is spinning but I’m excited to get started pushing the boundaries of what we know about reasoning and decision making under uncertainty.

24.02.2025 23:22 — 👍 3 🔁 0 💬 1 📌 0

I’m so glad that you found this paper. Lots of really fun implications and avenues to explore, both in applied and foundational directions.

08.02.2025 00:55 — 👍 3 🔁 0 💬 0 📌 0

Oh man, i wish that I had a concrete answer here. I do recall discussions about evaluations over all aspects of nuPlan but don’t remember what results we had.

07.02.2025 15:05 — 👍 4 🔁 0 💬 0 📌 0

Taylor W. Killian

Latest posts by twkillian.bsky.social on Bluesky

@twkillian is following 20 prominent accounts