Here’s some photos of GAIPS member @pedrosantospps.bsky.social presenting his work on ICML 2025 in Vancouver and EWRL 2025 in Tübingen, Germany. His poster was selected as a "spotlight poster" (top 2.6% of the papers)! 🙌 Read his work here: icml.cc/virtual/2025...
03.10.2025 14:39 — 👍 1 🔁 1 💬 0 📌 0
Walking around posters at @icmlconf.bsky.social, I was happy to see some buzz around convex RL—a topic I’ve worked on and strongly believe in.
Thought I’d share a few ICML papers on this direction. Let’s dive in👇
But first… what is convex RL?
🧵
1/n
24.07.2025 13:09 — 👍 5 🔁 1 💬 1 📌 1
The paper can be found here: arxiv.org/pdf/2409.15128
03.05.2025 08:46 — 👍 0 🔁 0 💬 0 📌 0
We provide lower and upper bounds on the mismatch between the finite and infinite trials formulations for GUMDPs, as well as empirical results to support our claims, highlighting how the number of trajectories and the structure of the underlying GUMDP influence policy evaluation.
03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0
We show that the number of trials plays a key role in infinite-horizon GUMDPs, and the expected performance of a given policy depends, in general, on the number of trials.
03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0
We contribute the first analysis on the impact of the number of trials, i.e., the number of randomly sampled trajectories, in infinite-horizon GUMDPs (considering both discounted and average formulations).
03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0
The general-utility Markov decision processes (GUMDPs) framework generalizes the MDPs framework by considering objective functions that depend on the frequency of visitation of state-action pairs induced by a given policy.
03.05.2025 08:34 — 👍 1 🔁 0 💬 1 📌 0
Happy to share that our paper "The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes" got accepted as a spotlight poster at the International Conference on Machine Learning (ICML).
03.05.2025 08:34 — 👍 5 🔁 1 💬 2 📌 0
PhD candidate at KAUST. Working on RL, bandits, and optimization.
fouratifares.github.io/website/
RL & Agents Reading Group @ University of Edinburgh
We regularly discuss recent papers in RL, MARL & related
https://edinburgh-rl.github.io/reading-group
Assistant Professor Instituto Superior Técnico (IST) and IT.
Interested in understanding uncertainty in data, models, life. NLP, ML and climbing fan.
PhD Student in Human AI Interaction @INESC-ID, University of Lisbon.
Curious about Politics and Society
Postdoctoral Researcher at the Max Planck Institute for Security and Privacy. Interested in RL, Graph Neural Networks, AI for Science, and Multi-Agent Systems. 🇧🇷
PhD student @Criteo-INRIA-CREST | RL, Privacy, Learning augmented algorithms
Associate Professor at UvA Amsterdam. Interested in multiagent systems, collective dynamics and fair/prosocial AI. JSMF fellow at Princeton, PhD at @istecnico
Strengthening Europe's Leadership in AI through Research Excellence | ellis.eu
This is the official account of EWRL18 - European Workshop on Reinforcement Learning
Official website: https://euro-workshop-on-reinforcement-learning.github.io/ewrl18/
Group of AI for People and Society, INESC-ID
IJCAI is the longest-running premier international AI research conference since 1969. 🌍 Connect across domains and feel the pulse of AI. 🗓️ #IJCAI2025 ❗16-22 August 2025❗Montreal 🇨🇦
NLP/ML researcher in Lisbon
ELLIS Unit Lisbon for Learning and Intelligent Systems bringing together researchers in AI & ML across Instituto Superior Técnico, Instituto de Telecomunicações, INESC-ID & ISR-Lisboa. Part of ELLIS Europe.
https://sites.google.com/view/ellis-unit-lisbon/
Distinguished Professor of ECE and Feedzai Professor of machine learning @istecnico.bsky.social.
Professor, photographer, music lover, curious about almost everything.
www.lukaschaefer.com
Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noah’s Ark Lab, Dematic; Young researcher HLF 2022
Postdoc at the Sardine Lab, Instuto Telecomunicações Lisbon
working on multi-agent systems, machine teaching, reinforcement learning and NLP
Principal Researcher in AI/ML/RL Theory @ Microsoft Research NE/NYC. Previously @ MIT, Cornell. http://dylanfoster.net
RL Theory Lecture Notes: https://arxiv.org/abs/2312.16730
TMLR Homepage: https://jmlr.org/tmlr/
TMLR Infinite Conference: https://tmlr.infinite-conf.org/