Pedro Santos @pedrosantospps

Pedro Santos

@pedrosantospps.bsky.social

PhD student at @istecnico.bsky.social working on sequential decision-making and reinforcement learning. https://ppsantos.github.io/

17 Followers | 37 Following | 6 Posts | Joined: 15.01.2025 | 1.386

Latest posts by pedrosantospps.bsky.social on Bluesky

Here’s some photos of GAIPS member @pedrosantospps.bsky.social presenting his work on ICML 2025 in Vancouver and EWRL 2025 in Tübingen, Germany. His poster was selected as a "spotlight poster" (top 2.6% of the papers)! 🙌 Read his work here: icml.cc/virtual/2025...

03.10.2025 14:39 — 👍 1 🔁 1 💬 0 📌 0

Walking around posters at @icmlconf.bsky.social, I was happy to see some buzz around convex RL—a topic I’ve worked on and strongly believe in.

Thought I’d share a few ICML papers on this direction. Let’s dive in👇

But first… what is convex RL?

🧵

1/n

24.07.2025 13:09 — 👍 5 🔁 1 💬 1 📌 1

The paper can be found here: arxiv.org/pdf/2409.15128

03.05.2025 08:46 — 👍 0 🔁 0 💬 0 📌 0

We provide lower and upper bounds on the mismatch between the finite and infinite trials formulations for GUMDPs, as well as empirical results to support our claims, highlighting how the number of trajectories and the structure of the underlying GUMDP influence policy evaluation.

03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0

We show that the number of trials plays a key role in infinite-horizon GUMDPs, and the expected performance of a given policy depends, in general, on the number of trials.

03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0

We contribute the first analysis on the impact of the number of trials, i.e., the number of randomly sampled trajectories, in infinite-horizon GUMDPs (considering both discounted and average formulations).

03.05.2025 08:34 — 👍 0 🔁 0 💬 1 📌 0

The general-utility Markov decision processes (GUMDPs) framework generalizes the MDPs framework by considering objective functions that depend on the frequency of visitation of state-action pairs induced by a given policy.

03.05.2025 08:34 — 👍 1 🔁 0 💬 1 📌 0

Happy to share that our paper "The Number of Trials Matters in Infinite-Horizon General-Utility Markov Decision Processes" got accepted as a spotlight poster at the International Conference on Machine Learning (ICML).

03.05.2025 08:34 — 👍 5 🔁 1 💬 2 📌 0

@pedrosantospps is following 20 prominent accounts

Fares Fourati
@faresfourati

PhD candidate at KAUST. Working on RL, bandits, and optimization. fouratifares.github.io/website/

RL & Agents Reading Group
@rl-agents-rg

RL & Agents Reading Group @ University of Edinburgh We regularly discuss recent papers in RL, MARL & related https://edinburgh-rl.github.io/reading-group

@aaai.org

@desariky

Chrysoula Zerva
@chryssazrv

Assistant Professor Instituto Superior Técnico (IST) and IT. Interested in understanding uncertainty in data, models, life. NLP, ML and climbing fan.

Regina de Brito Duarte
@reginabduarte

PhD Student in Human AI Interaction @INESC-ID, University of Lisbon. Curious about Politics and Society

Felipe Vecchietti
@lfelipesv

Postdoctoral Researcher at the Max Planck Institute for Security and Privacy. Interested in RL, Graph Neural Networks, AI for Science, and Multi-Agent Systems. 🇧🇷

Corentin Pla
@corentin-pla

PhD student @Criteo-INRIA-CREST | RL, Privacy, Learning augmented algorithms

Fernando P. Santos
@fernandopsantos

Associate Professor at UvA Amsterdam. Interested in multiagent systems, collective dynamics and fair/prosocial AI. JSMF fellow at Princeton, PhD at @istecnico

ELLIS
@ellis.eu

Strengthening Europe's Leadership in AI through Research Excellence | ellis.eu

EWRL18
@ewrl18

This is the official account of EWRL18 - European Workshop on Reinforcement Learning Official website: https://euro-workshop-on-reinforcement-learning.github.io/ewrl18/

GAIPS Lab
@gaipslab

Group of AI for People and Society, INESC-ID

IJCAIconf
@ijcai.org

IJCAI is the longest-running premier international AI research conference since 1969. 🌍 Connect across domains and feel the pulse of AI. 🗓️ #IJCAI2025 ❗16-22 August 2025❗Montreal 🇨🇦

André Martins
@andre-t-martins

NLP/ML researcher in Lisbon

ELLIS Unit Lisbon
@ellisunitlisbon

ELLIS Unit Lisbon for Learning and Intelligent Systems bringing together researchers in AI & ML across Instituto Superior Técnico, Instituto de Telecomunicações, INESC-ID & ISR-Lisboa. Part of ELLIS Europe. https://sites.google.com/view/ellis-unit-lisbon/

Mário Figueiredo
@marfig

Distinguished Professor of ECE and Feedzai Professor of machine learning @istecnico.bsky.social. Professor, photographer, music lover, curious about almost everything.

Lukas Schäfer
@lukaschaefer

www.lukaschaefer.com Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noah’s Ark Lab, Dematic; Young researcher HLF 2022

Miguel Faria
@miguel-at-faria

Postdoc at the Sardine Lab, Instuto Telecomunicações Lisbon working on multi-agent systems, machine teaching, reinforcement learning and NLP

Dylan Foster 🐢
@djfoster

Principal Researcher in AI/ML/RL Theory @ Microsoft Research NE/NYC. Previously @ MIT, Cornell. http://dylanfoster.net RL Theory Lecture Notes: https://arxiv.org/abs/2312.16730

TMLR Published Papers
@tmlr-pub

TMLR Homepage: https://jmlr.org/tmlr/ TMLR Infinite Conference: https://tmlr.infinite-conf.org/