Replicable Reinforcement Learning with Linear Function Approximation
Replication of experimental results has been a challenge faced by many scientific disciplines, including the field of machine learning. Recent work on the theory of machine learning has formalized rep...
I think I posted about it before but never with a thread. We recently put a new preprint on arxiv.
π Replicable Reinforcement Learning with Linear Function Approximation
π arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
26.10.2025 14:16 β π 20 π 6 π¬ 2 π 1
Academic Productivity with GenAI: A Researcherβs Guide
My Everyday Use of GenAI as a Researcher
hi bluesky π Iβm starting a blog! First post on how I use GenAI in my workflow as an academic. give it a read + tell me what you think:
yeganeha.substack.com/p/academic-p... #GenAI #Academia
09.06.2025 15:45 β π 8 π 2 π¬ 0 π 0
Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!
26.05.2025 19:19 β π 11 π 5 π¬ 1 π 0
Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!
20.05.2025 15:48 β π 6 π 4 π¬ 0 π 0
Last seminars before the summer break:
04/29: Max Simchowitz (CMU)
05/06: Jeongyeol Kwon (Univ. of Widsconsin-Madison)
05/20: Sikata Sengupta & Marcel Hussing (Univ. of Pennsylvania)
05/27: Dhruv Rohatgi (MIT)
06/03: David Janz (Univ. of Oxford)
06/10: Nneka Okolo (MIT)
16.04.2025 17:20 β π 14 π 5 π¬ 0 π 2
@mkearnsphilly.bsky.social is now on bsky as well!
12.12.2024 17:51 β π 4 π 0 π¬ 0 π 0
@mkearnsphilly.bsky.social
12.12.2024 17:47 β π 1 π 0 π¬ 0 π 0
If you are at #NeurIPS, we will be presenting this work (#6610) from 4:30-7:30PM today and would love to chat! @marcelhussing.bsky.social @optimistsinc.bsky.social @aaroth.bsky.social
12.12.2024 17:29 β π 10 π 4 π¬ 1 π 1
Thank you so much!
25.11.2024 22:03 β π 1 π 0 π¬ 1 π 0
Thank you so much for making this list! Could I also request @marcelhussing.bsky.social to be added by any chance?
25.11.2024 21:53 β π 3 π 0 π¬ 1 π 0
Thanks so much @antoine-mln.bsky.social!
25.11.2024 21:53 β π 1 π 0 π¬ 0 π 0
I made a starter pack for learning theory people to gather some people around the topic. There are too many names on here that I don't know so I only added a few I do. If you believe you should be on this list, let me know. I will add people with accurate profile descriptions.
go.bsky.app/21nFz12
10.11.2024 18:08 β π 52 π 19 π¬ 23 π 1
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardina...
Actual content post: Have not talked much about this work yet but we have a paper on Oracle-Efficient Reinforcement Learning for Max Value Ensembles at this year's #NeurIPS. We provide an efficient algorithm to ensemble policies given a value function oracle. arxiv.org/abs/2405.16739
10.11.2024 17:51 β π 14 π 3 π¬ 1 π 2
π PhD student at the Max Planck Institute for Intelligent Systems
π¬ Safe and robust AI, algorithms and society
π https://andrefcruz.github.io
π researcher in π©πͺ, from π΅πΉ
Researcher on MDPs and RL. Retired prof. #orms #rl
Research group leader @ Max Planck Institute working on theory & social aspect of CS. Previous @UCSC@GoogleDeepMind @Stanford @PKU1898
https://yatongchen.github.io/
MIT postdoc, incoming UIUC CS prof
katedonahue.me
Senior Researcher at Microsoft Research | Human-AI Interaction | Building AutoGen at Microsoft
The Computer Science Department's mission has remained steadfast: to lead in computer science research and education that has real-world impact β to push the frontiers of the field and produce the next generations leaders.
Researching reasoning at OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series / π
asst prof @cornellbowers.bsky.social thinking about dynamics, control, machine learning
sdean.website
Postdoc at Boston University with Aldo Pacchiano (PLAIA Lab plaia.ai). Interested in RL, Bandit problems and Adaptive Control.
Website: alessiorusso.net
Assistant Professor and Director of Critical ML@ WaterlooENG | Postdoc @ CS USCViterbi | PhD @ UMN | Representation Learning | AI for Manufacturing & Ops | AI for Health | Theory-guided ML for the Real-World |Views Personal |
https://sirisharambhatla.com/
πΏπ¦ | Fairness in AI | University of Oxford | Deep Learning Indaba | Internships: Google DeepMind || Microsoft Research
Personal Account
Founder: The Distributed AI Research Institute @dairinstitute.bsky.social.
Author: The View from Somewhere, a memoir & manifesto arguing for a technological future that serves our communities (to be published by One Signal / Atria
I lead Cohere For AI. Formerly Research
Google Brain. ML Efficiency, LLMs,
@trustworthy_ml.
PhD Student in TΓΌbingen (MPI-IS & Uni TΓΌ), interested in reinforcement learning. Freedom is a pure idea. https://onnoeberhard.com/
Charles W. Eliot Professor and President Emeritus at Harvard, former Secretary of the Treasury for President Clinton
πΉπΌ Cyber Ambassador, 1st Digital Minister (2016-2024) & π 1st π³οΈββ§οΈ cabinet minister.