Canadian researchers should be aware the there is a motion before the Parliamentary Standing Committee on Science and Research to force Tricouncils to hand over disaggregated peer review data on all applications:
Applicant names, profiles, demographics
Reviewers names, profiles, comments, and scores
30.10.2025 20:33 โ ๐ 125 ๐ 155 ๐ฌ 10 ๐ 46
As Transactions on Machine Learning Research (TMLR) grows in number of submissions, we are looking for more reviewers and action editors. Please sign up!
Only one paper to review at a time and <= 6 per year, reviewers report greater satisfaction than reviewing for conferences!
14.10.2025 13:32 โ ๐ 9 ๐ 12 ๐ฌ 1 ๐ 2
Happy Diwali!
21.10.2025 22:10 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
Apply to join one of the top AI research ๐ค institutes in the world in the wonderful city of Montrรฉal!
I will be recruiting 1-2 students this year. Just mention my name in your application.
15.10.2025 15:41 โ ๐ 9 ๐ 3 ๐ฌ 0 ๐ 0
A table showing that different discretization of the Riemannian Gradient Flow leads to different algorithm:
Geometry discretized, Objective discretized: Natural Gradient Descent
Geometry not discretized, Objective discretized: Mirror Descent
Geometry discretized, Objective not discretized: unknown (?)
Mirrorless Mirror Descent: A Natural Derivation of Mirror Descent (Gunasekar, Woodworth, Srebro at AISTATS, 2021)
They show that the Mirror Descent algorithm is a particular way of discretization a certain geometry-aware gradient flow.
proceedings.mlr.press/v130/gunasek...
Interesting paper!
11.10.2025 19:35 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0
a close up of a sad cat with the words pleeeaasse written below it
ALT: a close up of a sad cat with the words pleeeaasse written below it
cvoelcker.de/blog/2025/re...
I finally gave in and made a nice blog post about my most recent paper. This was a surprising amount of work, so please be nice and go read it!
02.10.2025 21:34 โ ๐ 29 ๐ 7 ๐ฌ 0 ๐ 3
They characterize the convergence behaviour using the "Committal Rate" of the algorithm, quantifying how aggresive the algorithm's update rule is.
01.10.2025 17:57 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Methods benefitting from the geometry (e.g., Natural PG) behave much better than standard PG (under softmax policy), if they have access to the exact policy gradient (or NPG). If the direction of improvement is estimated on-policy with noise, PG >> NPG.
01.10.2025 17:57 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Understanding the Effect of Stochasticity in Policy Optimization (NeurIPS 2021) by Jincheng Mei, Bo Dai, Chenjun Xiao, @skiandsolve.bsky.social, Dale Schuurmans.
Interesting paper on Policy Gradient (PG) methods!
PG >> NPG or PG << NPG?! It depends on your estimator.
arxiv.org/abs/2110.15572
01.10.2025 17:57 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0
The philosopher John Searle died recently.
He was one of the famous contemporary philosophers about whom I kept reading or hearing, mostly because of his Chinese Room argument.
Reading some comments by those who have actually met him, it seems that he was a character!
29.09.2025 20:31 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
Shanah tovah! ืฉื ื ืืืื
24.09.2025 01:57 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Three years ago today, #MahsaAmini was murdered by the Islamic Republic. Her death started the hopeful Women, Life, Freedom movement in Iran and across the globe.
Mahsa, Nika, Sarina, and 100s of others are not among us anymore, but their influence has changed Iran forever.
16.09.2025 16:37 โ ๐ 7 ๐ 1 ๐ฌ 0 ๐ 0
1) The exponential growth of the field implies an exponential growth of recent papers, so even uniform sampling of papers means reading more of the recent ones.
2) With a limited time to read, there is a FOMO-type of incentive to focus on the new. The sampling distribution is not uniform.
12.09.2025 03:41 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0
The citation depth of ML is shallow: most people don't meaningfully cite a more than a few years old paper (yes, textbooks and obligatory classics aside).
Why? Many reasons, including that the authors probably haven't actually read the old papers.
Why? Two reasons:
12.09.2025 03:41 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0
But its output was inspirational!
08.09.2025 06:48 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Surprisingly, it didn't do a good job. Maybe it needed more hand-helding and guidance. My prompt was something like this:
This is my Research Proposal, these are the Evaluation Criteria, these are suggestions of what should go in each section, and this is my actual CV. Come up with something good.
08.09.2025 06:48 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
What is expected is not clear, there is no samples to see, there is a 5-page limit (if you write in French, you get an extra page!), but I suppose this invention helps everyone keep busy and the beareaucrats continue to have their jobs.
07.09.2025 21:04 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
I recently learned the existence of something called Descriptive CV. This is a type of CV that must adapt to the grant application, so whenever you write a research proposal, you have to write a new CV too. Wonderful!
07.09.2025 21:04 โ ๐ 5 ๐ 0 ๐ฌ 3 ๐ 0
Big news, everyone!
02.09.2025 15:25 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0
I don't know what exactly he was thinking of (minimax results? adaptivity? continual learning?), but I have been thinking of a version of this thought in the past 2-3 years, 50 something years after him!
01.09.2025 02:34 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
... I propose to study the synthesis of brain models by the parallel development of a series of matched (theoretical) environments and corresponding brain models that adapt to them."
01.09.2025 02:34 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
... Even the complex human brain first adapts to the simpler aspects of its environment and gradually builds up to the more complex features. ...
01.09.2025 02:34 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
This proposal by Claude Shannon is quite surprising to me:
"The matched environment brain model approach to automata. In general a machine or animal can only adapt to or operate in a limited class of environments. ...
01.09.2025 02:34 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
It all started with a proposal, 60 years ago!
"A Proposal for Dartmouth Summer Research Project on Artificial Intelligence".
jmc.stanford.edu/articles/dar...
01.09.2025 02:34 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0
Both can be useful. The top picture has 2+ nice properties:
1) It is modular. One can put other modules before or after that.
2) It emphasizes the compression aspect of AE.
You've already mentioned the nice properties of the bottom figure.
30.08.2025 20:41 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Freaking Famine!
The situation in Gaza will be the stain on our generation and all whose support led to this.
22.08.2025 21:32 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
The Prime Minister is primed to the Policy Search methods. The AI minister, however, is more into the Critique.
21.08.2025 03:15 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0
Doesn't this imply that one should not consider the comments of the rejecting committee as the reviewers will be different?
(Of course, some comments may make sense, but if they don't, ...).
The same for a paper.
13.08.2025 04:25 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Nephrologist, Associate Prof @UAlberta,
Social Media Editor - @pdi-journal.bsky.social #PeritonealDialysis Unit Director, #AskRenal #NephJC #NSMC, #AI, #MedEd, #NephSky #MedicalInformatics
๐ฎ๐ณโถ๏ธ๐จ๐ฆ
#LetsGoOilers
Senior Scientist at the Kite Research Institute | Toronto Rehab - University Health Network. Associate Professor, Affiliated Scientist in the Department of Computer Science, University of Toronto (cross appointed at the Institute of Biomedical Engineering)
Professor, author of book on Simulation-Based Optimization, #ReinforcementLearning #MDPs #ORMS www.simoptim.com
Researcher on MDPs and RL. Retired prof. #orms #rl
Research group leader @ Max Planck Institute working on theory & social aspect of CS. Previous @UCSC@GoogleDeepMind @Stanford @PKU1898
https://yatongchen.github.io/
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making.
11-14 June 2025.
Trinity College Dublin.
https://rldm.org/
Brain and behavior. PhD candidate at NYU.
Here I post mostly at the intersection of science and science fiction.
Images are made by myself unless otherwise specified.
Thinking on dopamine heterogeneity
NeuroAI PhD student at @mcgillu.
Prev: @GeorgiaTech @FlatironCCN.
Dad. Author of the Cosmic Collisions books. Astrophysicist and Citizen Science fanatic @nasa. www.marckuchner.com
Multi-Agent RL, PhD Student @ TUWien
Interested in ML, comp bio, immunology, and just about anything one hop away from either.
Researching planning, reasoning, and RL in LLMs @ Reflection AI. Previously: Google DeepMind, UC Berkeley, MIT. I post about: AI ๐ค, flowers ๐ท, parenting ๐ถ, public transit ๐. She/her.
http://www.jesshamrick.com
RL & Agents Reading Group @ University of Edinburgh
We regularly discuss recent papers in RL, MARL & related
https://edinburgh-rl.github.io/reading-group
#RL Postdoc at Mila - Quebec AI Institute and Universitรฉ de Montrรฉal
Researcher in ML/NLP at the University of Edinburgh (faculty at Informatics and EdinburghNLP), Co-Founder/CTO at www.miniml.ai, ELLIS (@ELLIS.eu) Scholar, Generative AI Lab (GAIL, https://gail.ed.ac.uk/) Fellow -- www.neuralnoise.com, he/they
ML Engineer at NVIDIA. Previously: Stealth GPU startup; Stability AI; AMD; Autodesk; CEO of 2 startups (3D + AI). Toronto, Canada
Postdoc at the University of Manchester working on misinformation; Mum; #WomanLifeFreedom
https://sites.google.com/view/somayehtohidi/about
Science Fiction and Fantasy writer. As a hobby, I also write Criticism and Non-Fiction.
Associate professor @ University of Alberta and Canada CIFAR AI Chair @ Alberta Machine Intelligence Institute.
Games, machine learning, and creativity | he/him