Regardless of whether HRM lives up to the hype, itโs a good stress test for alignment / AI security strategies.
Does your approach accommodate the sudden appearance of a new paradigm that outperforms LLMs? If not, it might be too brittle.
03.08.2025 19:16 โ ๐ 48 ๐ 7 ๐ฌ 5 ๐ 0
We have open-sourced anonymized data and core analysis code for our developer productivity RCT.
The paper is also live on arXiv, with two new sections: One discussing alternative uncertainty estimation methods, and a new 'bias from developer recruitment' factor that has unclear effect on slowdown.
30.07.2025 20:10 โ ๐ 28 ๐ 8 ๐ฌ 1 ๐ 0
Poster โSolving Probabilistic Verification Problems of Neural Networks using Branch and Boundโ
Iโll be presenting my poster on solving probabilistic verification problems of NNs at ICML today! Meet me at poster E-2302 :)
17.07.2025 17:01 โ ๐ 3 ๐ 1 ๐ฌ 0 ๐ 0
Personally, I would rather use โtrueโ as a stand-in for fidelity when talking about explanations
12.06.2025 12:05 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
But I think thatโs also what Occamโs razor is used for. You have two explanations but one is longer. Which is the โtrueโ one? => Occamโs razor. But Iโd say โbetterโ is the better word than โtrueโ here.
12.06.2025 12:04 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
I would assumed that you have to make explanations identifiable using something akin to Occamโs razor
11.06.2025 20:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Wow
08.06.2025 18:18 โ ๐ 112 ๐ 10 ๐ฌ 7 ๐ 2
Thanks @icmlconf.bsky.social for gifting me the conference registration
29.05.2025 08:06 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Just got notified that I was selected as an ICML top reviewer! I really enjoy reviewing and Iโm happy to know that Iโm apparently also good at it ๐
29.05.2025 08:04 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
To be presented at ICML 2025
arxiv.org/pdf/2405.17556
02.05.2025 16:57 โ ๐ 2 ๐ 1 ๐ฌ 0 ๐ 0
|| assistant prof at University of Montreal || leading the systems neuroscience and AI lab (SNAIL: https://www.snailab.ca/) ๐ || associate academic member of Mila (Quebec AI Institute) || #NeuroAI || vision and learning in brains and machines
Director of Research, DAIR
Roller derby athlete
https://alex-hanna.com
Book: thecon.ai
Pod+newsletter: https://dair-institute.org/maiht3k
๐ช๐ฌ๐ณ๏ธโโง๏ธ She/ูู
๐ธ @willtoft.bsky.social
Rep๐ @ianbonaparte.bsky.social
I do SciML + open source!
๐งช ML+proteins @ http://Cradle.bio
๐ Neural ODEs: http://arxiv.org/abs/2202.02435
๐ค JAX ecosystem: http://github.com/patrick-kidger
๐งโ๐ป Prev. Google, Oxford
๐ Zรผrich, Switzerland
METR is a research nonprofit that builds evaluations to empirically test AI systems for capabilities that could threaten catastrophic harm to society.
AI (security, privacy, HPC) at UoM, researcher who rides bikes to go nowhere, and lifts heavy things for fun. Surprisingly interested in Iranian brickwork (just weird like that).
Working on creativity, curiosity and interestingness. PhD @ IDSIA with Jรผrgen Schmidhuber in Lugano, Switzerland. Classical pianist.
https://vincentherrmann.github.io
Assistant Professor at the Department of Computer Science, University of Liverpool.
https://lutzoe.github.io/
A next-generation code editor that enables high-performance collaboration with AI and your team.
zed.dev
milkshake, boys, yards, schreiben รผber Patriarchat (erst der Dinge, jetzt der Mythen)
1/2 von @femshelfcontrol.bsky.social
Foto: Andrew Collberg
interests: software, neuroscience, causality, philosophy | ex: salk institute, u of washington, MIT | djbutler.github.io
ML, ฮป โข language and the machines that understand it โข https://ocramz.github.io
@Penn Prof, deep learning, brains, #causality, rigor, http://neuromatch.io, Transdisciplinary optimist, Dad, Loves outdoors, ๐ฆ , c4r.io
Trustworthy AI, Counterfactual Explanations, Open-Source Software and other PhD things at TU Delft.
@julialang.org developer @taija.org
Website: www.patalt.org
A figment of the universe's imagination. Culture ambassador. http://elsewhereunbound.com
Dog lover, book lover and cannabis aficionado. MA native.
Deputy Editor, Our World in Data
Senior Researcher, University of Oxford
Climate, energy, environment, all things data.
Random initial conditions: M between 0.1 & 150 solar masses | Speed less than 10 km/s | Positions in a 50x50 AU box | code and more info at https://github.com/kirklong/ThreeBodyBot
data justice, tech policy, cities & nighttime governance | asst prof of data science / writer | theyโขthem ๐ณ๏ธโโง๏ธ | ๐ง๐ท๐จ๐ฆ๐บ๐ธ
Postdoc at ETH. Formerly, PhD student at the University of Cambridge :)