Really glad to have been a part of this super cool project... LLMs can verbalize more than just a single confidence number, and we can evaluate their ability to do so!
02.10.2025 19:39
Many treat uncertainty = a number. At Apple, we're rethinking this: LLMs should output strings that reveal all information of their internal distributions. We find that Reasoning, SFT, CoT can't do it - yet. To get there, we introduce the SelfReflect benchmark.
arxiv.org/pdf/2505.20295
01.10.2025 09:53
Now that @interspeech.bsky.social registration is open, time for some shameless promo!
Sign up and join our Interspeech tutorial: Speech Technology Meets Early Language Acquisition: How Interdisciplinary Efforts Benefit Both Fields. 🗣️👶
www.interspeech2025.org/tutorials
⬇️ (1/2)
27.05.2025 16:14
Probabilistic ML researcher at Google Deepmind
Senior Staff Research Scientist @Google DeepMind, previously Stats Prof @Oxford Uni - interested in Computational Statistics, Generative Modeling, Monte Carlo methods, Optimal Transport.
full-time ML theory nerd, part-time AI-non enthusiast
natural language processing and computational linguistics at google deepmind.
Associate Professor (UHD) at the University of Amsterdam. Probabilistic methods, deep learning, and their applications in science and engineering.
Research Scientist @ Apple | Previously @ Toyota Research Institute and Google | PhD from Georgia Tech.
Research Scientist at Apple for uncertainty quantification.
Professor of Statistics and Machine Learning at UCL Statistical Science. Interested in computational statistics, machine learning and applications in the sciences & engineering.
He teaches information science at Cornell. http://mimno.infosci.cornell.edu
Professor of Statistics @ ESSEC Business School Asia-Pacific campus Singapore 🇸🇬
https://pierrealquier.github.io/
Previously: RIKEN AIP 🇯🇵 ENSAE Paris 🇫🇷 🇪🇺 UCD Dublin 🇮🇪 🇪🇺
Random posts about stats/maths/ML/AI, poor jokes & birds photo
Research fellow @OxfordStats @OxCSML, spent time at FAIR and MSR
Former quant (@GoldmanSachs), former former gymnast 🤸
My opinions are my own
🇧🇬-🇬🇧 sh/ssh
Cofounder & CTO @ Abridge, Raj Reddy Associate Prof of ML @ CMU, occasional writer, relapsing 🎷, creator of d2l.ai & approximatelycorrect.com
Professor, Santa Fe Institute. Research on AI, cognitive science, and complex systems.
Website: https://melaniemitchell.me
Substack: https://aiguide.substack.com/
So far I have not found the science, but the numbers keep on circling me.
Views my own, unfortunately.
DeepMind Professor of AI @Oxford
Scientific Director @Aithyra
Chief Scientist @VantAI
ML Lead @ProjectCETI
geometric deep learning, graph neural networks, generative models, molecular design, proteins, bio AI
Research Director, Founding Faculty, Canada CIFAR AI Chair @VectorInst.
Full Prof @UofT - Statistics and Computer Sci. (x-appt) danroy.org
I study assumption-free prediction and decision making under uncertainty, with inference emerging from optimality.
Machine learning prof at U Toronto. Working on evals and AGI governance.
Machine learning, environmental modeling, sustainability, robotics
Professor @UCL
He/him
Professor of HCII and LTI at Carnegie Mellon School of Computer Science.
jeffreybigham.com