OSF
It'ssssss PREPRINT THURSDAY! (Sadly not a thing). Approximately an era ago, I spent a *lot* of time discussing the assumptions that underpin the field of #ComputationalPskychiatry with the brilliant @jonroiser.bsky.social and @olijrobinson.bsky.social osf.io/preprints/ps...
10.07.2025 11:07 โ ๐ 52 ๐ 13 ๐ฌ 1 ๐ 1
I think deep linear networks are an example of this, where you have a deep model with just the capacity of regular linear regression.
01.06.2025 09:45 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Bullshitting Engines?
25.05.2025 07:57 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
To be fair, if you're doing analytic philosophy, "bullshit engine" reads as an engine that is bullshit, not an engine that engages in the communicative act of bullshitting, because engines don't communicate or have concerns, they set things in motion.
25.05.2025 07:56 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Today, weโre announcing the preview release of ty, an extremely fast type checker and language server for Python, written in Rust.
In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)
13.05.2025 17:00 โ ๐ 333 ๐ 84 ๐ฌ 14 ๐ 14
Demand characteristics - Wikipedia
It's in the opposite direction though, once your participants know how you want them to behave, they're pretty likely to behave that way.
en.m.wikipedia.org/wiki/Demand_...
18.04.2025 18:37 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0
I think it's a choose-your-own-null adventure kind of thing, like you're worried it might be. I can see the (bad) argument, if you allow lots of parameters to vary either side of the dotted line, any differences prove that the line is important?
06.04.2025 19:42 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Um.
statmodeling.stat.columbia.edu/2021/11/21/s...
06.04.2025 19:13 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
I'm on the job market, and like many, I hate LinkedIn. So I've done what any product-oriented data scientist would: built my own workflow. With browser automation/scraping, data pipelines, structured LLM outputs, and a fun way of using @notion.com as a UI.
eointravers.com/blog/job-scr...
24.03.2025 14:47 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Data-for-good rapid response team
I'm collecting people with data science skills who may want to hear about sporadic opportunities to use their skills for good on short projects. Project themes will center on topics I am connected wit...
I'm organizing a "Data for Good Rapid Response Team", i.e. a set of people who can be mobilized to work on short data science projects to help various groups and organizations. Sign up here if this is of interest to you, and please share with others with data science skills.
23.03.2025 17:21 โ ๐ 96 ๐ 53 ๐ฌ 3 ๐ 2
For one-in-three, itโs ~70.4%. One-in-four, ~68.4%. As n increases, the answer gets closer and closer to
1โ1/e โ 63.2%, where e is Eulerโs number.
en.wikipedia.org/wiki/E_(math...
Why 1-1/eโ? Honestly, you would have to ask someone better at maths than me, but I think itโs a pretty cool result.
20.03.2025 11:50 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
So the prob. that it does happen at least once is the probability that it *doesnโt not happen*,
1 - (1 - 1/n)^n
For a one-in-two chance, this works out as
1 - (1 - 1/2)^2 = 1 - 1/4 = 75%
20.03.2025 11:50 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
The prob. of trying twice and it not happening is the prob. of it not happening the first time, times the prob. of it not happening the second time:
(1 - 1/n) * (1 - 1/n), or (1 - 1/n)^2
The prob. of it not happening in n attempts is
(1 - 1/n)^n
20.03.2025 11:50 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
If you take a one-in-n chance, the probability of it coming off is 1/nโ. If you roll a six-side die, the probability of rolling a 6 is 1/6โ.
The prob. of the event not occurring is one minus the probability that it does occur: 1 - 1/n
20.03.2025 11:50 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
But first, @xkcd.com
xkcd.com/882/
20.03.2025 11:50 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Taking One-In-N Chances
...
Stats time.
If you take a one-in-n chance n times (e.g. taking 10 one-in-10 chances), whatโs the probability that at least one of them will come off?
Somewhat satisfyingly, the answer, regardless of what n is, turns out to be โaround 63%โ. Hereโs why.
(Also at eointravers.com/blog/one-in-n/)
20.03.2025 11:50 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Poker is probably an interesting case study here, because AFAIK expert poker players don't try to solve K-level theory of mind problems, they just have really good heuristics.
19.03.2025 23:43 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
In principle, that might mean we can get LLMs to reason under uncertainty pretty well if we fine-tune on the right heuristics?
19.03.2025 23:41 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
There's an old idea in psychology (e.g. core.ac.uk/download/pdf...) that when people do perform well under uncertainty, it's because they're pattern matching using the right heuristics, rather than doing Bayesian inference.
19.03.2025 23:41 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
To be fair, humans are famously also pretty bad at it, so this one might be a draw.
19.03.2025 19:13 โ ๐ 6 ๐ 0 ๐ฌ 1 ๐ 0
Humble-brag:
Over the past two years at Unmind, I got to build two AI features: Nova AI wellbeing coach, and AI practitioner matching. It's lovely to see both get a call out in Fast Company's list of the most innovative companies in the workplace for 2025.
www.fastcompany.com/91270254/wor...
19.03.2025 17:19 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Comic strip from the linked page.
Yes, but also: xkcd.com/927/
19.03.2025 11:33 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
*Please repost* @sjgreenwood.bsky.social and I just launched a new personalized feed (*please pin*) that we hope will become a "must use" for #academicsky. The feed shows posts about papers filtered by *your* follower network. It's become my default Bluesky experience bsky.app/profile/pape...
10.03.2025 18:14 โ ๐ 522 ๐ 296 ๐ฌ 23 ๐ 83
a statue of a hippopotamus with its mouth open and teeth showing .
Alt: A hippo being tossed a watermelon, which it crushes in its massive jaws.
I have been doing entirely too much earnest posting about deep things recently, I need to do a proper thread about hippo testicles or something just to keep myself sane.
Oh by the way hippos have migratory testicles.
12.03.2025 05:33 โ ๐ 3749 ๐ 1236 ๐ฌ 148 ๐ 1039
and, as an ilustration, uses an LLM to quickly check the job spec against my requirements. A browser extension is a more mature way of achieving the same thing, but would take me considerably more time to get up and running.
10.03.2025 16:47 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Automation frameworks for testing like Selenium (selenium-python.readthedocs.io/index.html) are underrated as a way of quickly prototyping new AI-driven UXs. For example, this script, (<1hr work with Cursor) will open a dedicated browswer that saves every job ad I open to a database [...]
10.03.2025 16:47 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
R, Snowflake, and MacOS: A match made in Hell. ๐
07.03.2025 09:57 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
The ultimate human data platform to power world-changing AI and research.
๐ www.prolific.com
AI, Engineer, Developer, Traveler, Technologist, @fastdotai Fellow
Machine Learning researcher at Google Health, working on human centered medical AI. Trained in neuroscience, untrained but enthusiastic cyclist
Co-Founder & CEO, Sakana AI ๐ โ @sakanaai.bsky.social
https://sakana.ai/careers
I work on AI at OpenAI.
Former VP AI and Distinguished Scientist at Microsoft.
Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain.Personal account.
Research scientist at Anthropic.
PhD in machine learning from the University of Toronto and Vector Institute.
Prev: NVIDIA, Google
Researching reasoning at OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series / ๐
๐ฌResearch Scientist, Meta AI (FAIR).
๐PhD from McGill University + Mila
๐โโ๏ธI study Multimodal LLMs, Vision-Language Alignment, LLM Interpretability & Iโm passionate about ML Reproducibility (@reproml.org)
๐https://koustuvsinha.com/
AGI safety researcher at Google DeepMind, leading causalincentives.com
Personal website: tomeveritt.se
Engineering at OpenAI. Formerly working on Fuschia at Google
research scientist @deepmind. language & multi-agent rl & interpretability. phd @BrownUniversity '22 under ellie pavlick (she/her)
https://roma-patel.github.io
Research Scientist at GDM. Statistician. Mostly work on Responsible AI. Academia-industry flip-flopper.
Staff Research Scientist at Google DeepMind. Artificial and biological brains ๐ค ๐ง
Sr. Principal Research Manager at Microsoft Research, NYC // Machine Learning, Responsible AI, Transparency, Intelligibility, Human-AI Interaction // WiML Co-founder // Former NeurIPS & current FAccT Program Co-chair // Brooklyn, NY // http://jennwv.com
Ethical/Responsible AI. Rigor in AI. Opinions my own. Principal Researcher @ Microsoft Research. Grumpy eastern european in north america. Lovingly nitpicky.
Gemini Post-Training @ Google DeepMind
Previously:ย ETH Zurich, Cambridge, CERN
alizeepace.com
PhD Candidate, University of Washington
https://salonidash.com/
Using qualitative & computational methods to study writers on the internet. I study how machine learning experts communicate. I study the interaction between writers & audiences. Professor @ University of Illinois, Urbana Champaign