AI models that lie, cheat and plot murder: how dangerous are LLMs really?
Tests of large language models reveal that they can behave in deceptive and potentially harmful ways. What does this mean for the future?
I'm pretty sceptical about 'AI scheming', as it's so easy to anthropomorphise & experiments often involve telling the AI to do the bad thing they end up doing.
To understand what's behind the hype, read this smart & sober overview from@silverjacket.bsky.social π§ͺπ€
www.nature.com/articles/d41...
09.10.2025 10:59 β π 9 π 1 π¬ 0 π 0
Over at @naturepodcast.bsky.social we also made a super short vid on this year's physics Nobel Prize. Quantum phenomena at the macroscopic scale in two minutes... go! With me, and Ben Thompson & camera work by the fab @emzywb.bsky.social π§ͺβοΈ
www.youtube.com/shorts/krita...
08.10.2025 11:16 β π 34 π 12 π¬ 0 π 0
Groundbreaking quantum-tunnelling experiments win physics Nobel
John Clarke, Michel Devoret and John Martinis discovered quantum physics on a macroscopic scale, paving the way for quantum computing.
John Martinisβ wife didnβt wake him in the middle of the night (California time) to tell him he had won a Nobel. βI got up a little bit before 6. Then I opened my computer and saw John and Michelβs and my pictures."
Story by @lizziegibney.bsky.social and me
www.nature.com/articles/d41...
07.10.2025 16:30 β π 10 π 5 π¬ 0 π 1
Is there a reason we picked this slightly terrifying still? π€£
02.10.2025 09:18 β π 0 π 0 π¬ 1 π 0
HSBC claims quantum trading breakthrough
Europeβs largest lender tested a tool developed by IBM on bond market data
I'm sure there's valuable science in there but experts I consulted were not super convinced by the paper (why does it work better with noise? Why didn't they compare to the best available classical algo?) We won't cover it. Alas others already have&without outside comment www.ft.com/content/d9d4...
26.09.2025 11:35 β π 10 π 1 π¬ 2 π 0
HSBC unleashes yet another βqombieβ: a zombie claim of quantum advantage that isnβt
Today, I got email after email asking me to comment on a new paper from HSBCβyes, the bankβtogether with IBM. The paper claims to use a quantum computer to get a 34% advantage in predicβ¦
I was going to post a rant about the perils of quantum computing hype, following the claim by #HSBC & #IBM that they have improved bond market predictions using a QC, but instead I will just link to Scott Aaronson's blog which says it all scottaaronson.blog?p=9170 π§ͺβοΈ cc @bullshitquantum.bsky.social
26.09.2025 11:35 β π 31 π 7 π¬ 3 π 1
Also read @helenpearson.bsky.social & @heidiledford.bsky.social's excellent story unpicking the origins of Trump's paracetamol/autism claims www.nature.com/articles/d41...
This quote sums it up: "We do not think that taking acetaminophen is in any way contributing to actually causing autism"
23.09.2025 11:27 β π 4 π 1 π¬ 0 π 0
Bring us your LLMs: why peer review is good for AI models
Deepseekβs R1 model has been peer reviewed. Others should follow the firmβs example.
And here's Nature's take on why more AI developers should follow suit and put their LLMs through the peer review wringer. The process is far from perfect, but it seems a valuable counterbalance against AI hype and good for clarity & safety www.nature.com/articles/d41...
17.09.2025 19:10 β π 5 π 0 π¬ 0 π 0
Secrets of DeepSeek AI model revealed in landmark paper
First peer reviewed study shows how a Chinese start-up firm made the market-shaking LLM for US$300,000.
Remember DeepSeek's R1 model that crashed the US stock market in Jan? DeepSeek has said it did not boost the model by training on OpenAI outputs. This and much more (eg $$ to train & technical details) revealed in the firm's peer reviewed paper out in Nature today π§ͺπ€ www.nature.com/articles/d41...
17.09.2025 19:10 β π 17 π 4 π¬ 1 π 0
The top line is we're never going to get rid of hallucinations as it's just the way LLMs are built: they're not understanding, they're guessing based on stats. But maybe LLMs can be better fine-tuned to sound less confident, so humans aren't so taken in by them & use them more appropriately?
11.09.2025 09:11 β π 24 π 4 π¬ 6 π 0
Can researchers stop AI making up citations?
OpenAIβs GPT-5 hallucinates less than previous models do, but cutting hallucination completely might prove impossible.
ICYMI, this week I wrote about GPT-5 and the effort to stop LLMs from hallucinating, as well as saying they've done tasks they haven't. With web search to verify facts, finding citations is getting a lot better. But as soon as it's off, things can still go very wonkyπ§ͺπ€ www.nature.com/articles/d41...
11.09.2025 09:11 β π 12 π 6 π¬ 3 π 0
OpenAI launches reasoning LLM that you can download and tweak
One version of the gpt-oss large language model can run on a laptop, and performs nearly as well as the companyβs most powerful models.
OpenAI's new open-weight #AI model looks to be a powerful reasoner. It's small enough to use locally and they've done loads to make it available.
But has it entered the game too late to become the go-to for researchers? Do ping me if you're trying it out!
www.nature.com/articles/d41... π€π§ͺ
07.08.2025 10:55 β π 7 π 4 π¬ 2 π 0
Global Quantum Needs Assessment Survey β Diversity In Quantum
Hey quantum fans. Here's a slightly different #quantum survey from my mind-boggling one on interpretations, but seems important! This group hopes to understand how to help quantum scientists to stay in the field & thrive. If you have time, you can fill it out hereπ www.diviq.org/survey π§ͺβοΈ
05.08.2025 15:29 β π 2 π 6 π¬ 0 π 0
Physicists disagree wildly on what quantum mechanics says about reality, Nature survey shows
First major attempt to chart researchersβ views finds interpretations in conflict.
Oh & THERE'S A QUIZ! In the vein of "what Spice Girl are you?" from a 90s issue of Bliss, just answer a few questions (in the story www.nature.com/articles/d41...) & we will tell you which quantum interpretation you are (& do let me know which one you think correlates with which spice girl π)
30.07.2025 14:19 β π 12 π 3 π¬ 1 π 1
Please read, share and disagree with my write up in the spirit of Bohr, Einstein and the best in quantum physics πAll the data is also available to download, if you'd like to analyse it yourself.
30.07.2025 14:19 β π 10 π 1 π¬ 2 π 0
Thanks also to the organisers of the Helgoland2025 conference who helped distribute the survey, to the +1,100 people who kindly responded & to editor extraordinaire @richvn.bsky.social, who learned far more about quantum foundations than he ever wished to & it would have been impossible without π
30.07.2025 14:19 β π 6 π 0 π¬ 1 π 0
We asked a bunch more in-depth questions in a survey crafted in consultation with experts @fdlevi.bsky.social @seanmcarroll.bsky.social @philipcball.bsky.social Renato Renner & Elise Crull, to whom I am extremely grateful (& who all put up with me badgering them repeatedly over the last few months)
30.07.2025 14:19 β π 8 π 0 π¬ 1 π 0
I found it fascinating to unpack how there is so much disagreement, given that everyone is working with the same equations. As Renner put it, each interpretation has its faults & the price someone is willing to pay comes down to something personal. βItβs a very deeply emotional thing,β he says.
30.07.2025 14:19 β π 7 π 0 π¬ 1 π 0
But epistemic approaches, which say quantum states capture information or knowledge rather than anything real, seem to be growing in prominence at 17% (+4% for @rovelli.bsky.social's relational quantum mechanics). "Many worlds" had a strong following at 15% & lots of folk had their own explanations
30.07.2025 14:19 β π 8 π 0 π¬ 1 π 0
It may be no surprise to some, but we found there exists a LOT of disagreement, even among the most eminent in the field, about what quantum mechanics means for reality. The survey revealed that 36% of respondents favourite the traditional Copenhagen interpretation (particularly experimentalists)
30.07.2025 14:19 β π 9 π 2 π¬ 2 π 0
Physicists disagree wildly on what quantum mechanics says about reality, Nature survey shows
First major attempt to chart researchersβ views finds interpretations in conflict.
This year quantum physics turns 100, so @nature.com decided to conduct the biggest ever survey of what lies behind it. Do physicists really believe in multiple universes? Can influences happen instantaneously? Is the Copenhagen interpretation all it's cut out to be? βοΈπ§ͺ www.nature.com/articles/d41...
30.07.2025 14:19 β π 85 π 47 π¬ 11 π 10
Spain bids β¬400 million to host mega telescope at risk in US budget cuts
New Canary Islands home could save controversial Thirty Meter Telescope first proposed for Hawaii.
The Thirty Meter Telescope β long planned to be built in Hawaii but now with its US funding on the chopping block β could be given a new lease of life in Spain, following a government bid to host at the telescope on La Palma. Will the TMT board accept?
π§ͺπ www.nature.com/articles/d41...
26.07.2025 17:43 β π 64 π 16 π¬ 1 π 2
I'm a journalist at Nature and often write and publish stories about astronomy and related topics (policy as well as science)
25.07.2025 10:15 β π 0 π 0 π¬ 1 π 0
yes
25.07.2025 10:12 β π 0 π 0 π¬ 1 π 0
@bot.astronomy.blue signup
25.07.2025 10:11 β π 2 π 0 π¬ 1 π 0
Moonshot AI has now published its technical report about Kimi K2
github.com/MoonshotAI/K...
The 32-page doc says the model trained on NVIDIA H800 GPUs, including post-training reinforcement learning on agentic examples
My story from last week is here: www.nature.com/articles/d41...
22.07.2025 12:11 β π 7 π 1 π¬ 0 π 0
Contributing Writer at The New Yorker, freelance science & tech writer, fire dancer. Into cognitionβanimal and mineral (aka psych & AI).
Science filmmaker and TTRPG performer over with The RPGeeks π§¬π²
Batesic Bitch
Often found counting ducks over at www.twitch.tv/emilywb
Founder & PI @aial.ie, @tcddublin.bsky.social
AI accountability, AI audits & evaluation, critical data studies. Cognitive scientist by training. Ethiopian in Ireland. She/her
Science journalism by Leonid Schneider πΊπ¦
Read at your own risk, people got sacked for much less. Posts may contain #antifa and #russophobia
https://forbetterscience.com/
Theorist in Waterloo. Quantum information, quantum computing, science, parenting. https://gsbsmith.ca
science journalist | good physics, bad physics, and sometimes ugly physics
Signal: dgaristo.72
Email: digaristo@gmail.com
AI writer at the Economist. I write about it, that is. Iβm still human. One of literally dozens of people online who is not American.
UK membership organisation researching and campaigning on climate change and sustainability, military emissions, nuclear disarmament, AI and robotics, and more.
https://www.sgr.org.uk/
Researching data, tech, futures, and biological sciences in education | Senior Lecturer and co-director at the Centre for Research in Digital Education | University of Edinburgh | Editor of Learning, Media and Technology @lmt-journal.bsky.social
Staff writer @quantamagazine.bsky.social covering computer science. Formerly freelance physics writer (Quanta, SciAm, Physics Today, elsewhere), ex-physicist. [Obligatory disclaimer about views being my own.]
Incoming faculty at the Max Planck Institute for Software Systems
Postdoc at UW, working on Natural Language Processing
Recruiting PhD students!
π https://lasharavichander.github.io/
Science journalist is just a fancy way of saying "professional nerd." USian in Austria, language geek, collector of fine yellow zigzagged sweaters and etymology fun facts. Get my newsletter about big questions in science: www.reviewertoo.com π½ππ¦
Australian astronomer
he/him
Sydney/Gadi
#PantaSeietai #AgentOfBohemian #Engineerπ
INTJ/INFJ 5w4β Wassertiger
#KarmicArchitect #SpockMindπ #HydrogenFutureSociety
Thoroughβ’Criticalβ’Constructive
TheseβAntitheseβSynthese
#ConsciusContextualis #Nexialist #GnΕthiSeautΓ³n
EineFrage:werBinich-werSeidIhr?
Physics, philosophy, complexity. @jhuartssciences.bsky.social & @sfiscience.bsky.social. Host, #MindscapePodcast. Married to @jenlucpiquant.bsky.social.
Latest books: The Biggest Ideas in the Universe.
https://preposterousuniverse.com/
Official Spice Girls Bluesky Account
https://linktr.ee/Spiceworld_25
A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef
Writes http://interconnects.ai
At Ai2 via HuggingFace, Berkeley, and normal places
Senior reporter @resprofnews.bsky.social. Tips? frances.jones@clarivate.com. Shortlisted for Newcomer of the Year 2025 @absw.bsky.social. All views my own.
We grow healthy, interconnected communities across the world. The only Asian American-run science newsroom: independent, nonprofit, from the ground up!
Get newsletter: thexylom.com/subscribe
Support us: https://fundrazr.com/sustain-the-xylom?ref=cr_1EHOhb
ICREA Research Professor in Barcelona