New Paper: Towards a science of AI agent reliability
Quantifying the capability-reliability gap
A nice piece of work: Towards a science of AI agent reliability
AI 'reliability lags capability, and that reliability will remain a barrier to deployment unless researchers and developers focus effort on improving reliability as a separate dimension from accuracy'
www.normaltech.ai/p/new-paper-...
26.02.2026 08:36 β
π 3
π 2
π¬ 2
π 0
We canβt be sure an AI βClawBotβ did post a nasty blog post when their code was rejected, but an ArsTechnica journalist did use AI hallucinated quotes when reporting the storyβ¦.
25.02.2026 09:02 β
π 0
π 0
π¬ 1
π 0
Totally agree this is a big risk. I was in a session with big consultants showing me how agents would do all sorts of clever things with our suppliers & stakeholders. Yet they were assuming they wouldnβt also be pointing agents at our services. Spam and race conditions beckonβ¦
23.02.2026 18:21 β
π 5
π 0
π¬ 0
π 0
Driving success through digital adoption
Digital adoption is about people. Our work on Data Hub shows that change is a process, and through collaboration and support teams can embrace tools that help improve workflows, strengthen business en...
It's so easy to fall into a "build it and they will come" mindset when creating digital products & services. We work to overcome this with our wonderful Engagement & Strategic Adoption team, here are reflections on how we've boosted adoption of our CRM tool digitaltrade.blog.gov.uk/2026/02/17/d...
17.02.2026 11:27 β
π 1
π 0
π¬ 0
π 0
Oh behave!! π
17.02.2026 10:57 β
π 0
π 0
π¬ 0
π 0
Humble-brag there Simon!
17.02.2026 10:33 β
π 0
π 0
π¬ 1
π 0
Walkman.land
The most complete portable pocket audio cassette player database. WML is a tribute to the Walkmans.
Nostalgia alert - illustrated catalogue of Walkman models (not just Sony ones) walkman.land via @densediscovery.bsky.social
17.02.2026 09:00 β
π 1
π 0
π¬ 0
π 0
A new evaluator role to support modern digital government
Discover the new Digital Evaluator role! Learn how it ensures accountability, value for money, and continuous improvement in UK government digital projects.
We have long championed the importance of monitoring & evaluation in digital teams, so we're delighted to have succeeded in getting 'digital evaluator' added as a role in the UK Govt Digital & Data framework digitaltrade.blog.gov.uk/2026/02/12/a...
12.02.2026 12:27 β
π 4
π 1
π¬ 0
π 0
I'm going to share a very good story about the internet. I won't tell it well because I apparently have the flu, but I think we all need a good story about the internet.
It starts in 2021, when I took a sabbatical and got a little workspace to write a design book.
05.02.2026 04:11 β
π 127
π 40
π¬ 4
π 5
Empowering Women in Tech: DBTβsΒ focusΒ onΒ inclusion andΒ innovation
βDiversity in Digital and Data leads to better outcomes for all of DBT and the businesses we support in the UK and beyond.β
Iβm so incredibly proud of our team at Dept Business & Trade who won well deserved recognition as Women in Techβs Best Public Sector Employer 2025. In this post we share practical insight into how we beat the sector for recruiting & retaining brilliant women digitaltrade.blog.gov.uk/2026/01/27/e...
28.01.2026 08:37 β
π 4
π 0
π¬ 0
π 0
A User Centred approach to helping colleagues collaborate by improving DBTβs People Finder
How DBT's Employee Experience team combined user research, design and delivery expertise to improve the department's intranet.
With about 8,000 colleagues working in more than 100 countries around the world for Department for Business and Trade, finding the right person to talk to can be a blocker to collaboration. Read more about how we've been iterating our People Finder digitaltrade.blog.gov.uk/2025/12/08/a...
08.12.2025 15:21 β
π 2
π 0
π¬ 0
π 0
Launching the Export Support Chatbot: Empowering small businesses to go global
How DBT's Export Support Chatbot will supply small businesses with the knowledge they need to export.
In the lead up to Small Business Saturday, we're sharing more about our work at DBT launching an Export Support AI chatbot on business.gov.uk and learning from our users, read more from Kai HellstrΓΆm on our blog digitaltrade.blog.gov.uk/2025/11/27/l...
27.11.2025 11:38 β
π 2
π 0
π¬ 0
π 0
Push through on Ted Lasso, it has so much heart.
19.11.2025 21:50 β
π 1
π 0
π¬ 0
π 0
Fabulous: Shrinking, Ted Lasso, Silo, Morning Show, For All Mankind, Bad Sisters, Sunny. Didnβt like Loot or Big Door Prize
19.11.2025 21:49 β
π 2
π 0
π¬ 0
π 0
Yes it's all in progress and will re-open.
17.11.2025 14:35 β
π 1
π 0
π¬ 1
π 0
How AI-ready content can boost policy effectiveness
As use of generative AI grows, so does the opportunity to meet businessesβ quests for information. Only if DBTβs online content is designed to meet usersβ needs.
Our brilliant Head of Content Design Andrea Leary writes how increasing use of AI bots to find businesses answers is driving some of Dept Business & Trade's thinking for content design, relevance and accuracy. Read the post digitaltrade.blog.gov.uk/2025/11/11/h...
11.11.2025 17:07 β
π 3
π 1
π¬ 0
π 0
Really interesting read, thanks for sharing
23.10.2025 13:21 β
π 2
π 0
π¬ 0
π 0
Launching DBT's first public-facing AI feature
How DBT created it's first public-facing online tool to help businesses find funding opportunities.
Our Data & AI teams are on a roll with great work and brilliant blog posts. Read about how we built our first public AI feature for Dept Business & Trade on business.gov.uk here digitaltrade.blog.gov.uk/2025/10/21/l...
21.10.2025 09:39 β
π 1
π 0
π¬ 0
π 0
Am rather enjoying the Liquid Glass look in iOS/WatchOS 26. Intrigued to see how it works on MacOS when I do that update...
24.09.2025 09:34 β
π 0
π 0
π¬ 0
π 0
Update: We are prioritising code on PyPi and aim to start re-opening by end of this month.
03.09.2025 11:06 β
π 2
π 0
π¬ 1
π 0
I've got a meeting today to discuss progress on re-opening, will update after.
03.09.2025 09:10 β
π 1
π 0
π¬ 1
π 0
We are working on it!
15.08.2025 10:54 β
π 1
π 0
π¬ 1
π 0
Strengthening European digital sovereignty with AI-powered tools
How a DBT data team created an AI fact-checking tool during the Paris Hackathon.
Our AI cubed team, part of the Data & AI services portfolio led by Sian Thomas MBE, are doing brilliant work. Read about their trip to visit DBT France and their participation in a hackathon to challenge misinformation digitaltrade.blog.gov.uk/2025/07/23/s...
25.07.2025 09:58 β
π 2
π 0
π¬ 0
π 0