Charlie Harris's Avatar

Charlie Harris

@harrisbio.bsky.social

PhD @ Cambridge in AI for Bio | Interested in generative modelling for drug discovery and science policy πŸ‡¬πŸ‡§ Website: cch1999.github.io Blog: harrisbio.substack.com Database: harrisbio.notion.site

971 Followers  |  370 Following  |  39 Posts  |  Joined: 20.11.2024  |  2.1064

Latest posts by harrisbio.bsky.social on Bluesky

Post image

Small personal update: very pleased to be in Singapore next week to present 2 spotlight papers at ICLR 2025 on AI for molecular design!! πŸ‡ΈπŸ‡¬

DM me if you want to meet up and chat about AI for bio, drug discovery, science policy or just chat about aviation!!!

14.04.2025 10:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Huge thanks all experts who have contributed w/ input and feedback!

@rory.bio @areeq.bsky.social @leecronin.bsky.social @erika-alden.bsky.social @econormist.bsky.social @saakohl.bsky.social @mariokrenn.bsky.social @stianwestlake.bsky.social @harrisbio.bsky.social @richardaljones.bsky.social

11/11

21.02.2025 11:40 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I’m very biased but great list of ML for Drug Discovery and resources and blogs by @wpwalters.bsky.social !

23.01.2025 15:06 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

16/ What’s clear is that getting data and compute right is essentialβ€”not just for breakthroughs in science but for keeping the UK competitive globally.

Here’s hoping this plan gets the funding, leadership, and focus it needs to succeed!

Happy to chat about any of this.

end

13.01.2025 19:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

15/ Side note: UK universities could supercharge their AI teaching by embracing industry expertise (where the real knowledge is)

I teach a course at Cambridge led by a DeepMind researcher, and it’s the most popular in the department.

13.01.2025 19:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

14/ US universities do this well with CS minors, which foster computational literacy across disciplines.

The UK could adopt similar models to produce scientists who are not only domain experts but also skilled at applying AI tools to their fields.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

13/ The plan has solid ideas on AI skills, but it's not *just* about creating more "AI graduates." We need to train domain experts in the natural sciences to understand and use AI effectively.

Almost all scientists should know neural networks as well as they know Excel and stats

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

12/ Another standout: the plan proposes an internal headhunting team within the UK Government to attract top global talent to AISI, the UK Sovereign AI Team, and UK-based companies.

Will they also have the power to fast-track visas? From experience, i hope so....

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

11/ The UK Sovereign AI Team could be a great connector of
-Public institutions creating scientific datasets
-Industrial labs capable of training models on those datasets

This sort of collaboration could really unlock breakthroughs in science

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

10/ The plan’s proposal to create a UK Sovereign AI Team is great. This unit will partner with private and academic sectors to back national champions and remove roadblocks in AI, with a strong focus on AI for science and robotics.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

(Usual reminder that AlphaFold3 was trained for 120k+ GPU hours... this is multiple times more than the whole compute budget of my lab this year)

13.01.2025 19:17 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

9/ Another question: who will the AIRR programme directors work for? UKRI? ARIA?

Will they be empowered to deploy large amount of compute into highly productive groups at the cutting edge?

There is no point in this if it means everyone only gets a few GPU hours each.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

9/ One question: will these AIRR programme directors also decide how funding is allocated for data generation?

For scientific initiatives, compute and data strategies are deeply interconnected. Ideally, the same person would oversee both to ensure alignment.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

8/ Another standout is the creation of AIRR programme directorsβ€”mission-focused individuals with autonomy to strategically allocate compute to high-potential projects.

A kind of "Compute Czar" role, this could significantly accelerate progress on big bets in AI for science.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

7/ However, there’s a risk of duplicating efforts where existing world-class institutions, like the EBI managing the PDBe, are already doing excellent work.

Not every problem needs to fit into a National Data Library-sizedβ„’ hole. Let’s build on what we already have!

13.01.2025 19:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

6/ People often say, "Big Pharma has lots of data!"β€”but much of it is unstructured and sparse, making it unsuitable for deep learning.

The plan acknowledges this challenge and recommends creating better infrastructure and incentives to make datasets AI-ready.

13.01.2025 19:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

5/ That’s why I’m thrilled to see the plan emphasise strategic data initiatives:
-Identifying high-impact datasets
-Improving data quality
-Incentivising researchers and companies to unlock and curate datasets

These efforts will make sparse, unstructured datasets better for AI

13.01.2025 19:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

4/ If we want breakthroughs beyond protein folding, we need to address data gaps across science.

AlphaFold was made possible by sustained investment in protein structure data.

Similar long term commitments are essential for other fields like materials and climate science.

13.01.2025 19:17 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

3/ AI breakthroughs like AlphaFold wouldn’t be possible without decades of work on datasets.

e.g., AlphaFold was trained on protein structures from the Protein Data Bank (PDB), which took 50+ years and ~$20 *billion* to create.

This is the kind of foundational effort AI needs.

13.01.2025 19:17 β€” πŸ‘ 18    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Post image

2/ There’s a lot to like in this plan:
- Expanding UK AI compute capacity by 20x
- Establishing AI Growth Zones
- Building up AI talent pipelines

But as a scientist, what excites me most is the report’s focus on **data**β€”an area we really need to get right.

13.01.2025 19:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

1/ Just read through the Matt Clifford AI Action Plan now.

Tl;dr: it's great but here are a few things that stood out to me as someone interested in AI for Science and sovereign compute and data capability.

A thread: 🧡

13.01.2025 19:17 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

A common issue I see in ML, both from ML "experts" and "users", is overly optimistic assumptions.

"experts" (people designing algs) usually assume the data is very simple

"users" (people using algs) usually assume that algorithms are more robust than they really are

Conclusion: always be careful!

10.01.2025 09:40 β€” πŸ‘ 19    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

Added NewCo Kerna Labs, a new AI-first mRNA payload design company founded by former Moderna CSO with $6M in seed.

Also added new Cradle Bio series B worth $73M

11.01.2025 16:35 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Just added Graph Therapeutics, a new startup in Vienna focusing on precision medicine for inflammation and immunology

Founded by former Allcyte team

09.01.2025 09:37 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Flyer for a Benzon Symposium on Protein structure prediction and design in biology and pharmacology (Sept. 1–4, 2025). Speakers include: Gabriel Rocklin, Birte HΓΆcker, Amy Keating, Tanja Kortemme, Bruno Correia, Sarel  Fleishman, Ashutosh Chilkoti, Minkyung Baek, Noelia Ferruz, James Fraser, Alan  Moses, Susan Marqusee, Ben Lehner, Mohammed AlQuraishi, Dek Woolfson, Gustav Oberdorfer, Hannah Wayment-Steele, Ora Schueler-Furman, Jenifer Listgarten, Alexander Rives, & Max Bonomi.

Flyer for a Benzon Symposium on Protein structure prediction and design in biology and pharmacology (Sept. 1–4, 2025). Speakers include: Gabriel Rocklin, Birte HΓΆcker, Amy Keating, Tanja Kortemme, Bruno Correia, Sarel Fleishman, Ashutosh Chilkoti, Minkyung Baek, Noelia Ferruz, James Fraser, Alan Moses, Susan Marqusee, Ben Lehner, Mohammed AlQuraishi, Dek Woolfson, Gustav Oberdorfer, Hannah Wayment-Steele, Ora Schueler-Furman, Jenifer Listgarten, Alexander Rives, & Max Bonomi.

πŸ“£ Save the dates πŸ“…

We are organizing a Benzon Symposium on "Protein structure prediction and design" with what I think is an amazing set of speakers

Meeting will take place in Copenhagen πŸ‡©πŸ‡° on Sept. 1–4, 2025, and abstract submission will open in March (benzon-foundation.dk/benzon-sympo...)

13.12.2024 14:51 β€” πŸ‘ 92    πŸ” 30    πŸ’¬ 1    πŸ“Œ 2
Preview
Structure-based drug design with equivariant diffusion models - Nature Computational Science This work applies diffusion models to conditional molecule generation and shows how they can be used to tackle various structure-based drug design problems

Extremely pleased to announce that after *checks notes* 2 years, our paper on Structure-based Drug Design with diffusion models has been published in Nature Computational Science (@natcomputsci.bsky.social)!!

Thanks a lot to the great co-authors! Esp
@rne.bsky.social & Yuanqi Du.

10.12.2024 15:10 β€” πŸ‘ 41    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Now added Aqemia as well

09.12.2024 11:17 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

How to come SOTA on protein-ligand binding….

Use Vina (and nothing better) on a single starting conformer

Works every time;)

04.12.2024 14:49 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

As always, please do reach out if you think any company should be added. :)

And thanks to those who have already done do! (Too many to tag unfortunately.)

04.12.2024 11:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

- 301.ai (Protein Design)
- ReticularAI (Protein design)
- @deepgenomics.bsky.social (RNA Therapies)
-Scala Biodesign (Protein Design)
-Protai (Protein design)
- Immunai (Antibodies)
-Mana Bio (Drug Delivery)
-Converge Bio (SaaS)
-@popvax.com (Vaccines)
-TernaryTx (Glues)

04.12.2024 11:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@harrisbio is following 20 prominent accounts