Pat Walters's Avatar

Pat Walters

@wpwalters.bsky.social

Cheminformatics, ML, Drug Discovery

1,886 Followers  |  360 Following  |  33 Posts  |  Joined: 17.11.2024  |  2.0637

Latest posts by wpwalters.bsky.social on Bluesky

Post image

New Practical Cheminformatics Post
patwalters.github.io/Three-Papers...

22.07.2025 13:40 β€” πŸ‘ 15    πŸ” 8    πŸ’¬ 0    πŸ“Œ 2
Post image

New Practical Cheminformatics Post, "Useful RDKit Utils - A MΓΆtley Collection of Helpful Routines"
patwalters.github.io/Useful-RDKit...

13.05.2025 13:31 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
The Trouble With Tautomers IntroductionOne factor often overlooked when applying machine learning (ML) in small-molecule drug discovery is the influence of tautomers on model predictions. Drug-like molecules, especially those c...

The latest Practical Cheminformatics post, β€œThe Trouble With Tautomers,” emerged from a discussion about the impact of tautomers on machine learning model predictions.

patwalters.github.io/The-Trouble-...

06.05.2025 13:16 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Thanks, Colab can be picky

24.03.2025 14:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

The latest addition (#33) to the Practical Cheminformatics Tutorials series explores Bayesian optimization of reaction conditions.
github.com/PatWalters/p...

24.03.2025 14:13 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 2

You know how I feel about bar plots for showing distributions, right? πŸ˜€

12.03.2025 00:16 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Even More Thoughts on ML Method Comparisons Introduction A few things motivated this post.Β Β  Some recent discussions about the virtues of LightGBM vs XGBoost Posts on TabPFN by Jon...

In a new Practical Cheminformatics post titled "Even More Thoughts on ML Method Comparisons," I share several plots that I find valuable for comparing machine learning methods.
practicalcheminformatics.blogspot.com/2025/03/even...

10.03.2025 12:56 β€” πŸ‘ 34    πŸ” 9    πŸ’¬ 0    πŸ“Œ 0
γ€€Determine bond order from XYZ format of moleculeΒ #RDKit As readers know that recent version of RDKit supports XYZ format which is used in quantum chemistry field. The format doesn't have information of bond order. So to re-construct from the information bond order determination step is required. Recently I found useful function for determining the order in rdkit and Greg wrote useful post almost two years ago I think it's worth to read.

γ€€Determine bond order from XYZ format of moleculeΒ #RDKit

As readers know that recent version of RDKit supports XYZ format which is used in quantum chemistry field. The format doesn't have information of bond order. So to re-construct from the information bond order determination step is required.…

07.02.2025 13:37 β€” πŸ‘ 13    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0

All of it! Lots of great new datasets. More new blogs coming online with great content. Open science at its best!

25.01.2025 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - PatWalters/resources_2025: Machine Learning in Drug Discovery Resources 2024 Machine Learning in Drug Discovery Resources 2024. Contribute to PatWalters/resources_2025 development by creating an account on GitHub.

Machine Learning in Drug Discovery Resources page updated for 2025. github.com/PatWalters/r...

23.01.2025 14:12 β€” πŸ‘ 94    πŸ” 28    πŸ’¬ 5    πŸ“Œ 3
Preview
GitHub - Graylab/DL4Proteins-notebooks: Colab Notebooks covering deep learning tools for biomolecular structure prediction and design Colab Notebooks covering deep learning tools for biomolecular structure prediction and design - Graylab/DL4Proteins-notebooks

Deep learning for proteins tutorial: github.com/Graylab/DL4P...

28.12.2024 03:14 β€” πŸ‘ 67    πŸ” 28    πŸ’¬ 1    πŸ“Œ 0

Super cool!

15.12.2024 13:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Here's a nice Christmas gift - ChEMBL 35 is out! Use your well-deserved Christmas holidays to spend time with your loved ones and explore the new release of ChEMBL 35! Β Β Β Β Β Β Β Β Β Β  This fresh...

ChEMBL 35 is out. Happy Holidays!
chembl.blogspot.com/2024/12/here...

12.12.2024 15:14 β€” πŸ‘ 27    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0

Thank for the invitation @freundlichgroup.bsky.social. I had a blast!

12.12.2024 03:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Yes, absolutely! The approach was designed to facilitate interpretation by med chemists.

11.12.2024 03:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Ordinal Confidence Level Assignments for Regression Model Predictions We present a simple method for assigning accurate confidence levels to molecular property predictions from regression models. These confidence levels are easy to interpret and useful for making decisi...

Excellent new paper (with code) by my former colleagues Steven Kearnes and Patrick Riley describing a procedure for associating confidence levels with regression model predictions in drug discovery. pubs.acs.org/doi/10.1021/...

10.12.2024 13:02 β€” πŸ‘ 26    πŸ” 11    πŸ’¬ 1    πŸ“Œ 0

Well said, thank you Rommie!

09.12.2024 03:48 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'm not happy with the way this all went down. I have tremendous respect for Gabriele and the rest of the DiffDock authors. Their work has broken new ground and helped advance machine learning in drug discovery. If I had to do it over again, I'd do things differently.

09.12.2024 01:17 β€” πŸ‘ 18    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
More response to the response (DiffDock/DiffDock-L) To amplify on what Pat Walters said (quotes from Gabriel Corso's post in italics): It only took a few days (in August 2023) for Ann Cleves, Pat, and me to figure out that the original DiffDock work ha...

@prof-ajay-jain.bsky.social's response is here. www.linkedin.com/pulse/more-r...

09.12.2024 01:17 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
My Response to the Response Responding to our analysis, the DiffDock authors contend they were not responsible for developing the benchmark dataset or its data splits. Instead, they relied on previously published datasets and sp...

My "response to the response" is here. www.linkedin.com/pulse/my-res...

09.12.2024 01:17 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Response to Jain et al. You may have seen a recent pre-print [1] from Jain et al. with strongly worded claims against the experimental results in our DiffDock paper [2].

Many of you have seen Gabriele Corso's response to our recent preprint. If you haven't, it's here. www.linkedin.com/pulse/respon...

09.12.2024 01:17 β€” πŸ‘ 16    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Post image 08.12.2024 00:35 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I don't know. All the DiffDock results we report are from the original DiffDock paper.

06.12.2024 23:52 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is great, thanks for sharing

06.12.2024 10:58 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Point taken

05.12.2024 16:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

For AutoDock-Vina, we directly compared with what was in the DiffDock paper.

05.12.2024 16:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Deep-Learning Based Docking Methods: Fair Comparisons to Conventional Docking Workflows The diffusion learning method, DiffDock, for docking small-molecule ligands into protein binding sites was recently introduced. Results included comparisons to more conventional docking approaches, wi...

I'm thrilled to announce a new preprint describing collaborative work with Ajay Jain and Ann Cleves Jain, "Deep-Learning Based Docking Methods: Fair Comparisons to Conventional Docking Workflows".

arxiv.org/abs/2412.02889

05.12.2024 16:21 β€” πŸ‘ 94    πŸ” 21    πŸ’¬ 10    πŸ“Œ 5

These have different uses. The one liner gets the SMILES for a particular ligand. My script uses that SMILES to assign bond orders, which are missing in PDB files.

02.12.2024 04:16 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I prefer to think of myself as a realistic enthusiast πŸ˜†

22.11.2024 19:55 β€” πŸ‘ 14    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Hey Polaris!

21.11.2024 21:16 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@wpwalters is following 20 prominent accounts