I hope you've found this post helpful.
Follow me for more.
Subscribe to my FREE newsletter chatomics to learn bioinformatics divingintogeneticsandgenomics.ck.page/profile
@tommytang.bsky.social
Director of bioinformatics at AstraZeneca. subscribe to my youtube channel @chatomics. On my way to helping 1 million people learn bioinformatics. Educator, Biotech, single cell. Also talks about leadership. tommytang.bio.link
I hope you've found this post helpful.
Follow me for more.
Subscribe to my FREE newsletter chatomics to learn bioinformatics divingintogeneticsandgenomics.ck.page/profile
5. 42basepairs 42basepairs.com/
02.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 0
4. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets
bitbucket.org/CBGR/interv...
3. The goal of **g(r)osling** is to help you build interactive genomics visualizations with [Gosling](github.com/gosling-lan...). This package uses [reticulate](rstudio.github.io/reticulate/) to provide an interface to the [Gos](github.com/gosling-lan...) Python package.
02.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 0
2. plotgardener is a genomic data visualization package for R. Using `grid` graphics, `plotgardener` empowers users to programmatically and flexibly generate multi-panel figures
github.com/PhanstielLa...
I used that to plot single-cell ATACseq tracks github.com/crazyhottom..., more examples rpubs.com/crazyhottom...
02.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 0
5 tools to visualize genomic datasets 🧵
1. Karyoploter bernatgel.github.io/karyoploter...
A pan-cancer compendium of 1,294 plasma cell-free DNA methylomes and fragmentomes enabling multicancer detection www.nature.com/articles/s4...
01.03.2026 15:15 — 👍 7 🔁 2 💬 0 📌 0
I hope you've found this post helpful.
Follow me for more.
Subscribe to my FREE newsletter chatomics to learn bioinformatics divingintogeneticsandgenomics.ck.page/profile
10. whimsical.com/
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 09. Latex www.baeldung.com/cs/latex-fl... if you are geeky
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 08. Lucidchart www.lucidchart.com/pages/ very similar to draw.io
01.03.2026 14:45 — 👍 0 🔁 0 💬 2 📌 07. DiagrammeR if you want to play with R rich-iannone.github.io/DiagrammeR/
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 06. Inkscape github.com/fredhutchio... I use it to make figures (it is free)
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 05. draw.io
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 04. Miro miro.com/aq/ps/flowc...
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 03. Adobe illustrator
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 02. Figma www.figma.com/templates/f... recently bought by Adobe
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 01. Canva www.canva.com/graphs/flow... it can also make a nice logo. I made my Linkedin banner with it.
01.03.2026 14:45 — 👍 0 🔁 0 💬 1 📌 010 tools to make a better flowchart than in PPT 🧵
01.03.2026 14:45 — 👍 10 🔁 1 💬 1 📌 0
I hope you've found this post helpful.
Follow me for more.
Subscribe to my FREE newsletter chatomics to learn bioinformatics divingintogeneticsandgenomics.ck.page/profile
- Validate on truly independent data, not just held-out data from the same source
- If your model works "too well," be suspicious
The best ML in biology isn't just accurate.
It's accurate for the RIGHT reasons.
How to protect yourself:
- Always check what your model actually learned (use SHAP, LIME, GradCAM)
- Watch for confounding: are your labels correlated with batch, site, or technical variables?
It works amazingly well -- until it encounters conformations far from training data.
01.03.2026 14:15 — 👍 0 🔁 0 💬 1 📌 0
4/ AlphaFold and structure memorization.
AlphaFold is incredible. But research has shown that for fold-switching proteins, some of its predictions are driven by memorization of training-set structures rather than learned protein physics.
3/ another example: Substantial batch effects in TCGA exome sequences undermine pan-cancer analysis of germline variants
pmc.ncbi.nlm.nih.gov/articles/PM...
It Shows that deep learning models trained on TCGA histology data can infer submitting site rather than actual biology, inflating prognostic accuracy.
www.cell.com/cancer-cell...
2/ cancer predictions
Howard et al. (2023) - "Multimodal deep learning: An improvement in prognostication or a reflection of batch effect?" Cancer Cell.
But many were learning hospital-specific artifacts -- image contrast, positioning (AP vs PA), and even hospital labels embedded in the images.
The model learned WHICH HOSPITAL the X-ray came from, not what COVID lungs look like.
1/ COVID-19 chest X-ray classifiers.
Multiple deep learning models claimed to detect COVID from chest X-rays with high accuracy.