In the genomics community, we have focused pretty heavily on achieving state-of-the-art predictive performance.
While undoubtedly important, how we *use* these models after training is potentially even more important.
tangermeme v1.0.0 is out now. Hope you find it useful!
27.08.2025 16:20 β π 45 π 14 π¬ 1 π 0
YouTube video by SciPy
Nezar Abdennur - Accelerating Genomic Data Science and AI/ML with Composability | SciPy 2025
My talk on #Composability in genomic software at #SciPy2025 is up on YouTube where I showcase both #anywidget and #oxbow.
Thank you to the organizers for the opportunity to present this to both computational biologists and the wider scientific computing community!
www.youtube.com/watch?v=G22_...
25.08.2025 17:20 β π 2 π 1 π¬ 0 π 0
YouTube video by SciPy
Abdennur, Lekschas, & Manz - Bring your __repr__βs to life with anywidget | SciPy 2024
if interested in creating anywidgets of your own, our tutorial was finally shared to youtube:
www.youtube.com/watch?v=frEo...
07.08.2025 20:00 β π 7 π 3 π¬ 2 π 1
Our #anywidget tutorial from last year's #SciPy conf was uploaded to youtube! Check it out for a hands-on walkthrough to create your own web-based widgets.
19.08.2025 00:20 β π 1 π 0 π¬ 1 π 0
We anticipate that joint dimensionality reduction and projection will become a foundational norm for comparative and integrative analysis of long-range interaction profiles in Hi-C/3C+ data. e.g. existing methods for working with classic A/B vectors can be extended to joint higher-order embeddings.
11.08.2025 20:41 β π 0 π 0 π¬ 0 π 0
We jointly-hic to create an atlas of 89 human Hi-C samples, uncovering distinct patterns of nuclear architecture associated with heterochromatin composition and demonstrating how higher-order principal components capture missing information about gene expression and regulatory element activity.
11.08.2025 20:41 β π 0 π 0 π¬ 1 π 0
jointly-hic accomplishes this using mini-batch incremental PCA, allowing for joint decomposition of arbitrarily many contact matrices at any resolution with constant memory.
11.08.2025 20:41 β π 0 π 0 π¬ 1 π 0
Joint decomposition allows for robust and directly comparable low dimensional representations of arbitrarily many contact maps, providing insights into genome organization across diverse biological contexts, from different tissues to developmental stages.
11.08.2025 20:41 β π 0 π 0 π¬ 1 π 0
The classic A/B compartment track comes from matrix factorization of a contact matrix into eigenvectors or PCs. Done separately, each map is projected onto a different coordinate system. Comparing such vectors directly is problematic, especially if seeking info from **higher-order** components.
11.08.2025 20:41 β π 2 π 0 π¬ 1 π 0
How To: Reduce Friction and Save Time with Tangermeme β tangermeme v0.1.0 documentation
A huge challenge I face when doing ML + genomics analysis is *friction*: the stupid error messages (wrong device!) and dumb implementation issues that snap you out of the zone. I wrote a vignette on how tangermeme has helped me reduce this friction:
tangermeme.readthedocs.io/en/latest/ho...
13.05.2025 17:34 β π 14 π 3 π¬ 1 π 0
(4) bpnet-lite: Load official Chrom/BPNet models into PyTorch for downstream tangermeme integration. Improved command-line tools + docs. Still concerns about perf of models trained from scratch -- will be resolved next version!
github.com/jmschrei/bpn...
bsky.app/profile/jmsc...
30.06.2025 18:38 β π 2 π 1 π¬ 1 π 0
Weβre excited and eager for feedback, so please give oxbow a try!
`pip install oxbow`
07.07.2025 21:22 β π 2 π 0 π¬ 0 π 0
It also supports:
* Column projection and pushdown (parsing only the fields you need)
* Complex and nested field types (e.g. alignment tags, variant genotype call data, etc.)
* Genomic range-based queries via an index
* User-defined transports and file systems
07.07.2025 21:22 β π 2 π 0 π¬ 1 π 0
This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSCβs BigWig/BigBed.
07.07.2025 21:22 β π 1 π 0 π¬ 1 π 0
We revamped the #rustlang backend and implemented a new "DataSource" API in #Python, which allows for streaming conventional #genomic files β in-memory, on-disk, or in the cloud β into the modern data tools you use regularly, including #Pandas, #Polars, #DuckDB, and #Dask.
07.07.2025 21:22 β π 1 π 0 π¬ 1 π 0
I'm proud to announce the latest release of 𧬠#Oxbow πΉ, with new features to make NGS data analysis more powerful, efficient, and "composable".
Learn more at: oxbow.readthedocs.io
07.07.2025 21:22 β π 23 π 12 π¬ 2 π 2
Weβre excited and eager for feedback, so please give oxbow a try!
`pip install oxbow`
07.07.2025 21:18 β π 0 π 0 π¬ 0 π 0
It also supports:
* Column projection and pushdown (parsing only the fields you need)
* Complex and nested field types (e.g. alignment tags, variant genotype call data, etc.)
* Genomic range-based queries via an index
* User-defined transports and file systems
07.07.2025 21:18 β π 0 π 0 π¬ 1 π 0
This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSCβs BigWig/BigBed.
07.07.2025 21:18 β π 0 π 0 π¬ 1 π 0
We revamped the #rustlang backend and implemented a new "DataSource" API in #Python, which allows for streaming conventional #genomic files β in-memory, on-disk, or in the cloud β into the modern data tools you use regularly, including #Pandas, #Polars, #DuckDB, and #Dask.
07.07.2025 21:18 β π 0 π 0 π¬ 1 π 0
SciPy 2025
Schedule, talks and talk submissions for SciPy 2025
It's @scipyconf.bsky.social submission time again! This year I am running the Bioinf/Comp Bio/Neuroscience track, covering how Python is advancing science and problem-solving in those fields. Let me know if y'all have questions, and looking forward to the submissions!
10.01.2025 15:42 β π 46 π 11 π¬ 2 π 2
There is support for inequality joins, which probably don't cover everything we want. Haven't benchmarked, but wouldn't be surprised if it's competitive with bespoke data structures.
17.12.2024 22:07 β π 2 π 0 π¬ 0 π 0
Yes, we think it's simply a matter of getting bioinformatics data "out of its shell"! That task is no short order though...
17.12.2024 22:01 β π 2 π 0 π¬ 0 π 0
Whitehead Institute and Department of Biology, MIT. Lover of cell biology and cell division. Aspiring to do good science and do good.
Teif lab at the University of Essex. We work on gene regulation in chromatin and applications to liquid biopsies, using approaches of genomics, biophysics, bioinformatics & AI. Our focus is nucleosomics, TF binding, CTCF, cfDNA. https://generegulation.org
Magic Science Dude, Round 4. Uncouth, spitting out the sunroof through my missing tooth.
Chromatin structure and function, paternal effect paradigms, reproductive biology and lil embryos.
PI @ Gladstone Institutes & UCSF. Molecular technologies & the genomics / molecular biology / biochemistry of gene regulation. Views here mine & do not represent those of my affiliated institutions.
Stanford genetics PhD student in Bintu & Boettiger labs β’ Gene regulation, 4D genome, cohesin.
Chief Science and Strategy Officer, openRxiv. Co-Founder, bioRxiv and medRxiv.
Evolutionary Biologist at Stanford. Rapid Evolution, Adaptation, and Genomics. Open Science advocate.
Professor at KTH, NY Genome Center, SciLifeLab, working on functional genomics and human genetics.
I am not a deep man, but I have many shallows. ctbrown@ucdavis.edu, http://ivory.idyll.org/blog/.
Professor, Department of Biological Sciences, University of Pittsburgh. RNA polymerase II mechanism, classic films, #classiccountdown once in awhile
Ray and Stephanie Lane Professor of Computational Biology at CMU School of Computer Science. https://www.cs.cmu.edu/~jianma/
Interested in 3D chromatin, cell fate decisions and gene regulation
#3Dgenome, #3R, #chromatin, #synbio, #microbiome... Opinions are my own.
Scientist, father, founder/CEO of PhaseGenomics; opinions are my own and probably not the position of the company.
Microbiologist | Biochemist | Structural Biologist, Lausanne π¨π | Studying molecular machines (SMC, ParB, Wadjet, Lamassu & more) in Genome Integrity & Nucleic Acid Immunity.
π¬ Lab homepage: wp.unil.ch/gruberlab
An American in Germany, an ion channel guy in bioinformatics, a proponent of scientific skepticism in an irrational world.
@devon@genomic.social, @dpryan79 on twitter and github
Professor @ JHU, statistical genomics, www.hansenlab.org
𧬠Scientist (bioinformatics / genomics), π§π»βπ» software developer, πͺ΄ community builder.
Product Manager for open source @seqera.io
Professor of Computer Science @ JHU. https://www.langmead-lab.org/ https://www.youtube.com/BenLangmead
Computational biology, machine learning, AI, RNA, cancer genomics. My views are my own. https://www.morrislab.ai
He/him/his