Nezar Abdennur

@nvictus.bsky.social

computational biologist / biological computer / asst prof @UMassChan / phd @MIT / http://abdenlab.org

103 Followers  |  149 Following  |  24 Posts  |  Joined: 24.11.2024

Latest posts by nvictus.bsky.social on Bluesky

In the genomics community, we have focused pretty heavily on achieving state-of-the-art predictive performance.

While undoubtedly important, how we *use* these models after training is potentially even more important.

tangermeme v1.0.0 is out now. Hope you find it useful!

27.08.2025 16:20 | 👍 45    🔁 14    💬 1    📌 0
Nezar Abdennur - Accelerating Genomic Data Science and AI/ML with Composability | SciPy 2025
YouTube video by SciPy

My talk on #Composability in genomic software at #SciPy2025 is up on YouTube where I showcase both #anywidget and #oxbow.

Thank you to the organizers for the opportunity to present this to both computational biologists and the wider scientific computing community!

www.youtube.com/watch?v=G22_...

25.08.2025 17:20 | 👍 2    🔁 1    💬 0    📌 0
Abdennur, Lekschas, & Manz - Bring your __repr__’s to life with anywidget | SciPy 2024
YouTube video by SciPy

if interested in creating anywidgets of your own, our tutorial was finally shared to youtube:

www.youtube.com/watch?v=frEo...

07.08.2025 20:00 | 👍 7    🔁 3    💬 2    📌 1

Our #anywidget tutorial from last year's #SciPy conf was uploaded to youtube! Check it out for a hands-on walkthrough to create your own web-based widgets.

19.08.2025 00:20 | 👍 1    🔁 0    💬 1    📌 0

We anticipate that joint dimensionality reduction and projection will become a foundational norm for comparative and integrative analysis of long-range interaction profiles in Hi-C/3C+ data. For example, existing methods for working with classic A/B vectors can be extended to joint higher-order embeddings.

11.08.2025 20:41 | 👍 0    🔁 0    💬 0    📌 0

We used jointly-hic to create an atlas of 89 human Hi-C samples, uncovering distinct patterns of nuclear architecture associated with heterochromatin composition and demonstrating how higher-order principal components capture missing information about gene expression and regulatory element activity.

11.08.2025 20:41 | 👍 0    🔁 0    💬 1    📌 0

jointly-hic accomplishes this using mini-batch incremental PCA, allowing for joint decomposition of arbitrarily many contact matrices at any resolution with constant memory.

11.08.2025 20:41 | 👍 0    🔁 0    💬 1    📌 0
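
To make the constant-memory idea concrete, here is a rough NumPy sketch. It is not jointly-hic's code: it swaps in a simpler stand-in for incremental PCA, accumulating running sums over mini-batches and eigendecomposing the pooled covariance at the end. The function names, the row/column layout, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def joint_pca_streaming(batches, n_components):
    """Fit one PCA over a stream of row batches, holding one batch at a time.

    Each batch is a 2D array of shape (rows, n_features). Only running sums
    of size O(n_features**2) are kept, regardless of how many batches arrive.
    """
    n_rows = 0
    sum_x = None    # running sum of rows, shape (n_features,)
    sum_xtx = None  # running sum of batch.T @ batch, shape (n_features, n_features)
    for batch in batches:
        batch = np.asarray(batch, dtype=float)
        if sum_x is None:
            n_features = batch.shape[1]
            sum_x = np.zeros(n_features)
            sum_xtx = np.zeros((n_features, n_features))
        n_rows += batch.shape[0]
        sum_x += batch.sum(axis=0)
        sum_xtx += batch.T @ batch
    if n_rows == 0:
        raise ValueError("no data was provided")

    mean = sum_x / n_rows
    cov = sum_xtx / n_rows - np.outer(mean, mean)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order].T  # components as rows


def project(batch, mean, components):
    """Embed rows into the shared coordinate system defined by the joint fit."""
    return (np.asarray(batch, dtype=float) - mean) @ components.T


# Toy stand-ins for rows drawn from two samples (hypothetical data).
rng = np.random.default_rng(0)
scales = np.arange(8, 0, -1)
sample_a = rng.normal(size=(200, 8)) * scales
sample_b = rng.normal(size=(200, 8)) * scales

stream = iter([sample_a[:100], sample_a[100:], sample_b[:100], sample_b[100:]])
mean, components = joint_pca_streaming(stream, n_components=3)

# Both samples are projected onto the same axes, so embeddings are comparable.
emb_a = project(sample_a, mean, components)
emb_b = project(sample_b, mean, components)
```

Because every sample is projected with the same `mean` and `components`, the resulting coordinates can be compared directly, which is the point of fitting jointly.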

Joint decomposition allows for robust and directly comparable low dimensional representations of arbitrarily many contact maps, providing insights into genome organization across diverse biological contexts, from different tissues to developmental stages.

11.08.2025 20:41 | 👍 0    🔁 0    💬 1    📌 0

The classic A/B compartment track comes from matrix factorization of a contact matrix into eigenvectors or PCs. Done separately, each map is projected onto a different coordinate system. Comparing such vectors directly is problematic, especially if seeking info from **higher-order** components.

11.08.2025 20:41 | 👍 2    🔁 0    💬 1    📌 0
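
A small NumPy illustration of why separately computed components are hard to compare. The two matrices here are synthetic and hypothetical, not Hi-C data: they share the same underlying patterns, but the patterns differ in strength, so the component ranked first in one map corresponds to the component ranked second in the other.

```python
import numpy as np

# An arbitrary orthonormal basis shared by two hypothetical "maps".
rng = np.random.default_rng(0)
basis, _ = np.linalg.qr(rng.normal(size=(4, 4)))

# Same underlying patterns, but the two strongest swap strength between maps.
map_a = basis @ np.diag([3.0, 2.0, 1.0, 0.5]) @ basis.T
map_b = basis @ np.diag([2.0, 3.0, 1.0, 0.5]) @ basis.T


def top_components(matrix, k):
    """Return the k leading eigenvectors (as rows) of a symmetric matrix."""
    eigvals, eigvecs = np.linalg.eigh(matrix)
    order = np.argsort(eigvals)[::-1][:k]
    return eigvecs[:, order].T


pcs_a = top_components(map_a, 2)
pcs_b = top_components(map_b, 2)

# Absolute cosine similarity; the absolute value is needed because each
# eigenvector is only defined up to sign.
same_rank = abs(pcs_a[0] @ pcs_b[0])     # PC1 of A vs PC1 of B: near 0
swapped_rank = abs(pcs_a[0] @ pcs_b[1])  # PC1 of A vs PC2 of B: near 1
```

Matching components by rank alone would pair unrelated patterns here, and the sign ambiguity adds a second source of mismatch even when the ranks agree.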
GitHub - abdenlab/jointly-hic: Genomics research toolkit for jointly embedding Hi-C 3D chromatin contact matrices into the same vector space

We introduce a framework and Python toolkit (github.com/abdenlab/joi...) for analyzing compartmentalization and long-range interactions in chromosome conformation capture data.

11.08.2025 20:41 | 👍 0    🔁 0    💬 1    📌 0
Joint decomposition of Hi-C maps reveals salient features of genome architecture across tissues and development
The spatial organization of chromosomes in the nucleus is fundamental to cellular processes. Contact frequency maps from Hi-C and related chromosome conformation capture assays are increasingly availa...

We're excited to share our new preprint, "Joint decomposition of Hi-C maps reveals salient features of genome architecture across tissues and development", led by Thomas Reimonn. www.biorxiv.org/content/10.1...

11.08.2025 20:41 | 👍 23    🔁 10    💬 1    📌 1
Analysis-ready VCF at Biobank scale using Zarr
Abstract. Background: Variant Call Format (VCF) is the standard file format for interchanging genetic variation data and associated quality control metrics.

Yes, and more recently Zarr too academic.oup.com/gigascience/...

While oxbow makes legacy data more accessible, it is a good conduit to more general-purpose persistent storage.

09.07.2025 07:02 | 👍 2    🔁 0    💬 0    📌 0
How To: Reduce Friction and Save Time with Tangermeme - tangermeme v0.1.0 documentation

A huge challenge I face when doing ML + genomics analysis is *friction*: the stupid error messages (wrong device!) and dumb implementation issues that snap you out of the zone. I wrote a vignette on how tangermeme has helped me reduce this friction:

tangermeme.readthedocs.io/en/latest/ho...

13.05.2025 17:34 | 👍 14    🔁 3    💬 1    📌 0

(4) bpnet-lite: Load official Chrom/BPNet models into PyTorch for downstream tangermeme integration. Improved command-line tools + docs. Still concerns about perf of models trained from scratch -- will be resolved next version!

github.com/jmschrei/bpn...

bsky.app/profile/jmsc...

30.06.2025 18:38 | 👍 2    🔁 1    💬 1    📌 0

We’re excited and eager for feedback, so please give oxbow a try!

`pip install oxbow`

07.07.2025 21:22 | 👍 2    🔁 0    💬 0    📌 0
Breaking the silo: composable bioinformatics through cross-disciplinary open standards | SciPy 2025
The practice of data science in genomics and computational biology is fraught with friction. This is in large part because bioinformatic tools tend to be tightly coupled to file input/output. As a res...

I’m also excited to be presenting Oxbow as part of my talk on composability at the #SciPy2025 Conference on Wednesday! Hope to see some of you there.

cfp.scipy.org/scipy2025/ta...

07.07.2025 21:22 | 👍 9    🔁 3    💬 2    📌 0

It also supports:

* Column projection and pushdown (parsing only the fields you need)
* Complex and nested field types (e.g. alignment tags, variant genotype call data, etc.)
* Genomic range-based queries via an index
* User-defined transports and file systems

07.07.2025 21:22 | 👍 2    🔁 0    💬 1    📌 0
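
For readers unfamiliar with column projection, here is a toy standard-library sketch of the general idea. It does not use or reproduce oxbow's API: the function, the schema layout, and the BED-like sample records are all hypothetical.

```python
import io


def read_projected(lines, schema, fields):
    """Yield dicts holding only the requested fields of tab-delimited records.

    `schema` maps a field name to (column index, converter). Converters run
    only for requested fields, so unrequested columns are split but never
    converted into typed values.
    """
    wanted = [(name, *schema[name]) for name in fields]
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment/header lines
        columns = line.rstrip("\n").split("\t")
        yield {name: convert(columns[index]) for name, index, convert in wanted}


# A BED-like schema, used here purely for illustration.
BED_SCHEMA = {
    "chrom": (0, str),
    "start": (1, int),
    "end": (2, int),
    "name": (3, str),
    "score": (4, float),
}

text = "chr1\t100\t200\tpeakA\t7.5\nchr1\t300\t450\tpeakB\t3.0\n"
records = list(read_projected(io.StringIO(text), BED_SCHEMA, ["chrom", "start"]))
```

A real reader would push the projection further down, for instance by skipping the decoding of nested or binary fields entirely; the sketch only shows the shape of the interface.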

This update (v0.4.x) provides complete #ApacheArrow data models for 11 file formats and counting, including the GA4GH/htslib formats and UCSC’s BigWig/BigBed.

07.07.2025 21:22 | 👍 1    🔁 0    💬 1    📌 0

We revamped the #rustlang backend and implemented a new "DataSource" API in #Python, which allows for streaming conventional #genomic files – in-memory, on-disk, or in the cloud – into the modern data tools you use regularly, including #Pandas, #Polars, #DuckDB, and #Dask.

07.07.2025 21:22 | 👍 1    🔁 0    💬 1    📌 0

I'm proud to announce the latest release of 🧬 #Oxbow 🏹, with new features to make NGS data analysis more powerful, efficient, and "composable".

Learn more at: oxbow.readthedocs.io

07.07.2025 21:22 | 👍 23    🔁 12    💬 2    📌 2

Year in Review: Code and composition in 2024

A belated ode to 2024! lifeinbytes.substack.com/p/year-in-re...

06.02.2025 14:36 | 👍 5    🔁 1    💬 0    📌 0
SciPy 2025: Schedule, talks and talk submissions for SciPy 2025

It's @scipyconf.bsky.social submission time again! This year I am running the Bioinf/Comp Bio/Neuroscience track, covering how Python is advancing science and problem-solving in those fields. Let me know if y'all have questions, and looking forward to the submissions!

10.01.2025 15:42 | 👍 46    🔁 11    💬 2    📌 2

There is support for inequality joins, which probably don't cover everything we want. Haven't benchmarked, but wouldn't be surprised if it's competitive with bespoke data structures.

17.12.2024 22:07 | 👍 2    🔁 0    💬 0    📌 0

Yes, we think it's simply a matter of getting bioinformatics data "out of its shell"! That task is no short order though...

17.12.2024 22:01 | 👍 2    🔁 0    💬 0    📌 0
GitHub - abdenlab/oxbow: Read specialized NGS formats as data frames in R, Python, and more.

here’s a project i’ve been involved with trying to slurp up various file formats as apache arrow github.com/abdenlab/oxbow

17.12.2024 21:49 | 👍 7    🔁 1    💬 2    📌 0
