This review examines #opensource solutions for managing heterogeneous #bioinformatics data using the data lakehouse paradigm. The authors assess tools for storage, ingestion, and metadata management, define key system requirements, and evaluate alignment with FAIR principles.
05.08.2025 10:30 β π 1 π 1 π¬ 0 π 0
ποΈ Just out in Bioinformatics Advances: "Review of open-source software for developing heterogeneous data management systems for bioinformatics applications"Β Β
Explore the full study: https://doi.org/10.1093/bioadv/vbaf168
Authors include: @tuliodna.bsky.social
05.08.2025 10:30 β π 0 π 0 π¬ 1 π 0
ScaleSC is a GPU-accelerated pipeline for single-cell #RNA-seq analysis that supports datasets with up to 20 million cells. Built on Scanpy and Rapids-singlecell, it resolves key CPU/GPU discrepancies, improves scalability, and includes tools for marker identification and cluster merging.
05.08.2025 09:30 β π 0 π 0 π¬ 1 π 0
π§ Just out in Bioinformatics Advances: "ScaleSC: A superfast and scalable single cell RNA-seq data analysis pipeline powered by GPU"Β Β
Explore the full study: https://doi.org/10.1093/bioadv/vbaf167
05.08.2025 09:30 β π 1 π 0 π¬ 1 π 0
GitHub - BioinfoUNIBA/IsoPrimer: A primer-designing pipeline for the amplification of splicing variants
A primer-designing pipeline for the amplification of splicing variants - BioinfoUNIBA/IsoPrimer
π οΈ IsoPrimer is freely accessible on GitHub, complete with documentation, example data, and configuration templates to support custom RNA-seq validation workflows:
04.08.2025 10:02 β π 0 π 0 π¬ 0 π 0
IsoPrimer automates the design of PCR primer pairs that account for RNA-seq-derived expression of alternative splicing variants. It prioritizes primers based on expression, targets exonβexon junctions, and performs in silico validation to ensure specificity.
04.08.2025 10:02 β π 0 π 0 π¬ 1 π 0
𧬠Now published in Bioinformatics Advances: "IsoPrimer: a pipeline for designing isoform-aware primer pairs for comprehensive gene expression quantification"Β
Explore the full study: https://doi.org/10.1093/bioadv/vbaf171
04.08.2025 10:02 β π 3 π 0 π¬ 1 π 0
GitHub - malikmtahir/LOCO-m6A
Contribute to malikmtahir/LOCO-m6A development by creating an account on GitHub.
π» Access the source code, example datasets, and detailed documentation to apply these models for robust m6A site prediction across genomic regions:
04.08.2025 09:02 β π 0 π 0 π¬ 0 π 0
Evaluated with Random-Split and Leave-One-Chromosome-Out strategies, the models outperform existing approaches by improving accuracy and generalizability across chromosomes.
04.08.2025 09:02 β π 0 π 0 π¬ 1 π 0
This study introduces two hybrid #deeplearningmodels that combine k-mer composition and one-hot encoded CNN features to predict m6A RNA modification sites.
04.08.2025 09:02 β π 0 π 0 π¬ 1 π 0
𧬠Just out in Bioinformatics Advances: "Hybrid representation learning for human m6A modifications with chromosome-level generalizability"Β Β
Full article available: https://doi.org/10.1093/bioadv/vbaf170
04.08.2025 09:02 β π 0 π 0 π¬ 1 π 0
By simulating steady-state fluxes from a kinetic model and translating them as tighter flux bounds, the approach resolves artificial pathway bifurcations between growth and citramalate production in engineered strains.
01.08.2025 10:02 β π 0 π 0 π¬ 1 π 0
This work integrates kinetic data into genome-scale constraint-based models of E. coli to produce more realistic flux predictions.
01.08.2025 10:02 β π 0 π 0 π¬ 1 π 0
π§ͺ Now available in Bioinformatics Advances: "Enhancing genome-scale metabolic models with kinetic data: Resolving growth and citramalate production trade-offs in E. coli"Β Β
Explore the full study: https://doi.org/10.1093/bioadv/vbaf166
01.08.2025 10:02 β π 1 π 0 π¬ 1 π 0
GitHub - asvarvara/MitSorter: Stand-alone tool for methylation-based discrimination of mitochondrial DNA ONT reads
Stand-alone tool for methylation-based discrimination of mitochondrial DNA ONT reads - asvarvara/MitSorter
π» You can find MitSorterβs source code, documentation, and setup instructions to support precise ONT read classification and reproducible mitochondrial variant analyses here:
01.08.2025 09:04 β π 1 π 0 π¬ 0 π 0
It aligns reads to the mitochondrial genome with base modification calls, partitions them by methylation status, and improves accuracy in mtDNA variant calling and downstream analyses.
01.08.2025 09:04 β π 0 π 0 π¬ 1 π 0
MitSorter is a command-line tool that separates Oxford Nanopore long reads into mtDNA and NuMT categories by exploiting differences in CpG methylation.
01.08.2025 09:04 β π 0 π 0 π¬ 1 π 0
π§« Recently published in Bioinformatics Advances: "MitSorter: A standalone tool for accurate discrimination of mtDNA and NuMT ONT reads based on differential methylation"Β Β
Read the full paper here: https://doi.org/10.1093/bioadv/vbaf135
01.08.2025 09:04 β π 1 π 1 π¬ 1 π 0
GitHub - muellan/rmapalign3n: fast 3N read aligner
fast 3N read aligner. Contribute to muellan/rmapalign3n development by creating an account on GitHub.
π₯οΈ You can access RMapAlign3N along with detailed documentation, example data, and setup instructions to enable reproducible mapping of nucleotide conversion reads:
31.07.2025 10:01 β π 0 π 0 π¬ 0 π 0
It combines MinHash-based indexing with multiple sequence alignment filtering, outperforming HISAT-3N, BSMAP, Bismark, and SLAM-DUNK on simulated and real BS-seq and SLAM-seq datasets while scaling efficiently on multi-core CPUs.
31.07.2025 10:01 β π 0 π 0 π¬ 1 π 0
RMapAlign3N is a C++ tool for mapping 3N-converted sequencing reads with improved speed and accuracy.
31.07.2025 10:01 β π 0 π 0 π¬ 1 π 0
𧬠Now published in Bioinformatics Advances: "RMapAlign3N: Fast Mapping of 3N-Reads"Β
Explore the full study: https://doi.org/10.1093/bioadv/vbaf164
31.07.2025 10:01 β π 0 π 0 π¬ 1 π 0
GitHub - liviurotiul/PeGAS
Contribute to liviurotiul/PeGAS development by creating an account on GitHub.
π» PeGAS is open source and freely available with documentation, example data, and installation instructions for reproducible bacterial genome analysis:
31.07.2025 09:02 β π 0 π 0 π¬ 0 π 0
It offers quality control, assembly assessment, and comprehensive visualization in streamlined, reproducible workflows.
31.07.2025 09:02 β π 0 π 0 π¬ 1 π 0
PeGAS is an integrated pipeline for bacterial genome analysis that combines AMR profile prediction, virulence factor detection, plasmid replicon assignment, MLST typing, and pangenome analysis.
31.07.2025 09:02 β π 0 π 0 π¬ 1 π 0
π¦ Now available in Bioinformatics Advances: "PeGAS: A Versatile Bioinformatics Pipeline for AMR, Virulence and Pangenome Analysis"Β Β
Full article available:
31.07.2025 09:02 β π 1 π 0 π¬ 1 π 0
It supports flexible prior specification, handles missing age data, and includes options for sex-specific modeling to improve genetic risk assessment in hereditary syndromes.
30.07.2025 11:01 β π 0 π 0 π¬ 1 π 0
Computational Biology at Goethe University Frankfurt
Neuroscientist and molecular engineer at the Vollum Institute (OHSU). Our lab maps the molecular, cellular and synaptic organization of the brain through viral genomic technology. Opinions my own.
ISCB Student Council Symposium (SCS) 2025 is in Liverpool, UK, on July 20, ahead of ISMB/ECCB 2025 -the largest & most influential bioinformatics conference! ππ¬
π Call for Abstracts Now Open!
https://easychair.org/account2/signin
Boeing Endowed Professor in the Allen School of Computer Science & Engineering at the University of Washington. Interested in AI/ML, computational biology, and AI in medicine. https://suinlee.cs.washington.edu/
Regulatory genomics, machine learning, networks, systems biology, evolution
Chief AI Officer @ UHN; Assistant Prof. @ U of Toronto; CIFAR AI Chair @ Vector Institute; AI & Biology
Book: https://thecon.ai
Web: https://faculty.washington.edu/ebender
My lab works on genome evolution, using yeast genetics and genomics. Chair, Department of Genome Sciences at University of Washington.
Lab website: https://depts.washington.edu/dunhamlab/
Statistical geneticist. Associate Prof at Dana-Farber / Harvard Medical School.
www.gusevlab.org
machine learning and functional/statistical genetics. Associate Prof @Columbia and Core Faculty @nygenome. he/him/his. https://daklab.github.io/
Scientist interested in all aspects of brain and behavior. Loves to talk math and data. Prof of Biology at University of Washington, Seattle. Mom of 2 epsilons; paints in watercolor here, there, and anywhere.
AI4Science researcher. Associate Professor @CSHL. My lab advances AI for genomics and healthcare!
http://koo-lab.github.io
physicist, interested in biology --- Assoc. Prof. of physics @UW
(Seattle)
Physician/scientist, partner at Khosla Ventures, investing in the future. 25 years of experience in the intersection of AI/ML, biotech and healthcare.
Personal interests in art (painting, drawing, photography), sci-fi/fantasy, anthropology, and history.
co-founder & president @newlimitbio.bsky.social | Prev PI @calico, PhD @ucsf | interested in aging, machine learning, genomics, & the technology production function
https://jck.bio
Asst prof at Duke University. Co-founder at LayerHealth.
ML and NLP for healthcare.
PhD MIT, BS/MS Stanford. She/her.
Assoc Prof @UW π³οΈβπ. Engineering human organoids, organs, and societies. Views my own. She/her.
assistant prof of CS @RiceUniversity https://ylab.rice.edu
computational biology, disease genomics w/ a focus on AD and cancer
learning to use bsky
Computational Biology, Genomics, Machine Learning, AI. Assistant Professor at UTSW