Data analysis in Rust - Data Analysis in Rust
A very good read for machine learning if you using RUST π¦ and Polars. so a very good book. ericfecteau.ca/data/rust-da.... You can use the Linfa or the smartcore for the implementation of the machine learning algorithm.
06.11.2025 16:28 β π 2 π 3 π¬ 0 π 0
how do we feel about putting JSON in the INFO field of a VCF?
06.11.2025 23:03 β π 2 π 1 π¬ 6 π 0
Vcfexpress: flexible, rapid user-expressions to filter and format VCFs
AbstractMotivation. Variant Call Format (VCF) files are the standard output format for various software tools that identify genetic variation from DNA sequ
This is very cool work and I'm happy to see it published. Vcfexpress by @brent-p.bsky.social and @aaronquinlan.bsky.social allows building (essentially) arbitrary VCF filters expressed in lua code with parsing & eval powered by rust!
academic.oup.com/bioinformati...
06.03.2025 15:05 β π 25 π 8 π¬ 1 π 1
vcfexpress/examples at main Β· brentp/vcfexpress
expressions on VCFs. Contribute to brentp/vcfexpress development by creating an account on GitHub.
vcfexpress applies simple user expressions variants in a VCF.
it can replace one-off python scripts to manipulate VCFs, likely with better performance.
we'd like to collect use-cases here: github.com/brentp/vcfex...
if you have a use-case and want some pointers, open an issue
04.02.2025 17:05 β π 4 π 1 π¬ 0 π 0
vcfexpress is a command-line tool built in rust that lets users apply lua expressions to modify or filter a vcf from the command-line
github.com/brentp/vcfex...
new release with better docs github.com/brentp/vcfex...
and examples github.com/brentp/vcfex...
28.01.2025 17:45 β π 3 π 0 π¬ 0 π 0
awesome! it'll take me some time to digest this but this gives me a great start. thank you!
27.01.2025 17:48 β π 1 π 0 π¬ 1 π 0
and thanks for your offer to help!
27.01.2025 17:17 β π 0 π 0 π¬ 1 π 0
error[E0599]: the method `indexed_records` exists for struct `Query<'_, &mut Reader<BufReader<File>>>`, but its trait bounds were not satisfied
--> src/lib.rs:374:27
|
374 | let q = q.indexed_records(&header).filter_by_region(®ion);
| ^^^^^^^^^^^^^^^ method cannot be called on `Query<'_, &mut Reader<BufReader<File>>>` due to unsatisfied trait bounds
|
= note: the following trait bounds were not satisfied:
`&mut noodles::noodles_bgzf::Reader<BufReader<File>>: noodles::noodles_bgzf::io::BufRead`
`&mut noodles::noodles_bgzf::Reader<BufReader<File>>: noodles::noodles_bgzf::io::Seek`
I think it might require more interactive help, but here's the error. I understand why, but not how to architect my code to fix.
27.01.2025 17:17 β π 0 π 0 π¬ 1 π 0
I am looking for a mentor for the rust programming language. My latest issue is with trait bounds (github.com/brentp/simpl...) but I have a few things I generally hit. I can compensate with $$ or interesting problems. :)
Please share with relevant people and feel free to DM.
27.01.2025 15:42 β π 4 π 2 π¬ 1 π 0
with Jason K from @aaronquinlan.bsky.social lab, I have been dusting off fraguracy, which evaluates sequencing error rates using the portion of bases from paired end reads that overlap. New release adds, among other niceties, tracking for distance to homopolymers
github.com/brentp/fragu...
25.11.2024 17:48 β π 5 π 1 π¬ 1 π 0
also see BITS alg by Ryan Layer and @aaronquinlan.bsky.social academic.oup.com/bioinformati...
20.11.2024 22:59 β π 4 π 0 π¬ 1 π 0
Assistant Professor at Pitt Biostatistics and Health Data Science. Previously WEHI Bioinformatics and UNC Biostatistics.
Bioinformatics Software Engineer. PhD from Schloss Lab at UMich.
https://sovacool.dev
ππ»ββοΈπ΄π»ββοΈπ§π»ββοΈ
#python #rstats #nextflow #snakemake
she/her
My views are my own.
Freelancing: Biostats / Genomics / CRISPR
github.com/Jfortin1
Assistant Professor at Baylor College of Medicine & Texas Children's Hospital | Human genetics and single-cell genomics | Formerly Columbia Med & Duke
Our long-term research goal is to understand and predict gene regulation based on DNA sequence information and genome-wide experimental data.
Postdoc at Princess Maxima Centre for pediatric oncology. Utrecht, NL.
Working to realize the promise of human genomics to better predict, prevent, and treat cancer. For more info: https://labs.dana-farber.org/collins-genomics/
He/Him | GitHub-> https://github.com/omicscode | Plants, Microbes, Bioinformatician, AI/ML/DL , Scientific Developer, Data Engineer -> https://www.icloud.com/iclouddrive/088QrZHmTgS5_iJv8RpT1-yLg#GauravSablok | Read all Post & Replies | I donβt vibe code
Ellumigen is the Intelligent Transcriptome company. We supply insights and evidence that enable data-driven decisions, provide confidence in the underlying biology, and increase the probability of technical and clinical success at every step.
shuh-SHAHNK | PhD Candidate, Computational Biologist, Ailurophile :)
PhD Candidate @ UT Dallas in the Functional Genomics Lab working on Nascent RNA identification
@nf-co.re
Bioinformatics Engineer @seqera.io
https://link.edmundmiller.dev/
Biologist using genes 𧬠and computers π» to study evolution and spread of invasive Insect pests. Senior Research Scientist @ Agriculture Victoria. Adjunct Research Fellow @ LaTrobe University π¦πΊ
Assistant Professor, Cold Spring Harbor Laboratory
Systems immunology to understand thymus physiology and T cell development
@RingAScientist, @SkypeScientist
Bioinformatician working with plant trancriptomics, genetic variants and genomic stuff.
Currently, I am a postdoc researcher at University of Minnesota.
Discover the Languages of Biology
Build computational models to (help) solve biology? Join us! https://www.deboramarkslab.com
DM or mail me!
Aging lab @Stanford. Our interests include mechanisms of aging, brain aging and rejuvenation, neural stem cell aging, genetics of lifespan and suspended animation in killifish
Scientist | Computational Biologist | Genome Assembly and Annotation | Comparative genomics | Plant Pangenomics |
Bioinformatics Scientist at the Arc Institute.
Working at the intersection of functional genomics, systems biology, and machine learning. I also build rusty bioinformatics tools
https://github.com/noamteyssier
Postdoctoral researcher at UC Berkeley | Microalgae, environmental stress, molecular and synthetic biology | Formerly @ UniversitΓ© Paris Sud, Sorbonne UniversitΓ©, AgroParisTech, and ISA Lille.
https://scholar.google.com/citations?user=f5eOF_YAAAAJ