Laurent Jacob's Avatar

Laurent Jacob

@laurentjacob.bsky.social

Researcher in statistics and machine learning for genomics https://laurent-jacob.github.io/

297 Followers  |  223 Following  |  30 Posts  |  Joined: 20.10.2023  |  2.3444

Latest posts by laurentjacob.bsky.social on Bluesky

legend 2025 will be next Tuesday/Wednesday/Thursday.

The detailed program is on legend2025.sciencesconf.org?lang=en

If you are not attending the conference you can still follow the presentations online (Zoom links are on the website).

05.12.2025 11:45 โ€” ๐Ÿ‘ 2    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
STORIES: learning cell fate landscapes from spatial transcriptomics using optimal transport - Nature Methods By learning a differentiation potential using an optimal transport-based approach, STORIES models and infers cell fate trajectories using spatiotemporal omics data.

Happy to share STORIES out now on Nature Methods

STORIES learns cell fate landscapes from spatial tramscripromics data profiled at several time points, thus allowing prediction of future cell states.

Led by Geert-Jan Huizing and Jules Samaran

www.nature.com/articles/s41...

@pasteur.fr

04.11.2025 07:25 โ€” ๐Ÿ‘ 45    ๐Ÿ” 13    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Very happy about this work on phylogenetic neural inference, led by @lblassel.bsky.social :)

17.10.2025 05:27 โ€” ๐Ÿ‘ 7    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
legend2025 : Machine Learning for Evolutionary Genomics Data - Sciencesconf.org Evolutionary genomics and population genetics investigate patterns of genetic diversity between species or between populations within a species and play a fundamental role in many aspects, from theoretical facets of evolution to practical ones, such as conservation genetics and biomedical sciences.

October 17 is your last chance to register for the 2nd conference on Machine Learning for Evolutionary Genomics Data (Dec 8-12), in the French Alps at legend2025.sciencesconf.org
The conference talks are online at legend2025.sciencesconf.org/data/book_le...

13.10.2025 11:24 โ€” ๐Ÿ‘ 3    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
โ€˜Google for DNAโ€™ brings order to biologyโ€™s big data MetaGraph compresses vast data archives into a search engine for scientists, opening up new frontiers of biological discovery.

Ca n'est pas si souvent, un article publiรฉ dans Nature met ma communautรฉ ร  l'honneur (la bioinformatique des sรฉquences). Je vous raconte ?
www.nature.com/articles/d41...

09.10.2025 15:00 โ€” ๐Ÿ‘ 28    ๐Ÿ” 14    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Exploring the space of self-reproducing RNA using generative models, Martin Weigt

Exploring the archaic introgression landscape of admixed populations through
joint ancestry inference, Jazeps Medina Tretmanis [et al.]

Predicting natural variation in the yeast phenotypic landscape with machine
learning, Sakshi Khaiwal [et al.]

Phylodynamic modeling with unsupervised Bayesian neural networks, Marino
Gabriele [et al.]

Likelihood-free inference of phylogenetic tree posterior distributions, Luc Blas-
sel [et al.]

Generative continuous time model reveals epistatic signatures in protein evolu-
tion, Barrat-Charlaix Pierre

Neural posterior estimation for high-dimensional genomic data from complex pop-
ulation genetic models, Jiseon Min [et al.]

A differentiable model for detecting diversifying selection directly from alignments
in large-scale bacterial datasets, Leonie Lorenz [et al.]

Detecting interspecific positive selection using transformers, Charlotte West [et al.]

Predicting Multiple Sequence Alignment Uncertainty via Machine Learning, Lucia
Martin-Fernandez [et al.]

Graph Neural Networks for Likelihood-Free Inference in Diversification Mod-
els, Amรฉlie Leroy [et al.]

Popformer: learning general signatures of genetic variation and natural selection
with a self-supervised transformer, Leon Zong [et al.]

Exploring the space of self-reproducing RNA using generative models, Martin Weigt Exploring the archaic introgression landscape of admixed populations through joint ancestry inference, Jazeps Medina Tretmanis [et al.] Predicting natural variation in the yeast phenotypic landscape with machine learning, Sakshi Khaiwal [et al.] Phylodynamic modeling with unsupervised Bayesian neural networks, Marino Gabriele [et al.] Likelihood-free inference of phylogenetic tree posterior distributions, Luc Blas- sel [et al.] Generative continuous time model reveals epistatic signatures in protein evolu- tion, Barrat-Charlaix Pierre Neural posterior estimation for high-dimensional genomic data from complex pop- ulation genetic models, Jiseon Min [et al.] A differentiable model for detecting diversifying selection directly from alignments in large-scale bacterial datasets, Leonie Lorenz [et al.] Detecting interspecific positive selection using transformers, Charlotte West [et al.] Predicting Multiple Sequence Alignment Uncertainty via Machine Learning, Lucia Martin-Fernandez [et al.] Graph Neural Networks for Likelihood-Free Inference in Diversification Mod- els, Amรฉlie Leroy [et al.] Popformer: learning general signatures of genetic variation and natural selection with a self-supervised transformer, Leon Zong [et al.]

PRIVET: PRIVacy metric based on Extreme value Theory, Antoine Szatkownik [et
al.]

Generative models for inferring the evolutionary history of the malaria vector
Anopheles gambiae, Amelia Eneli [et al.]

Language Models Outperform Supervised-Only Approaches for Conserved Ele-
ment Comprehension, Eyes Robson [et al.]

Identification and Classification of Orphan Genes, Spurious Orphan Genes, and
Conserved Genes from the human microbiome, Chen Chen

Neural Simulation-based inference of demography and selection, Francisco De
Borja Campuzano Jimรฉnez [et al.]

Species Identification and aDNA Read Mapping Using k-mer Embeddings, Filip
Thor [et al.]

Contrastive Learning for Population Structure and Trait Prediction, Filip Thor [et
al.]

Protein and genomic language models chart a vast landscape of antiphage de-
fenses, Mordret Ernest

The Phylogenomics and Sparse Learning of Trait Innovations, Gaurav Diwan [et
al.]

PRIVET: PRIVacy metric based on Extreme value Theory, Antoine Szatkownik [et al.] Generative models for inferring the evolutionary history of the malaria vector Anopheles gambiae, Amelia Eneli [et al.] Language Models Outperform Supervised-Only Approaches for Conserved Ele- ment Comprehension, Eyes Robson [et al.] Identification and Classification of Orphan Genes, Spurious Orphan Genes, and Conserved Genes from the human microbiome, Chen Chen Neural Simulation-based inference of demography and selection, Francisco De Borja Campuzano Jimรฉnez [et al.] Species Identification and aDNA Read Mapping Using k-mer Embeddings, Filip Thor [et al.] Contrastive Learning for Population Structure and Trait Prediction, Filip Thor [et al.] Protein and genomic language models chart a vast landscape of antiphage de- fenses, Mordret Ernest The Phylogenomics and Sparse Learning of Trait Innovations, Gaurav Diwan [et al.]

The decisions for LEGEND are out: legend2025.sciencesconf.org/data/book_le...

I'm really looking forward to hearing these 21 exciting presentations (and additional 30 posters) next December.

If you want to attend too, registration is open until October 17th through legend2025.sciencesconf.org

08.10.2025 11:04 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Only a few hours left to submit your abstract for a talk at the Machine Learning in Evolutionary Genomics conference in December in Aussois in the French alps!

22.09.2025 14:46 โ€” ๐Ÿ‘ 2    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
MLCB - Schedule The in-person component will be held at the New York Genome Center, 101 6th Ave, New York, NY 10013.

#MLCB2025 is tomorrow & Thursday with a fantastic lineup of keynotes & contributed talks www.mlcb.org/schedule. We'll be livestreaming through our YouTube channel www.youtube.com/@mlcbconf. Thanks to www.corteva.com, instadeep.com, the Simons Center at CSHL & NYGC for generous support!

10.09.2025 00:16 โ€” ๐Ÿ‘ 9    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image

Achievement unlocked: defend your habilitation thesis on the same day than your partner. That was quite a science + celebration day, thanks to all involved ๐Ÿ’™โœจ

05.09.2025 16:51 โ€” ๐Ÿ‘ 45    ๐Ÿ” 7    ๐Ÿ’ฌ 6    ๐Ÿ“Œ 0
Post image

๐ŸŒŽ๐Ÿ‘ฉโ€๐Ÿ”ฌ For 15+ years biology has accumulated petabytes (million gigabytes) of๐ŸงฌDNA sequencing data๐Ÿงฌ from the far reaches of our planet.๐Ÿฆ ๐Ÿ„๐ŸŒต

Logan now democratizes efficient access to the worldโ€™s most comprehensive genetics dataset. Free and open.

doi.org/10.1101/2024...

03.09.2025 08:39 โ€” ๐Ÿ‘ 218    ๐Ÿ” 118    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 16

The call for abstract for LEGEND is now open:
legend2025.sciencesconf.org

It will close on September 22nd (oral presentations) and October 1st (posters).

Send us your best work on Machine Learning for Evolutionary Genomics and come discuss it with us in the French Alps next December!

02.09.2025 06:51 โ€” ๐Ÿ‘ 3    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

#TalentCNRS ๐Ÿฅ‰| Flora Jay, entre gรฉnomes synthรฉtiques et rรฉcits รฉvolutifs, reรงoit la mรฉdaille de bronze du CNRS.
โžก๏ธ www.ins2i.cnrs.fr/fr/cnrsinfo/...
๐Ÿค @lisnlab.bsky.social @cnrs-paris-saclay.bsky.social

21.07.2025 12:01 โ€” ๐Ÿ‘ 5    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Preprint alert! ๐ŸฆŒ
Our new abundance index, REINDEER2, is out!
It's cheap to build and update, offers tunable abundance precision at kmer level, and delivers very high query throughput.

Short thread!

www.biorxiv.org/content/10.1...

github.com/Yohan-Hernan...

19.06.2025 09:12 โ€” ๐Ÿ‘ 23    ๐Ÿ” 13    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 4

Registration is now open!

The 580โ‚ฌ include housing and all meals.

We will close on October 17th or when reaching 80 participants.

18.06.2025 07:22 โ€” ๐Ÿ‘ 4    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Home - ProbGen 2026 Your Site Description

The 2026 Probabilistic Modeling in Genomics (ProbGen) meeting will be held at UC Berkeley, March 25-28, 2026. We have an amazing list of keynote speakers and session chairs:
probgen2026.github.io

Please help spread the news.

06.06.2025 17:52 โ€” ๐Ÿ‘ 69    ๐Ÿ” 36    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Merci ร  @cnrs-rhoneauvergne.bsky.social et @astropierre.com pour cette interview sur mes travaux en IA pour la gรฉnomique รฉvolutive!

02.06.2025 10:53 โ€” ๐Ÿ‘ 15    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

There is a nice example in @stephaneguindon.bsky.social's Ph.D thesis p.55

theses.hal.science/tel-00843343...

03.04.2025 12:29 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The design matrix of the regression should be nPairs x nBranches, and have a 1 at coordinates (i,j) such that branch j belongs to the path defined by pair i in the tree, 0 otherwise.

03.04.2025 12:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I think one way to do this is the least squares method, which gives you the set of branch lengths on your given topology such that the sum of squared differences between your given distances and the distances on the tree are minimal.

03.04.2025 12:23 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Phyloformer is finally published in MBE! ๐ŸŽ‰

academic.oup.com/mbe/advance-...

The thread below provides a summary of our neural network for likelihood-free phylogenetic reconstruction.

12.03.2025 11:49 โ€” ๐Ÿ‘ 16    ๐Ÿ” 11    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
People having breakfast in front of the Alps in the Centre Paul Langevin.

People having breakfast in front of the Alps in the Centre Paul Langevin.

Come hear about the latest advances in the field and discuss your own work at Centre Paul Langevin in beautiful Aussois.

24.02.2025 08:58 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
A headshot of Dr Burak Yelmen.

A headshot of Dr Burak Yelmen.

Burak Yelmen from the University of Tartu will give a keynote presentation on "A perspective on generative neural networks in genomics with applications in synthetic data generation".

24.02.2025 08:58 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
A headshot of Dr Claudia Solis-Lemus.

A headshot of Dr Claudia Solis-Lemus.

Claudia Solรญs-Lemus from the University of Wisconsin-Madison will give a keynote presentation on "The good, the bad and the ugly of deep learning in phylogenetic inference".

24.02.2025 08:58 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
A headshot of Dr Anne-Florence Bitbol.

A headshot of Dr Anne-Florence Bitbol.

Anne-Florence Bitbol from EPFL will give a keynote presentation on "Coevolution-aware language models".

24.02.2025 08:58 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
A legendary being holds a phylogenetic tree in the palm of their hand, with snowy mountains in the background.

A legendary being holds a phylogenetic tree in the palm of their hand, with snowy mountains in the background.

The next LEGEND conference on machine learning for evolutionary genomics will be in Aussois (French Alps) between December 8th and 12th.

Mark your calendars and make sure your best work is ready next September when the call for abstracts opens ๐Ÿ™‚

legend2025.sciencesconf.org

24.02.2025 08:58 โ€” ๐Ÿ‘ 10    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3
Preview
MUSET: Set of utilities for constructing abundance unitig matrices from sequencing data AbstractSummary. MUSET is a novel set of utilities designed to efficiently construct abundance unitig matrices from sequencing data. Unitig matrices extend

๐Ÿงฌ Excited to share our latest work, MUSET ๐ŸŒญ, a new tool for creating abundance unitig matrices from sequencing data. It was published yesterday in Oxford Bioinformatics if you want to have a look๐Ÿ‘€ :

academic.oup.com/bioinformati...

Let's break it down:

04.02.2025 14:46 โ€” ๐Ÿ‘ 20    ๐Ÿ” 13    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

My book is (at last) out, just in time for Christmas!
A blog post to celebrate and present it: francisbach.com/my-book-is-o...

21.12.2024 15:23 โ€” ๐Ÿ‘ 142    ๐Ÿ” 35    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 3

Ok, I tried to create my own list of people working on developing statistical or machine learning models applied to omics data. I am sure I missed a lot of cool people. If you'd like to be added, let me know. #Stats #ML #Omics
go.bsky.app/73rcuJn

24.11.2024 07:50 โ€” ๐Ÿ‘ 95    ๐Ÿ” 36    ๐Ÿ’ฌ 38    ๐Ÿ“Œ 4

Hi Raphael, thanks for putting this together, I'll be happy to be in the list if you think it makes sense :)

26.11.2024 08:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
A sketch summarizing the entire Phyloformer process.

A sketch summarizing the entire Phyloformer process.

All this work was done by Luca Nesterenko and
@lblassel.bsky.social , assisted by P. Veber, Bastien Boussau
and myself.

The code and data are available at github.com/lucanest/Phy...

Please share if you find this interesting, and we welcome your feedback :)

24.06.2024 08:35 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@laurentjacob is following 20 prominent accounts