Scaling down protein language modeling with MSA Pairformer
Recent efforts in protein language modeling have focused on scaling single-sequence models and their training data, requiring vast compute resources that limit accessibility. Although models that use ...
Excited to share work with
Zhidian Zhang, @milot.bsky.social, @martinsteinegger.bsky.social, and @sokrypton.org
biorxiv.org/content/10.1...
TLDR: We introduce MSA Pairformer, a 111M parameter protein language model that challenges the scaling paradigm in self-supervised protein language modelingπ§΅
05.08.2025 06:29 β π 61 π 29 π¬ 1 π 1
Causal clarity in statistical software
Imagine running a simple regression in any statistical software of choiceβbut this time, you only get a point estimate of the regression coefficient. There
Should statistical software that estimates causal effects also tell you the causal assumptions under which that estimate can be interpreted as causal?
I don't know but my PhD student Maurice Korf has some thoughts (and software) to get the conversation going:
academic.oup.com/ije/article/...
29.07.2025 07:47 β π 22 π 3 π¬ 1 π 0
The biggest blemish of mine is not being able to speak Japanese despite all that time watching animes.
28.07.2025 01:46 β π 3 π 0 π¬ 0 π 0
commutes should count toward work hours
28.07.2025 01:35 β π 24 π 2 π¬ 3 π 1
Always surprised by such ambitious titles, and then again by the use of clever data dug up from historical records.
I read this when it was a working paper but still...
27.07.2025 12:29 β π 2 π 0 π¬ 0 π 0
One takeaway is that same-sex sexual behavior is not a special trait that needs explaining. Rather, it can follow from reasonable mating strategies under imperfect information. Indeed, attempting to mate *only* with the opposite sex is a derived trait that only arises under some conditions.
27.07.2025 12:05 β π 3 π 1 π¬ 0 π 0
Home - ProbGen 2026
Your Site Description
The 2026 Probabilistic Modeling in Genomics (ProbGen) meeting will be held at UC Berkeley, March 25-28, 2026. We have an amazing list of keynote speakers and session chairs:
probgen2026.github.io
Please help spread the news.
06.06.2025 17:52 β π 63 π 35 π¬ 2 π 0
Looking south from the boardwalk around Lake Harriet. In view are 6 sailboats and a variety of brave folks about to board these vessels.
Nature is my psychiatrist. Walking down a tree lined cement path going toward the Mississippi River near Lake St in Minneapolis.
Looking east under the Lake-Marshall bridge. In view are the bridge pilings with graffiti and haze in the air.
Image of the front wheel of a moving bicycle casting a shadow of the wheel onto the gravel road.
Views from the urban hellscape they call Minneapolis. River, lake, gravel, he, they, she, radio, river otter, joy.
26.07.2025 15:37 β π 201 π 12 π¬ 9 π 3
In a new Perspective article, Josh Morgan discusses how progress in cell biology is hindered by significance testing and the need for a shift to effect size estimation. rupress.org/jcb/article/...
#Technology #Reproducibility #CellCycle #CellDivision #Statistics
23.07.2025 17:15 β π 37 π 16 π¬ 3 π 4
Finally a biblic reference in which I understand
25.07.2025 21:50 β π 2 π 1 π¬ 0 π 0
Legend says the ancient Babylonians once tried to sequence and annotate God's own genome, and for their ambition and hubris they were forever cursed to have different annotation formats and standards so they could never do genomics with ease again.
25.07.2025 21:38 β π 147 π 35 π¬ 5 π 3
New preprint: SBI with foundation models!
Tired of training or tuning your inference network, or waiting for your simulations to finish? Our method NPE-PF can help: It provides training-free simulation-based inference, achieving competitive performance with orders of magnitude fewer simulations! β‘οΈ
23.07.2025 14:27 β π 22 π 9 π¬ 1 π 2
Super excited to see this out. What started as some math in a grant in 2020, to a student deciding to take this on in 2022, to published in 2025.
These things can take time and patience is key!
21.07.2025 18:54 β π 57 π 17 π¬ 3 π 2
rip ozzy
23.07.2025 01:04 β π 4 π 0 π¬ 0 π 0
If I were to write a program, I would just do Poisson regression with posthoc corrected variance like glmGamPoi because it's cheaper and doesn't lose power. Theoretically it should lose some amount of power relative to NB but I've never seem such a case in practice.
22.07.2025 14:51 β π 0 π 0 π¬ 0 π 0
This is technically a quasi-likelihood test and not a negative binomial MLE. Nevertheless, for many reasons, people keeps mistaking it as a NB regression. In the literature, some people do Poisson reg. with posthoc correction and some do NB MLE under the same name causing confusion.
22.07.2025 14:50 β π 0 π 0 π¬ 1 π 0
This is actually what glmGamPoi is doing. It fits a Poisson regression. It estimates the dispersion parameter afterwards. Unlike NB regression, the dispersion and the regression coefficients are not jointly estimated.
22.07.2025 14:45 β π 0 π 0 π¬ 1 π 0
One thing that is overlooked in omics literature is that Poisson regression is correct not only for Poisson distributed data, but also for all sorts of count data as well as positive continuous outcomes (e.g Gamma distribution).
You only need to correct the standard error properly.
22.07.2025 14:42 β π 3 π 2 π¬ 1 π 0
I also hate both of the publishers you've mentioned but most of the papers I read from post-90/00s always have a latex-typesetted preprint on the web, so the publisher doesn't matter. The problem mostly happens with older papers with both old notations and unfamiliar fonts pre-80s :(
22.07.2025 10:39 β π 1 π 0 π¬ 1 π 0
I guess this is a common theoretical econ problem with a caveat that I don't know much about ecoh.
22.07.2025 04:38 β π 1 π 0 π¬ 0 π 0
Is there a model that compares random grant assignments versus current merit-based award schemes? There must be a form of competition btw productivity loss due to sending money to less competent (whatever that means) ppl vs time wasted on writing grants/fiddling with administrative chores.
22.07.2025 04:10 β π 2 π 0 π¬ 1 π 0
Am I the only person who sturbbonly seek for more recent references of the same content to avoid old papers with old typesetting?
22.07.2025 02:16 β π 2 π 0 π¬ 1 π 0
okay I'm doing slim vibe coding
21.07.2025 10:42 β π 1 π 0 π¬ 0 π 0
Currently reading Stiglerβs History of Statistics again (and loving it). Anyone know of a similar book for 1900-1960 period. Efron & Hastie, is great but itβs a different kind of book. Apparwntly Lehmann wrote specifically about Neyman v Fisher but Iβm looking for other options too #stats #statsky
21.07.2025 01:30 β π 10 π 5 π¬ 3 π 1
Assistant Prof at D-BSSE, ETH Zurich, studying genetics of psychiatric disorders
www.nacailab.com
Or just βLiβ |
Assist. Prof. @ Plant Bio Michigan State U. |
Also post data visualization |
Lab: https://cxli233.github.io/cxLi_lab/ |
GitHub: https://github.com/cxli233
Assistant Professor in the Department of Human Genetics at Emory University working at the nexus of human genetics, computation, and statistics
weinstocklab.org
PhD Student, MRC Biostatistics Unit
University of Cambridge
Gates Cambridge Scholar
Bioinformatics, genetics, single-cell, statistics
Australian π¦πΊ
Ecologist/ornithologist/naturalist. Not necessarily in that order.
πΏπ¦She/Herπͺ²πΊοΈ
Researcher in urban ecology & conservation.
Follow for map mumblings, urban nature-based solutions & natural history nerdery.
Check the alt text!
πWhadjuk Noongar Country, WA
Applied ecologist, conservation science, structured decision-making, spatial modelling, waterways research, she/her
#StayGrounded #NoFlyForWork #NoGenerativeAI
Proud to #PayTheRent
environmental policy, governance, finance β’ Senior Lecturer UNSW Canberra, Australia π¦ β’ she/her β’ likes dogs β’ often grumpy, mostly about climate change, biodiversity loss and injustice π΅πΈ https://www.unsw.edu.au/staff/megan-evans
Scientist, ecologist, Antarctica, impacts of climate change & writer of musicals - Antarctica, Beneath the Storm (out soonish)
#BluePlanetNeedsπOurLove Lutruwita/ Tasmania.
Graphic art on Redbubble: SundogProducts
Just your friendly neighborhood wildlife scientist on a mission to save nature in cities.
πPresenter, writer, and insufferable noticer of nature
πExpert in biodiversity conservation + urban ecology
πOccasionally witty
Mum β’ Scientist β’ Ecologist β’ Science Communicator β’ Leader, UniMelb Science Communication Teaching β’ Co-host, Let's Talk SciComm podcast @letstalkscicomm.bsky.social β’ Dr Jen on 3RRR π» β’ Author of 'Why Am I Like This?' β’ MC β’ Writer β’ Runner πββοΈ β’ she/her
Dr Judy Dunlop, a passionate ecologist who helps threatened Australian #WildOz fauna π¦ππ―π¦π¦π. #QuollPatrol #TCZ. Likes deserts better than desserts. Posts my own.
ARC Industry Fellow @QUT & @BushHeritageAus. Acoustic ecology, threatened species, conservation π¦π πΈπ΅
Lecturer, nature conservation #consocsci #humandimensions
Invertebrate zoologist; writer; artist; museum fan; bird watcher; living in the tropics; she/her. Sometimes in Guardian Australia. Otherwise https://snailseyeview.medium.com/
Herbivore π¦ Bird Observer π¦ Educator Sydney Zoo π Rainforest Flaneur π Author π Rational Humanist π©π»βπ¬ She/Her Birding tours and nature discovery walks in the Sydney region: http://aussiewild.com.au
Working to create sustainable cities ποΈ
& ecosystems full of invertebrates π
Inordinately fond of spiders π·οΈ
Research Fellow at ECU π±
Research Fellow at ICON Science, RMIT University working on Biodiversity Sensitive Urban Design π³π π¦ | She/Her | linktr.ee/jacintahumphrey
#UrbanEcology #WomeninSTEM
Ecologist. Mum. Writer.
Senior Lecturer Uni New England (Australia).
Editor in Chief: Insect Conservation and Diversity.
Anaiwan Country. My words. She/her.
https://ecologyisnotadirtyword.com
https://saundersecologylab.com/