Btw, if you hate your PhD you can just leave. You can transfer (I did), you can drop out. More people probably should.
28.03.2025 01:51
@zhang-ouyang.bsky.social
ML + Bio (prev CV) http://jozhang97.github.io
First they came for Columbia...
10.03.2025 01:43
ooh also very curious
27.11.2024 14:06
Could you add me to this list?
21.11.2024 15:51
Implementation is extremely simple. If you are using ESM2, you're just one line of code away from upgrading to ISM's enhanced capabilities. (7/7)
paper: www.biorxiv.org/content/10.1...
github: github.com/jozhang97/ISM
huggingface: huggingface.co/jozhang97/is...
To conclude, ISM takes sequence-only input but produces structurally-rich representations. After all, the amino acid sequence is the only genetic information necessary for protein folding. Our structural loss better enables transformers to learn sequence-structure mapping. (6/7)
13.11.2024 00:40
On structural benchmarks, we found that our model's structural representations outperform those from existing sequence models and even match performance with representations from models that take structure and sequence as input. (5/7)
13.11.2024 00:40
ISM's secret sauce is a microenvironment-based autoencoder. The all-atom autoencoder learns to embed the tertiary structure surrounding a residue into a structure token. We distill these per-residue tokens (and MutRank tokens) into ESM2. (4/7)
13.11.2024 00:40
Masked language modeling enables ESM2 to learn rich evolutionary features which capture a view of the structural landscape. However, it often underperforms structure models on downstream tasks.
We fine-tune ESM2 to predict representations from structure models. (3/7)
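A rough sketch of what per-residue token distillation could look like, for readers who want something concrete. The thread does not say which loss is used, so the cross-entropy objective here is an assumption, and the names `log_softmax`, `distillation_loss`, `student_logits`, and `teacher_tokens` are all hypothetical. The idea: the sequence model (the student) emits one logit vector per residue over a structure-token vocabulary, and the structure autoencoder (the teacher) supplies one target token id per residue.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one residue's logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def distillation_loss(student_logits, teacher_tokens):
    """Mean per-residue cross-entropy (an assumed objective, not confirmed by the thread).

    student_logits: list of per-residue logit lists from the sequence model.
    teacher_tokens: list of per-residue structure-token ids from the teacher.
    """
    if not teacher_tokens or len(student_logits) != len(teacher_tokens):
        raise ValueError("need one target token per residue, and at least one residue")
    total = 0.0
    for logits, token in zip(student_logits, teacher_tokens):
        # Negative log-probability the student assigns to the teacher's token.
        total -= log_softmax(logits)[token]
    return total / len(teacher_tokens)


# Toy example: 3 residues, a hypothetical vocabulary of 4 structure tokens.
student_logits = [
    [2.0, 0.1, -1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [-0.5, 3.0, 0.2, 0.1],
]
teacher_tokens = [0, 2, 1]
loss = distillation_loss(student_logits, teacher_tokens)
```

In a real setup this loss would be minimized with respect to the sequence model's weights, so that sequence-only input yields representations that predict the structure tokens.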
ISM is our latest protein language model which enhances ESM2 with enriched structural representations. (2/7)
13.11.2024 00:40
Are you using ESM2 for your sequence embeddings? Try out ISM, a one-line code change that will incorporate improved structure and sequence information, without a structure as input. (1/7)
13.11.2024 00:40