๐ Presenting at #ICML2025 tomorrow!
Come and explore how representational similarities behave across datasets :)
๐
Thu Jul 17, 11 AM-1:30 PM PDT
๐ East Exhibition Hall A-B #E-2510
Huge thanks to @lorenzlinhardt.bsky.social, Marco Morik, Jonas Dippel, Simon Kornblith, and @lukasmut.bsky.social!
16.07.2025 21:07 โ ๐ 9 ๐ 3 ๐ฌ 0 ๐ 0
Objective drives the consistency of representational similarity across datasets
The Platonic Representation Hypothesis claims that recent foundation models are converging to a shared representation space as a function of their downstream task performance, irrespective of the obje...
I am deeply grateful to @lorenzlinhardt.bsky.social, Marco Morik, Jonas Dippel, Simon Kornblith, and @lukasmut.bsky.social for their great work and support in this project! We also thank our collaborators, @bifold.berlin and HFA 7/7
๐Paper: arxiv.org/abs/2411.05561
๐ปCode: github.com/lciernik/sim...
06.06.2025 14:14 โ ๐ 6 ๐ 3 ๐ฌ 0 ๐ 0
2nd key insight: The link between model similarity & behavior varies by dataset. Single-domain sets show strong correlations, while some multi-domain ones have high-performing, dissimilar models. Thus, the Platonic Representation Hypothesis may depend on the dataset's nature. ๐งต 6/7
06.06.2025 14:14 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Key finding: Training objective is a crucial factor for similarity consistency! SSL models show remarkably consistent representations across stimulus sets compared to image-text and supervised models, which show high variance in their consistency due to dataset dependence. ๐งต 5/7
06.06.2025 14:14 โ ๐ 7 ๐ 0 ๐ฌ 1 ๐ 1
Thus, we suggest a framework to systematically study if relative representational similarities between models remain consistent. We measure similarities between sets of models with different traits and their correlation across dataset pairs to assess stability across stimuli. ๐งต4/7
06.06.2025 14:14 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Representational similarity using linear CKA. Left to right: natural multi- and single-domain, and specialized datasets, followed by mean and standard deviation across all datasets. Models (rows and columns) are ordered by a hierarchical clustering of the mean matrix. Yellow and white boxes highlight regions with more stable similarity patterns across datasets, corresponding to some image-text (yellow) and self-supervised model pairs (white), while cyan boxes show higher variability for mainly supervised model pairs.
First finding: Representational similarities do not transfer directly across datasets, showing high variability across datasets, such as different ranges and patterns. ๐งต 3/7
06.06.2025 14:14 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
The Platonic Rep. Hypothesis @phillipisola.bsky.social et al. suggests foundation models converge to a shared representation space. Yet, most studies consider single datasets when measuring representational similarity. Thus, we were wondering: Does this convergence hold more broadly? ๐งต 2/7
06.06.2025 14:14 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
If two models are more similar to each other than a third on ImageNet, will this hold for medical/satellite images?
Our #icml2025 paper analyses how vision model similarities generalize across datasets, the factors that influence them, and their link to downstream task behavior. ๐งต1/7
06.06.2025 14:14 โ ๐ 24 ๐ 4 ๐ฌ 1 ๐ 3
๐ History repeats itself: We investigated how early modern communities have embraced scholarly advancements, reshaping scientific views and exploring scientific roots amidst a changing world.
www.science.org/doi/10.1126/...
@mpiwg.bsky.social @tuberlin.bsky.social @bifold.berlin @science.org
27.12.2024 09:20 โ ๐ 15 ๐ 3 ๐ฌ 1 ๐ 2
๐ขIf you are interested in single-cell foundation models (scFMs), stop by our poster (West 109) at the AiDrugX Workshop at Neurips 2024. We will present CancerFoundation, a scFM tailored for studying cancer biology๐งฌ.
Preprint: biorxiv.org/content/10.1...
15.12.2024 19:38 โ ๐ 7 ๐ 2 ๐ฌ 1 ๐ 1
๐ New preprint from our lab, Ekaterina Krymova, and @fabiantheis.bsky.social: UniversalEPI, an attention-based method to predict enhancer-promoter interactions from DNA sequence and ATAC-seq๐ Read the full preprint: www.biorxiv.org/content/10.1... by @aayushgrover.bsky.social, L. Zhang & I.L. Ibarra
26.11.2024 13:40 โ ๐ 51 ๐ 13 ๐ฌ 1 ๐ 0
Hi ๐ I'm a postdoc in the #Neuroimmunology and #Imaging group at the @dzne.science Bonn ๐งช๐ฌ Passionate about #ComputationalNeuroscience ๐ง ๐ป and #NeuralModeling ๐งฎ
๐ fabriziomusacchio.com
๐จโ๐ป github.com/FabrizioMusacchio
๐ sigmoid.social/@pixeltracker
PhD at TU Berlin ๐ฉ๐ช | Master's at ETH Zurich ๐จ๐ญ
physician-scientist, author, editor
https://www.scripps.edu/faculty/topol/
Ground Truths https://erictopol.substack.com
SUPER AGERS https://www.simonandschuster.com/books/Super-Agers/Eric-Topol/9781668067666
PhD Student @ ML Group TU Berlin, BIFOLD
Elihu Professor of Biostatistics @yalesph.bsky.social--Cancer, Infectious disease, Evolutionary biology, Fungi, sometimes running
PhD Student at Theis and Gagneur lab @TU Munich - Interested in ML, gene regulation and epigenetics ๐งฌ. Previously Cambridge University and Heidelberg University. she/her
PhD Student in Computational Cancer Genomics Group at ETH Zurich
PhD student in NLP at ETH Zurich.
anejsvete.github.io
PhD at Schapiro lab in Heidelberg | Spatial biology | Bioinformatics | Tissue organization
Nature Portfolioโs high-quality products and services across the life, physical, chemical and applied sciences is dedicated to serving the scientific community.
Cutting-edge research, news, commentary, and visuals from the Science family of journals. https://www.science.org
PhD student in Interpretable Machine Learning at TU Berlin & BIFOLD
Neuroscience PhD student, Cambridge UK. Confused but excited.
"Everything around me was somebody's lifework"
๐ https://rory.bio
๐จ https://flywhl.dev
๐ง https://compmotifs.com/
Senior Editor @Nature for cancer and cell cycle. Views my own
PhD Student @ https://bifold.berlin/, TU Berlin.
Computational Pathology, XAI, Multimodal ML, Representation Learning.
Github: https://github.com/bifold-pathomics
PhD Student @ TU Berlin @BIFOLD Berlin | ML for molecular simulation
PhD student at @bifold.berlin, Machine Learning Group, TU Berlin.
Automatic Differentiation, Explainable AI and #JuliaLang.
Open source person: adrianhill.de/projects
Senior Researcher Machine Learning at BIFOLD | TU Berlin ๐ฉ๐ช
Prev at IPAM | UCLA | BCCN
Interpretability | XAI | NLP & Humanities | ML for Science
Scientist and music nerd. All things machine learning and genomics for gene regulation.
Faculty at Max Delbruck Centrum and Humboldt University Berlin
@mdc-berlin.bsky.social
http://www.mdc-berlin.de/ohler
Baritone at www.byrdland.org
Assistant Professor at TU Wien
Machine Learning & Security