Laure Ciernik's Avatar

Laure Ciernik

@lciernik.bsky.social

PhD @ ML Group TU Berlin, BIFOLD, HFA, @ellis.eu | BSc & MSc @ethzurich.bsky.social

115 Followers  |  111 Following  |  8 Posts  |  Joined: 23.11.2024  |  1.702

Latest posts by lciernik.bsky.social on Bluesky

๐ŸŽ‰ Presenting at #ICML2025 tomorrow!
Come and explore how representational similarities behave across datasets :)

๐Ÿ“… Thu Jul 17, 11 AM-1:30 PM PDT
๐Ÿ“ East Exhibition Hall A-B #E-2510

Huge thanks to @lorenzlinhardt.bsky.social, Marco Morik, Jonas Dippel, Simon Kornblith, and @lukasmut.bsky.social!

16.07.2025 21:07 โ€” ๐Ÿ‘ 9    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Objective drives the consistency of representational similarity across datasets The Platonic Representation Hypothesis claims that recent foundation models are converging to a shared representation space as a function of their downstream task performance, irrespective of the obje...

I am deeply grateful to @lorenzlinhardt.bsky.social, Marco Morik, Jonas Dippel, Simon Kornblith, and @lukasmut.bsky.social for their great work and support in this project! We also thank our collaborators, @bifold.berlin and HFA 7/7
๐Ÿ“„Paper: arxiv.org/abs/2411.05561
๐Ÿ’ปCode: github.com/lciernik/sim...

06.06.2025 14:14 โ€” ๐Ÿ‘ 6    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

2nd key insight: The link between model similarity & behavior varies by dataset. Single-domain sets show strong correlations, while some multi-domain ones have high-performing, dissimilar models. Thus, the Platonic Representation Hypothesis may depend on the dataset's nature. ๐Ÿงต 6/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Key finding: Training objective is a crucial factor for similarity consistency! SSL models show remarkably consistent representations across stimulus sets compared to image-text and supervised models, which show high variance in their consistency due to dataset dependence. ๐Ÿงต 5/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

Thus, we suggest a framework to systematically study if relative representational similarities between models remain consistent. We measure similarities between sets of models with different traits and their correlation across dataset pairs to assess stability across stimuli. ๐Ÿงต4/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Representational similarity using linear CKA. Left to right: natural multi- and single-domain, and specialized datasets, followed by mean and standard deviation across all datasets. Models (rows and columns) are ordered by a hierarchical clustering of the mean matrix. Yellow and white boxes highlight regions with more stable similarity patterns across datasets, corresponding to some image-text (yellow) and self-supervised model pairs (white), while cyan boxes show higher variability for mainly supervised model pairs.

Representational similarity using linear CKA. Left to right: natural multi- and single-domain, and specialized datasets, followed by mean and standard deviation across all datasets. Models (rows and columns) are ordered by a hierarchical clustering of the mean matrix. Yellow and white boxes highlight regions with more stable similarity patterns across datasets, corresponding to some image-text (yellow) and self-supervised model pairs (white), while cyan boxes show higher variability for mainly supervised model pairs.

First finding: Representational similarities do not transfer directly across datasets, showing high variability across datasets, such as different ranges and patterns. ๐Ÿงต 3/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

The Platonic Rep. Hypothesis @phillipisola.bsky.social et al. suggests foundation models converge to a shared representation space. Yet, most studies consider single datasets when measuring representational similarity. Thus, we were wondering: Does this convergence hold more broadly? ๐Ÿงต 2/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

If two models are more similar to each other than a third on ImageNet, will this hold for medical/satellite images?

Our #icml2025 paper analyses how vision model similarities generalize across datasets, the factors that influence them, and their link to downstream task behavior. ๐Ÿงต1/7

06.06.2025 14:14 โ€” ๐Ÿ‘ 24    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3
Post image

๐Ÿ“œ History repeats itself: We investigated how early modern communities have embraced scholarly advancements, reshaping scientific views and exploring scientific roots amidst a changing world.

www.science.org/doi/10.1126/...

@mpiwg.bsky.social @tuberlin.bsky.social @bifold.berlin @science.org

27.12.2024 09:20 โ€” ๐Ÿ‘ 15    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2
Post image

๐Ÿ“ขIf you are interested in single-cell foundation models (scFMs), stop by our poster (West 109) at the AiDrugX Workshop at Neurips 2024. We will present CancerFoundation, a scFM tailored for studying cancer biology๐Ÿงฌ.
Preprint: biorxiv.org/content/10.1...

15.12.2024 19:38 โ€” ๐Ÿ‘ 7    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

๐Ÿš€ New preprint from our lab, Ekaterina Krymova, and @fabiantheis.bsky.social: UniversalEPI, an attention-based method to predict enhancer-promoter interactions from DNA sequence and ATAC-seq๐ŸŒŸ Read the full preprint: www.biorxiv.org/content/10.1... by @aayushgrover.bsky.social, L. Zhang & I.L. Ibarra

26.11.2024 13:40 โ€” ๐Ÿ‘ 51    ๐Ÿ” 13    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Non-White scientists appear on fewer editorial boards, spend more time under review, and receive fewer citations | PNAS Disparities continue to pose major challenges in various aspects of science. One such aspect is editorial board composition, which has been shown t...

"Non-White scientists appear on fewer editorial boards, spend more time under review, and receive fewer citations"

www.pnas.org/doi/abs/10.1...

25.11.2024 23:46 โ€” ๐Ÿ‘ 661    ๐Ÿ” 302    ๐Ÿ’ฌ 21    ๐Ÿ“Œ 35

@lciernik is following 20 prominent accounts