Stella Frank's Avatar

Stella Frank

@scfrank.bsky.social

Thinking about multimodal representations | Postdoc at UCPH/Pioneer Centre for AI (DK).

85 Followers  |  395 Following  |  19 Posts  |  Joined: 18.10.2024  |  2.0368

Latest posts by scfrank.bsky.social on Bluesky

1/8 🧡 GPT-5's storytelling problems reveal a deeper AI safety issue. I've been testing its creative writing capabilities, and the results are concerning - not just for literature, but for AI development more broadly. 🚨

26.08.2025 15:15 β€” πŸ‘ 62    πŸ” 25    πŸ’¬ 3    πŸ“Œ 12
Post image

My Lab at the University of EdinburghπŸ‡¬πŸ‡§ has funded PhD positions for this cycle!

We study the computational principles of how people learn, reason, and communicate.

It's a new lab, and you will be playing a big role in shaping its culture and foundations.

Spread the words!

17.08.2025 11:52 β€” πŸ‘ 53    πŸ” 20    πŸ’¬ 2    πŸ“Œ 5
Post image

πŸš€ DinoV3 just became the new go-to backbone for geoloc!
It outperforms CLIP-like models (SigLip2, finetuned StreetCLIP)… and that’s shocking 🀯
Why? CLIP models have an innate advantage β€” they literally learn place names + images. DinoV3 doesn’t.

18.08.2025 15:14 β€” πŸ‘ 46    πŸ” 14    πŸ’¬ 1    πŸ“Œ 1
A non-anthropomorphized view of LLMs In many discussions where questions of "alignment" or "AI safety" crop up, I am baffled by seriously intelligent people imbuing almost magic...

I wrote a short rant about what irks me when people anthropomorphize LLMs:

addxorrol.blogspot.com/2025/07/a-no...

07.07.2025 06:37 β€” πŸ‘ 39    πŸ” 14    πŸ’¬ 5    πŸ“Œ 1
Preview
Postdoc in Natural Language Processing

πŸ“’I am hiring a Postdoc to work on post-training methods for low-resource languages. Apply by August 15 employment.ku.dk/faculty/?sho....
Let's talk at #ACL2025NLP in Vienna if you want to know more about the position and life in Denmark.

07.07.2025 12:47 β€” πŸ‘ 23    πŸ” 12    πŸ’¬ 0    πŸ“Œ 0
Preview
Computer-vision research powers surveillance technology - Nature An analysis of research papers and citing patents indicates the extensive ties between computer-vision research and surveillance.

New paper hot off the press www.nature.com/articles/s41...

We analysed over 40,000 computer vision papers from CVPR (the longest standing CV conf) & associated patents tracing pathways from research to application. We found that 90% of papers & 86% of downstream patents power surveillance

1/

25.06.2025 17:29 β€” πŸ‘ 775    πŸ” 456    πŸ’¬ 26    πŸ“Œ 61

"Researching and reflecting on the harms of AI is not itself harm reduction. It may even contribute to rationalizing, normalizing, and enabling harm. Critical reflection without appropriate action is thus quintessentially critical washing."

15.06.2025 14:18 β€” πŸ‘ 37    πŸ” 14    πŸ’¬ 0    πŸ“Œ 3
Jingle-jangle fallacies - Wikipedia

Fallacy of the Day:
Calling two different things by the same name doesn't make them the same (jingle) and calling the same thing by different names doesn't make them different (jangle)
en.wikipedia.org/wiki/Jingle-...

(this is going to be so useful for reviewing)

17.06.2025 15:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter Era Human learning and conceptual representation is grounded in sensorimotor experience, in contrast to state-of-the-art foundation models. In this paper, we investigate how well such large-scale models, ...
13.06.2025 15:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Des presenting at VisCon CVPR 2025

Des presenting at VisCon CVPR 2025

Sad not to be there in person but this work will also be presented at ACL in Vienna 2025 - see you there!

13.06.2025 15:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ“― Best Paper Award at CVPR workshop on Visual concepts for our (@doneata.bsky.social + @delliott.bsky.social) paper on probing vision/lang/ vision+lang models for semantic norms!

TLDR: SSL vision models (swinV2, dinoV2) are surprisingly similar to LLM & VLMs even w/o lang πŸ‘€
arxiv.org/abs/2506.03994

13.06.2025 15:15 β€” πŸ‘ 12    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Paper title "Cultural Evaluations of Vision-Language Models
Have a Lot to Learn from Cultural Theory"

Paper title "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory"

I am excited to announce our latest work πŸŽ‰ "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory". We review recent works on culture in VLMs and argue for deeper grounding in cultural theory to enable more inclusive evaluations.

Paper πŸ”—: arxiv.org/pdf/2505.22793

02.06.2025 10:36 β€” πŸ‘ 57    πŸ” 18    πŸ’¬ 3    πŸ“Œ 5

Check out our new paper led by @srishtiy.bsky.social and @nolauren.bsky.social! This work brings together computer vision, cultural theory, semiotics, and visual studies to provide new tools and perspectives for the study of ~culture~ in VLMs.

02.06.2025 12:37 β€” πŸ‘ 26    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0
Post image

as an extra take-away, this implies that our eval tends to be overly precision focused. we should really think of what we lose in terms of recalls, as this directly relates to what we miss out for whom when we build these large-scale, general-purpose models.

(4/4)

20.05.2025 12:17 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸš€ We are excited to introduce Kaleidoscope, the largest culturally-authentic exam benchmark.

πŸ“Œ Most VLM benchmarks are English-centric or rely on translationsβ€”missing linguistic & cultural nuance. Kaleidoscope expands in-language multilingual 🌎 & multimodal πŸ‘€ VLMs evaluation

10.04.2025 20:24 β€” πŸ‘ 18    πŸ” 7    πŸ’¬ 1    πŸ“Œ 2
Preview
Postdoc in Robotics and Computer Vision for Life Science Laboratory Automation - DTU Electro As part of a joint research collaboration between DTU and Novo Nordisk, we are looking for a postdoc to join our multidisciplinary research program focusing on the interplay between AI-Protein design,...

Join us and revolutionize Life Science Lab Automation! πŸŽ“πŸ€–πŸ’‰

I am hiring a Postdoc in Robotics and Computer Vision for Life Science Laboratory Automation, in Copenhagen, Denmark.

Is that you? πŸ™‹β€β™€οΈ
efzu.fa.em2.oraclecloud.com/hcmUI/Candid...

11.04.2025 13:53 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1

Yes! It's the monolithic nature of the single value system that is the target of alignment that's so problematic. (But then we also have to agree to be ok with models that generate content that we as individuals are extremely un-aligned with, right?)

10.04.2025 15:10 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Today we are releasing Kaleidoscope πŸŽ‰

A comprehensive multimodal & multilingual benchmark for VLMs! It contains real questions from exams in different languages.

🌍 20,911 questions and 18 languages
πŸ“š 14 subjects (STEM β†’ Humanities)
πŸ“Έ 55% multimodal questions

10.04.2025 10:31 β€” πŸ‘ 25    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1

The Panopticon is amazing! and thanks for this thread - my libby holds list just got a bit longer :-)

10.04.2025 12:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We are looking for two PhD students at our institute in Munich.

Both postions are open-topic, so anything between cognitive science and machine learning is possible.

More information: hcai-munich.com/PhDHCAI.pdf

Feel free to share broadly!

09.04.2025 12:11 β€” πŸ‘ 6    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1
Post image

πŸ“’Excited to announce our upcoming workshop - Vision Language Models For All: Building Geo-Diverse and Culturally Aware Vision-Language Models (VLMs-4-All) @CVPR 2025!
🌐 sites.google.com/view/vlms4all

14.03.2025 15:55 β€” πŸ‘ 17    πŸ” 11    πŸ’¬ 1    πŸ“Œ 4
Post image

BirdCLEF25: Audio-based species identification focused on birds, amphibians, mammals, and insects in Colombia.
πŸ‘‰ www.kaggle.com/competitions...
@cvprconference.bsky.social @kaggle.com
#FGVC #CVPR #CVPR2025 #LifeCLEF
[1/4]

09.04.2025 10:22 β€” πŸ‘ 10    πŸ” 11    πŸ’¬ 1    πŸ“Œ 0
Above: Casing of Ironoquia dubia (RMNH.INS.1544419) collected on May 18th 1971 in Loenen, The Netherlands. b) The label of the specimen.  Depicted on the right: detail of the artificial items.  Photographs: overview: Auke-Florian Hiemstra, details: Pasquale Ciliberti. Below: Caddisfly larvae in the studio of Hubert Duprat, carrying cases made from mostly gold. Β© Hubert Duprat, adagp, 2024, Courtesy the Artist and Art : Concept, Paris, Photo F. Delpech.

Above: Casing of Ironoquia dubia (RMNH.INS.1544419) collected on May 18th 1971 in Loenen, The Netherlands. b) The label of the specimen. Depicted on the right: detail of the artificial items. Photographs: overview: Auke-Florian Hiemstra, details: Pasquale Ciliberti. Below: Caddisfly larvae in the studio of Hubert Duprat, carrying cases made from mostly gold. Β© Hubert Duprat, adagp, 2024, Courtesy the Artist and Art : Concept, Paris, Photo F. Delpech.

Thanks to these insects, we can now study environmental microplastics retrospectively. πŸ” Even before Duprat began his now famous experiments with caddisfly larvae, insects in the wild were already experimenting with plastic... πŸ› 14/x

09.04.2025 10:06 β€” πŸ‘ 132    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0
Post image

The Wikimedia Foundation, which owns Wikipedia, says its bandwidth costs have gone up 50% since Jan 2024 β€”Β a rise they attribute to AI crawlers.

AI companies are killing the open web by stealing visitors from the sources of information and making them pay for the privilege

02.04.2025 09:12 β€” πŸ‘ 5646    πŸ” 2640    πŸ’¬ 67    πŸ“Œ 178

Now your browser can look like Vivaldi! (except maybe the floating video thing) ❀️

07.04.2025 11:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

What a weekend to find Heinrich von Kleist's ErzΓ€hlungen next to the skip, in which the first story is literally about a man wreaking murderous havoc because of the imposition of arbitrary trade tariffs en.wikipedia.org/wiki/Michael...

06.04.2025 18:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Portland, ME showed up. #HandsOff

05.04.2025 15:17 β€” πŸ‘ 20097    πŸ” 4006    πŸ’¬ 249    πŸ“Œ 212
Preview
Billionaires Lose Combined $208 Billion in One Day From Trump Tariffs The world’s 500 richest people saw their combined wealth plunge by $208 billion Thursday as broad tariffs announced by President Donald Trump sent global markets into a tailspin.

The world's 500 richest people saw their combined wealth fall by a combined $208 billion, the most since Covid, as tariffs sent markets into a tailspin

03.04.2025 21:55 β€” πŸ‘ 620    πŸ” 113    πŸ’¬ 135    πŸ“Œ 234
Comic page layouts and their directions

Comic page layouts and their directions

Map depicting how many comics in the TINTIN Corpus come from different countries, along with how many styles per region of the world

Map depicting how many comics in the TINTIN Corpus come from different countries, along with how many styles per region of the world

I’m excited to share our newest paper which is the first to analyze all of our in our TINTIN Corpus: 1,030 comics from 144 countries. We asked: How much are comic layouts influenced by the writing systems of their authors? www.sciencedirect.com/science/arti...

04.04.2025 11:35 β€” πŸ‘ 35    πŸ” 11    πŸ’¬ 2    πŸ“Œ 0

@scfrank is following 19 prominent accounts