Stella Frank's Avatar

Stella Frank

@scfrank.bsky.social

Thinking about multimodal representations | Postdoc at UCPH/Pioneer Centre for AI (DK).

78 Followers  |  389 Following  |  19 Posts  |  Joined: 18.10.2024  |  2.0102

Latest posts by scfrank.bsky.social on Bluesky

A non-anthropomorphized view of LLMs In many discussions where questions of "alignment" or "AI safety" crop up, I am baffled by seriously intelligent people imbuing almost magic...

I wrote a short rant about what irks me when people anthropomorphize LLMs:

addxorrol.blogspot.com/2025/07/a-no...

07.07.2025 06:37 β€” πŸ‘ 39    πŸ” 15    πŸ’¬ 5    πŸ“Œ 1
Preview
Postdoc in Natural Language Processing

πŸ“’I am hiring a Postdoc to work on post-training methods for low-resource languages. Apply by August 15 employment.ku.dk/faculty/?sho....
Let's talk at #ACL2025NLP in Vienna if you want to know more about the position and life in Denmark.

07.07.2025 12:47 β€” πŸ‘ 22    πŸ” 12    πŸ’¬ 0    πŸ“Œ 0
Preview
Computer-vision research powers surveillance technology - Nature An analysis of research papers and citing patents indicates the extensive ties between computer-vision research and surveillance.

New paper hot off the press www.nature.com/articles/s41...

We analysed over 40,000 computer vision papers from CVPR (the longest standing CV conf) & associated patents tracing pathways from research to application. We found that 90% of papers & 86% of downstream patents power surveillance

1/

25.06.2025 17:29 β€” πŸ‘ 752    πŸ” 448    πŸ’¬ 24    πŸ“Œ 59

"Researching and reflecting on the harms of AI is not itself harm reduction. It may even contribute to rationalizing, normalizing, and enabling harm. Critical reflection without appropriate action is thus quintessentially critical washing."

15.06.2025 14:18 β€” πŸ‘ 37    πŸ” 14    πŸ’¬ 0    πŸ“Œ 3
Jingle-jangle fallacies - Wikipedia

Fallacy of the Day:
Calling two different things by the same name doesn't make them the same (jingle) and calling the same thing by different names doesn't make them different (jangle)
en.wikipedia.org/wiki/Jingle-...

(this is going to be so useful for reviewing)

17.06.2025 15:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter Era Human learning and conceptual representation is grounded in sensorimotor experience, in contrast to state-of-the-art foundation models. In this paper, we investigate how well such large-scale models, ...
13.06.2025 15:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Des presenting at VisCon CVPR 2025

Des presenting at VisCon CVPR 2025

Sad not to be there in person but this work will also be presented at ACL in Vienna 2025 - see you there!

13.06.2025 15:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ“― Best Paper Award at CVPR workshop on Visual concepts for our (@doneata.bsky.social + @delliott.bsky.social) paper on probing vision/lang/ vision+lang models for semantic norms!

TLDR: SSL vision models (swinV2, dinoV2) are surprisingly similar to LLM & VLMs even w/o lang πŸ‘€
arxiv.org/abs/2506.03994

13.06.2025 15:15 β€” πŸ‘ 12    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Paper title "Cultural Evaluations of Vision-Language Models
Have a Lot to Learn from Cultural Theory"

Paper title "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory"

I am excited to announce our latest work πŸŽ‰ "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory". We review recent works on culture in VLMs and argue for deeper grounding in cultural theory to enable more inclusive evaluations.

Paper πŸ”—: arxiv.org/pdf/2505.22793

02.06.2025 10:36 β€” πŸ‘ 57    πŸ” 18    πŸ’¬ 3    πŸ“Œ 5

Check out our new paper led by @srishtiy.bsky.social and @nolauren.bsky.social! This work brings together computer vision, cultural theory, semiotics, and visual studies to provide new tools and perspectives for the study of ~culture~ in VLMs.

02.06.2025 12:37 β€” πŸ‘ 26    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0
Post image

as an extra take-away, this implies that our eval tends to be overly precision focused. we should really think of what we lose in terms of recalls, as this directly relates to what we miss out for whom when we build these large-scale, general-purpose models.

(4/4)

20.05.2025 12:17 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Preview
Postdoctoral Researcher (m,f,x) in Philosophy of Consciousness and Cognition, 39,8300 hours/week, TV-L E13

Reminder! Job offer in our team for postdoc/researcher specializing in consciousness & cognition (full-time)
πŸ‘‡πŸ‘‡πŸ‘‡πŸ‘‡

jobs.ruhr-uni-bochum.de/jobposting/6...

10.04.2025 15:44 β€” πŸ‘ 29    πŸ” 24    πŸ’¬ 2    πŸ“Œ 0
Post image

πŸš€ We are excited to introduce Kaleidoscope, the largest culturally-authentic exam benchmark.

πŸ“Œ Most VLM benchmarks are English-centric or rely on translationsβ€”missing linguistic & cultural nuance. Kaleidoscope expands in-language multilingual 🌎 & multimodal πŸ‘€ VLMs evaluation

10.04.2025 20:24 β€” πŸ‘ 18    πŸ” 7    πŸ’¬ 1    πŸ“Œ 2
Preview
Postdoc in Robotics and Computer Vision for Life Science Laboratory Automation - DTU Electro As part of a joint research collaboration between DTU and Novo Nordisk, we are looking for a postdoc to join our multidisciplinary research program focusing on the interplay between AI-Protein design,...

Join us and revolutionize Life Science Lab Automation! πŸŽ“πŸ€–πŸ’‰

I am hiring a Postdoc in Robotics and Computer Vision for Life Science Laboratory Automation, in Copenhagen, Denmark.

Is that you? πŸ™‹β€β™€οΈ
efzu.fa.em2.oraclecloud.com/hcmUI/Candid...

11.04.2025 13:53 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1

Yes! It's the monolithic nature of the single value system that is the target of alignment that's so problematic. (But then we also have to agree to be ok with models that generate content that we as individuals are extremely un-aligned with, right?)

10.04.2025 15:10 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Today we are releasing Kaleidoscope πŸŽ‰

A comprehensive multimodal & multilingual benchmark for VLMs! It contains real questions from exams in different languages.

🌍 20,911 questions and 18 languages
πŸ“š 14 subjects (STEM β†’ Humanities)
πŸ“Έ 55% multimodal questions

10.04.2025 10:31 β€” πŸ‘ 25    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1

The Panopticon is amazing! and thanks for this thread - my libby holds list just got a bit longer :-)

10.04.2025 12:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We are looking for two PhD students at our institute in Munich.

Both postions are open-topic, so anything between cognitive science and machine learning is possible.

More information: hcai-munich.com/PhDHCAI.pdf

Feel free to share broadly!

09.04.2025 12:11 β€” πŸ‘ 6    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1
Post image

πŸ“’Excited to announce our upcoming workshop - Vision Language Models For All: Building Geo-Diverse and Culturally Aware Vision-Language Models (VLMs-4-All) @CVPR 2025!
🌐 sites.google.com/view/vlms4all

14.03.2025 15:55 β€” πŸ‘ 17    πŸ” 11    πŸ’¬ 1    πŸ“Œ 4
Post image

BirdCLEF25: Audio-based species identification focused on birds, amphibians, mammals, and insects in Colombia.
πŸ‘‰ www.kaggle.com/competitions...
@cvprconference.bsky.social @kaggle.com
#FGVC #CVPR #CVPR2025 #LifeCLEF
[1/4]

09.04.2025 10:22 β€” πŸ‘ 10    πŸ” 11    πŸ’¬ 1    πŸ“Œ 0
Above: Casing of Ironoquia dubia (RMNH.INS.1544419) collected on May 18th 1971 in Loenen, The Netherlands. b) The label of the specimen.  Depicted on the right: detail of the artificial items.  Photographs: overview: Auke-Florian Hiemstra, details: Pasquale Ciliberti. Below: Caddisfly larvae in the studio of Hubert Duprat, carrying cases made from mostly gold. Β© Hubert Duprat, adagp, 2024, Courtesy the Artist and Art : Concept, Paris, Photo F. Delpech.

Above: Casing of Ironoquia dubia (RMNH.INS.1544419) collected on May 18th 1971 in Loenen, The Netherlands. b) The label of the specimen. Depicted on the right: detail of the artificial items. Photographs: overview: Auke-Florian Hiemstra, details: Pasquale Ciliberti. Below: Caddisfly larvae in the studio of Hubert Duprat, carrying cases made from mostly gold. Β© Hubert Duprat, adagp, 2024, Courtesy the Artist and Art : Concept, Paris, Photo F. Delpech.

Thanks to these insects, we can now study environmental microplastics retrospectively. πŸ” Even before Duprat began his now famous experiments with caddisfly larvae, insects in the wild were already experimenting with plastic... πŸ› 14/x

09.04.2025 10:06 β€” πŸ‘ 132    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0
Post image

The Wikimedia Foundation, which owns Wikipedia, says its bandwidth costs have gone up 50% since Jan 2024 β€”Β a rise they attribute to AI crawlers.

AI companies are killing the open web by stealing visitors from the sources of information and making them pay for the privilege

02.04.2025 09:12 β€” πŸ‘ 5687    πŸ” 2660    πŸ’¬ 68    πŸ“Œ 178

Now your browser can look like Vivaldi! (except maybe the floating video thing) ❀️

07.04.2025 11:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

What a weekend to find Heinrich von Kleist's ErzΓ€hlungen next to the skip, in which the first story is literally about a man wreaking murderous havoc because of the imposition of arbitrary trade tariffs en.wikipedia.org/wiki/Michael...

06.04.2025 18:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Portland, ME showed up. #HandsOff

05.04.2025 15:17 β€” πŸ‘ 20251    πŸ” 4041    πŸ’¬ 252    πŸ“Œ 216
Preview
Billionaires Lose Combined $208 Billion in One Day From Trump Tariffs The world’s 500 richest people saw their combined wealth plunge by $208 billion Thursday as broad tariffs announced by President Donald Trump sent global markets into a tailspin.

The world's 500 richest people saw their combined wealth fall by a combined $208 billion, the most since Covid, as tariffs sent markets into a tailspin

03.04.2025 21:55 β€” πŸ‘ 625    πŸ” 114    πŸ’¬ 138    πŸ“Œ 237
Comic page layouts and their directions

Comic page layouts and their directions

Map depicting how many comics in the TINTIN Corpus come from different countries, along with how many styles per region of the world

Map depicting how many comics in the TINTIN Corpus come from different countries, along with how many styles per region of the world

I’m excited to share our newest paper which is the first to analyze all of our in our TINTIN Corpus: 1,030 comics from 144 countries. We asked: How much are comic layouts influenced by the writing systems of their authors? www.sciencedirect.com/science/arti...

04.04.2025 11:35 β€” πŸ‘ 35    πŸ” 11    πŸ’¬ 2    πŸ“Œ 0
Preview
Aleatoric or epistemic? Does it matter?

I don't think I got further back than Kendall & Gal (2017) citing Der Kiureghian & Ditlevsen (2007/2009) www.ripid.ethz.ch/Paper/DerKiu... orbit.dtu.dk/en/publicati...

04.04.2025 12:41 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
NeurIPS participation in Europe We seek to understand if there is interest in being able to attend NeurIPS in Europe, i.e. without travelling to San Diego, US. In the following, assume that it is possible to present accepted papers ...

Would you present your next NeurIPS paper in Europe instead of traveling to San Diego (US) if this was an option? SΓΈren Hauberg (DTU) and I would love to hear the answer through this poll: (1/6)

30.03.2025 18:04 β€” πŸ‘ 280    πŸ” 159    πŸ’¬ 6    πŸ“Œ 14

@scfrank is following 19 prominent accounts