Derek Arnold's Avatar

Derek Arnold

@visnerd.bsky.social

Vision Scientist, Aphant

401 Followers  |  167 Following  |  56 Posts  |  Joined: 12.10.2023  |  1.5138

Latest posts by visnerd.bsky.social on Bluesky

Preview
Visual language models show widespread visual deficits on neuropsychological tests - Nature Machine Intelligence Tangtartharakul and Storrs use standardized neuropsychological tests to compare human visual abilities with those of visual language models (VLMs). They report that while VLMs excel in high-level obje...

Our latest paper, β€œVisual language models show widespread visual deficits on neuropsychological tests”, is now out in Nature Machine Intelligence: www.nature.com/articles/s42...

Non-paywalled version:
arxiv.org/abs/2504.10786

Tweet thread below from first author @genetang.bsky.social...

09.02.2026 02:40 β€” πŸ‘ 66    πŸ” 33    πŸ’¬ 1    πŸ“Œ 2

No, but we probably should have done. Still can of course, but I doubt if it would work because we asked people for absolute ratings, as opposed to asking them to rate their own experiences relative to one another. That approach may be better for detecting within p variance

09.02.2026 00:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Redirecting

Now available without randomly placed repeated figures...

doi.org/10.1016/j.co...

08.02.2026 11:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Say him in Brisbane. Brilliant

07.02.2026 10:33 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Imagery modulates the pupillary response, but this does not reliably index differences in imagery vividness. Most people report that they can imagine seeing things in their mind’s eye. But there are large individual differences. A small proportion of people r…

New Paper: Pupillary responses are not a reliable index of differences in imagery vividness.

Our search for more reliable metrics of imagery continues...
www.sciencedirect.com/science/arti...

04.02.2026 05:20 β€” πŸ‘ 37    πŸ” 12    πŸ’¬ 1    πŸ“Œ 0

If you keep it short (an hour or less) and include coffee and cake, you can always can it if it does not work, but the cake should prove popular

23.01.2026 08:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Interested in other experiences.

I think it has a lot to do with how time poor you are. Individual meetings are usually better, but can get challenging depending on other demands on your time.

Lab meetings create opportunities for cross pollination, but are often unproductive virtue signaling

23.01.2026 07:35 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
People report having consistent idiosyncratic β€˜diets’ of imagined sensations when they re-experience the past, and pre-experience the future To some extent, humans can re-experience the sensations of past events and pre-experience the future. These capacities are inter-related. But there are substantial individual differences. At the extre...

New Preprint: People in general have idiosyncratic imagined experiences characterised by salience differences. Some have more salient imagined sensations of smell than of imagery, while most have the opposite - and these differences shape people's daily lives.
www.biorxiv.org/content/10.6...

20.12.2025 23:57 β€” πŸ‘ 7    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
Corollary Discharge Dysfunction to Inner Speech and its Relationship to Auditory Verbal Hallucinations in Patients with Schizophrenia Spectrum Disorders AbstractBackground and Hypothesis. Auditory-verbal hallucinations (AVH)β€”the experience of hearing voices in the absence of auditory stimulationβ€”are a cardi

New paper with Tom Whitford, using EEG to investigate inner speech in people with auditory verbal hallucinations in schizophrenia.

academic.oup.com/schizophreni...

22.10.2025 07:06 β€” πŸ‘ 5    πŸ” 5    πŸ’¬ 2    πŸ“Œ 0

It's really not. As described here, N is a subjective self-report. You may as well ask how many fairies people can see dancing on the head of a pin. Conceptually, this is simply not a verifiable performance metric.

21.10.2025 21:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

SAVE THE DATE! Australasian Society for Experimental Psychology (EPC) & Asia-Pacific Conference on Vision (APCV) Joint Meeting from 1-4 July at the University of Auckland, NZ.
#PsychSciSky #VisionScience #neuroskyence

More information to follow!
visualneuroscience.auckland.ac.nz/epc-apcv-2026/

20.10.2025 20:54 β€” πŸ‘ 12    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

I so wish : )

Failing that - I'll raise a glass to your continued good health at that time : )

20.08.2025 05:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

If the DV are RTs - it would be important to control for local image contrasts. If the DV is recognition, controlling for ~all image properties is futile, as these are what we recognize. If you want to know what properties we rely on, well that is a different question (its some of them)

20.08.2025 05:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The problem - if there is one, is you didn't control for oriented contrast energy, spatial frequency content, local or long range curvature ect ect... Rotating an image causes big changes in these properties. Deciding if control is futile or sensible depends on context.

20.08.2025 05:03 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Exactly - attention kicks in and re-weights image properties ect - but as you say, the images are cool, and I want a coaster : )

20.08.2025 04:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

If you are worried that detection or RTs might be related to contrast diffs ect - sure, control for that type of thing. But I think claiming to control for ~all image stats is futile if you still want to be able to recognize things in the image

20.08.2025 04:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Obv it depends on context, but if you control for all image stats, you could not recognize - as that depends on image stats we have learnt to associate with meaning. So I never understand when papers claim to have controlled for image stats - as they haven't if people can still recognize things

20.08.2025 04:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Bluesky is not a great platform for nuance : )

I also find it really hard to follow conversations here, and think people should use tildes more often

If there is any disagreement - it is with the idea that controlling for low-level confounds is a sensible goal.

20.08.2025 04:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The info that is mapped to semantics is correlated image structure, that is changed when the images are reoriented. So it is a super cool demo of anagram images (I want a coaster), but it does not show that 'high-level' effects are driven by identical stimuli. You have to change the stimuli

20.08.2025 02:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

You just taught me a word : )

Will be looking for opportunities to refer to 'elides" : )

20.08.2025 02:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Replications (stroop-like interference, aesthetics shaped by knowledge) tapped understanding entrained by recognition of correlated image structure. This doesn't seem much different to 'house' and 'horse' having different meanings. The task that didn't (a visual search) had a detection component.

20.08.2025 02:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yes, but even with multi-stable images attention acts to re-weight how we process the different features of the image. But at least that is all happening within the brain/mind, and does not rely on people detecting reliable image features

20.08.2025 01:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I think attempts to dissociate the cognitive processes entrained by an image from all low-level image properties are misguided, as it is ultimately some minimal set of correlated low-level image properties that allow us to recognize anything (including faces as faces, and rabbits as rabbits)

19.08.2025 22:44 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Only images that are multi-stable without any change (in orientation or anything else) could be said to dissociate meaning from the image structure, but even there attention kicks in, and our brains re-weight the encoded image structure by via selective processing.

19.08.2025 22:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The words 'house' and 'horse' are similar, but due to subtle image structure differences it is no surprise they trigger very different associations. Similarly your images are only subtly different when rotated, but they are different and it is not surprising they can trigger different associations

19.08.2025 22:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I think you are attempting the impossible. To extract meaning from an image, we detect correlations between image structure and high-level meaning. It is telling that all your tasks that have positive results are high-level, and the only null result comes from a search task that involves detection

19.08.2025 22:30 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

By changing the orientation of the image, you have changed the low-level image statistics. Once again you have shown that you cannot manipulate high-level visual properties while holding all low-level content constant

19.08.2025 22:00 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0

Obviously I have no idea in this space.

15.08.2025 12:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

When I dream, I am Fully emmersed and embodied in the scene. At least until I guess I am dreaming, which I can conform as I don’t have any sense of touch.

People’s descriptions of imagery don’t sound like that? They sound like they are more selective no?

15.08.2025 12:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

In my mind, if you have to close your eyes you are not a projector. Whatever representation your brain can generate clearly cannot be projected into the world you are seeing.

14.08.2025 10:01 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

@visnerd is following 20 prominent accounts