Open Mind's Avatar

Open Mind

@openmindjournal.bsky.social

Cognitive science journal published by MIT Press. https://direct.mit.edu/opmi

681 Followers  |  1 Following  |  107 Posts  |  Joined: 22.11.2024  |  2.1298

Latest posts by openmindjournal.bsky.social on Bluesky

Detection, Inspection, Return: An Object-Based Classification and Metric of Fixations in Complex Scenes AbstractAnalyses of human gaze behaviour towards complex scenes typically aim to explain heatmaps or scan-paths. While heatmaps lack temporal information, scan-paths aim for a level of detail which often is impractical. We introduce a novel approach, based on the premise that most fixations target objects and do so in meaningfully different ways, depending on temporal context: Detection fixations (D) foveate an object for the first time; Inspection fixations (I) successively target object details; and Return fixations (R) revisit a previously fixated object after going elsewhere. To test the hypothesis that these classes capture distinct fixation profiles, we reanalysed a large dataset of scene fixations. We computed separate heatmaps for D, I, and R and found significantly higher inter-observer consistency within than between classes. Across fixations landing on different semantic features, the proportion of D, I, and R fixations varied consistently, and a semantic salience model trained to predict each type of fixations independently learned diverging distributions of feature weights. Further, we found a shift from D to I and R across viewing time, in line with previous findings on ambient and focal viewing modes. We tested and confirmed that the dynamics of this shift varied as a function of trial duration. Finally, we highlight the recent application of the D, I, R classification as a metric for gaze comparisons in the context of dynamic scenes, in which scan-path similarity metrics fail. We propose the D, I, and R classification as a computationally simple yet powerful tool to classify spatiotemporal aspects of scene fixations in an object-based and intuitive manner and provide well-documented code to implement it. Future research may explore potential functional differences between D, I, and R fixations.
02.02.2026 20:45 — 👍 0    🔁 0    💬 0    📌 0
Using Artificial Neural Networks to Relate External Sensory Features to Internal Decisional Evidence AbstractAll theories of perceptual decision-making postulate that external sensory information is transformed into the internal evidence that is used to judge the identity of the stimulus. However, the nature of this external-to-internal transformation is generally unknown. In two experiments, we examined how a particular stimulus feature—orientation—is transformed into internal evidence. Subjects judged whether Gabors were tilted clockwise or counterclockwise. The results of Experiment 1 demonstrated that increasing the stimulus tilt in fine-scale increments resulted in a linear increase in sensitivity. However, the results of Experiment 2 demonstrated that increasing the stimulus tilt in coarse-scale increments had little effect on sensitivity, suggesting a highly non-linear transformation. Critically, artificial neural networks (ANNs) trained on the orientation task reproduced the empirical results, providing a framework for examining this external-to-internal transformation. The ANNs’ internal activations revealed that fine-scale increments in tilt magnitude results in increasingly greater discriminability between the stimulus categories, but the degree of discriminability does not increase further after tilt magnitude becomes sufficiently large. Taken together, these results begin to reveal how external sensory information is transformed into the internal evidence that is used to judge the identity of a stimulus and suggest that ANNs could serve as a platform for understanding the mechanism underlying this critical transformation.
02.02.2026 20:45 — 👍 1    🔁 0    💬 0    📌 0
An Efficient Communication Analysis of Modal Typology AbstractThe meanings expressed by the world’s languages have been argued to support efficient communication. Across diverse semantic domains, crosslinguistic analyses show that natural language vocabularies are jointly optimized for two competing pressures: cognitive simplicity and informative communication. This paper applies an efficiency analysis to modals (e.g., can, ought, might), a central mechanism for talking about situations other than the here-and-now. We define and measure the simplicity and informativeness of a large number of logically possible modal systems, together with a sample of twenty-seven natural language inventories. We also consider a recently-introduced semantic universal for modal expressions in natural language, dubbed the Independence of Force and Flavor (IFF). Our analysis yields three main results: (i) every optimal modal system perfectly satisfies the IFF universal; (ii) as systems contain more IFF modals, they become more efficient; (iii) attested modal systems are more efficient than merely possible systems. These results indicate that general pressures for efficient communication can explain typological variation in the lexicalization of modality.
02.02.2026 20:45 — 👍 1    🔁 0    💬 0    📌 0
How Prediction of the Future Affects Encoding of the Present: Cooperation or Competition? AbstractEach day brings new experiences and the opportunity to form new episodic memories. However, our everyday experiences are not isolated episodes; rather, there is significant spatial and temporal structure that holds across experiences, allowing us to build up structured knowledge about the world. We can then leverage this known structure to make predictions about how new experiences will unfold. In a recent study in Open Mind, Poskanzer, Tarder-Stoll, et al. found that such predictions benefitted episodic memory. Specifically, on trials in which participants were successfully able to make predictions about an upcoming experience, participants were more likely to encode that predictive information into memory. These findings seem to stand in stark contrast to other recent work, which found the opposite of worse episodic memory for predictive cues (Sherman et al., 2022; Sherman & Turk-Browne, 2020). How can these discrepant findings be reconciled? Here, I discuss several key task differences that might explain the discrepancy and highlight avenues for future research which might help to theoretically disentangle the contexts under which prediction may impede vs. facilitate episodic encoding.
02.02.2026 20:45 — 👍 0    🔁 0    💬 0    📌 0
The Scope and Limits of Iconic Prosody: Head Angle Predicts f 0 Changes While Object Size Effects Are Absent AbstractThe relation between the fundamental frequency of the voice (f0) and vertical space has been shown in previous studies; however, the underlying mechanisms are less clear. This study investigates the relationship between head angle and f0 in iconic prosody, along with the influence of object size on lip opening and formant frequencies. In the experiment, participants pointed to objects of two different sizes and in various vertical positions while saying the words “piff” or “paff,” which induced vertical head position change. Head angle emerged as a reliable predictor of f0, with a larger angle increasing the f0. This effect was consistent despite individual variations in head movement. While the vertical position of the object also showed a reliable effect on f0, head angle substantially outperformed it as a predictor, suggesting that head angle represents the primary physiological mechanism predicting f0 changes. Conversely, object size did not predict either lip opening or formant dispersion. Lip opening and formant dispersion were purely indexical, tracking vowel-specific articulatory configurations rather than external object properties. These findings underscore the role of head position in modulating f0 through direct physiological coupling, potentially underpinning iconic prosody, while revealing the limits of size-related iconicity in parameters constrained by phonemic requirements.
15.12.2025 04:36 — 👍 0    🔁 0    💬 0    📌 0
The Scope and Limits of Iconic Prosody: Head Angle Predicts f 0 Changes While Object Size Effects Are Absent AbstractThe relation between the fundamental frequency of the voice (f0) and vertical space has been shown in previous studies; however, the underlying mechanisms are less clear. This study investigates the relationship between head angle and f0 in iconic prosody, along with the influence of object size on lip opening and formant frequencies. In the experiment, participants pointed to objects of two different sizes and in various vertical positions while saying the words “piff” or “paff,” which induced vertical head position change. Head angle emerged as a reliable predictor of f0, with a larger angle increasing the f0. This effect was consistent despite individual variations in head movement. While the vertical position of the object also showed a reliable effect on f0, head angle substantially outperformed it as a predictor, suggesting that head angle represents the primary physiological mechanism predicting f0 changes. Conversely, object size did not predict either lip opening or formant dispersion. Lip opening and formant dispersion were purely indexical, tracking vowel-specific articulatory configurations rather than external object properties. These findings underscore the role of head position in modulating f0 through direct physiological coupling, potentially underpinning iconic prosody, while revealing the limits of size-related iconicity in parameters constrained by phonemic requirements.
15.12.2025 03:51 — 👍 0    🔁 0    💬 0    📌 0
Agenda Setting and The Emperor’s New Clothes : People Diagnose Information Cascades during Sequential Testimony by Reasoning About Informants’ Speaking Order and Social Status AbstractConsensus-based social learning strategies often outcompete other strategies in evolutionary models. But while formal proofs suggest that consensus’ reliability is compromised when individual judgments are not independent, this makes for a notoriously implausible assumption in the biological world: the people we learn from are constantly learning from each other as well. How do we avoid being misled by consensus? We present three experiments and a computational model examining commonsense reasoning about how people’s public and private judgments are influenced by the consensus and social status of those around them. Results suggest that while people realize that these two factors can cause others’ public and private judgments to diverge, their own trust in public consensus depends on how accurately they believe it reflects their informants’ true beliefs.
10.12.2025 04:06 — 👍 0    🔁 0    💬 0    📌 0
What is Balance? A Vital Mechano-Regulation Paradigm AbstractWithin minutes of birth a newborn gnu or giraffe works to stand and walk, asserting postural balance and organised animate behaviour in an apparently goal-directed manner. In contrast, robots learning to stand and walk from scratch begin with random flailing, the behaviour cohering over time as the robot internalises some reward/value signal. How does the newborn gnu ‘innately know’ what goal to aim for, and decide to work towards it? How could similar goal-directed balance learning be implemented in robots? Currently, animate balance inherits its axiomatic definition from the Newtonian formulation for inanimate balance; static mechanical equilibrium. This is arguably inappropriate for animate balance, because animals need to move and are never in static mechanical equilibrium, giving rise to the ‘posture-movement paradox’. The present Perspective proposes a more fluid, dynamical axiomatic task definition and goal which (a) isolates resisting gravity, (b) admits and enables movement, and (c) subsumes static mechanical equilibrium as a special case. This novel definition is founded upon inevitable biophysical requirements and observable developmental process. The article explains how animals apprehend and embed this goal through prenatal development suspended in equidense amniotic fluid, and then are challenged to self-maintain it by the perinatal transition. The account entails a paradigmatic shift in putative physiological organisation and associated conceptual framework for balance; from a subsidiary sensorimotor control task to a vital mechano-regulation task, organisationally akin to thermo-regulation. This vital mechano-regulation model of balance has practical implications and implies a range of predictions.
10.12.2025 04:06 — 👍 0    🔁 0    💬 0    📌 0
Agenda Setting and The Emperor’s New Clothes : People Diagnose Information Cascades during Sequential Testimony by Reasoning About Informants’ Speaking Order and Social Status AbstractConsensus-based social learning strategies often outcompete other strategies in evolutionary models. But while formal proofs suggest that consensus’ reliability is compromised when individual judgments are not independent, this makes for a notoriously implausible assumption in the biological world: the people we learn from are constantly learning from each other as well. How do we avoid being misled by consensus? We present three experiments and a computational model examining commonsense reasoning about how people’s public and private judgments are influenced by the consensus and social status of those around them. Results suggest that while people realize that these two factors can cause others’ public and private judgments to diverge, their own trust in public consensus depends on how accurately they believe it reflects their informants’ true beliefs.
10.12.2025 03:57 — 👍 1    🔁 0    💬 0    📌 0
What is Balance? A Vital Mechano-Regulation Paradigm AbstractWithin minutes of birth a newborn gnu or giraffe works to stand and walk, asserting postural balance and organised animate behaviour in an apparently goal-directed manner. In contrast, robots learning to stand and walk from scratch begin with random flailing, the behaviour cohering over time as the robot internalises some reward/value signal. How does the newborn gnu ‘innately know’ what goal to aim for, and decide to work towards it? How could similar goal-directed balance learning be implemented in robots? Currently, animate balance inherits its axiomatic definition from the Newtonian formulation for inanimate balance; static mechanical equilibrium. This is arguably inappropriate for animate balance, because animals need to move and are never in static mechanical equilibrium, giving rise to the ‘posture-movement paradox’. The present Perspective proposes a more fluid, dynamical axiomatic task definition and goal which (a) isolates resisting gravity, (b) admits and enables movement, and (c) subsumes static mechanical equilibrium as a special case. This novel definition is founded upon inevitable biophysical requirements and observable developmental process. The article explains how animals apprehend and embed this goal through prenatal development suspended in equidense amniotic fluid, and then are challenged to self-maintain it by the perinatal transition. The account entails a paradigmatic shift in putative physiological organisation and associated conceptual framework for balance; from a subsidiary sensorimotor control task to a vital mechano-regulation task, organisationally akin to thermo-regulation. This vital mechano-regulation model of balance has practical implications and implies a range of predictions.
10.12.2025 03:57 — 👍 1    🔁 0    💬 0    📌 0
Linguistic Rule Generalisation Creates the Same Distributional Structure That Feeds It AbstractPart of language’s great expressivity comes from its users creating new forms by applying familiar rules to novel items. But linguistic rules aren’t all created equal—some are more readily generalisable than others. In this paper, we focus on how rule generalisation is affected by certain properties of frequency distributions. In an artificial language learning experiment that asks adult learners to generalise using one of two suffixes, we find that they probability-match their input but slightly prefer whichever suffix they encountered with more low-frequency stems. Then with an urn model of learning, we show that previous explanations of generalisation that focus only on a distribution’s type count or its skew fail to capture participants’ behaviour—only the low-frequency preference yields convergent results. We model learners’ behaviour in terms of rational Bayesian inference about how likely a rule is to apply to more word types than somebody has already encountered. Overall, we suggest that linguistic rule generalisation is a self-sustaining process: by creating novel and therefore low-frequency items, rule generalisation produces the very same distributional structure that feeds it.
14.11.2025 04:05 — 👍 3    🔁 0    💬 0    📌 0
The Multifaceted Ganzfeld at the Crossroad Between Visual Perception and Consciousness: Behavioral, Neural and Qualitative Aspects AbstractA Ganzfeld is a homogeneous visual field, devoid of any focal points. Such a stimulus has been used by researchers to study perceptual phenomena in the absence of changes in sensory structure. Others have used it to study altered states of consciousness (ASCs). Until now, these different facets have been studied separately with little attention for the emotional subjective experience. This study aimed to elucidate the perceptual, phenomenal, and emotional experience of the multifaceted Ganzfeld using a multi-method approach combining behavioral (eye-tracking) and neural (electroencephalography; EEG) measures, with qualitative (interviews) and quantitative (questionnaires) assessments. We show that Ganzfeld spaces induce ASCs and offer immersive, full-body experiences, including bodily effects. Our results pertaining to bodily sensations further prompted us to identify a perceptually grounded cognitive processing type with either an inward-directed or externally-directed focus. We also identified the presence of an abstract cognitive processing type characterized by an introspective focus and meditative experiences. At the behavioral level, decays were characterized by decreased eye movements. The lag in reporting decays and the subjective experience of decays point to the notion of mind blanking. At the neural level, we found increased theta activity preceding decays, further hinting at a potential interrelation between perceptual decays and mind blanking. Finally, decays were characterized by more alpha activity, a pattern often associated with attenuated sensory processing and states of reduced external engagement (Jensen & Mazaheri, 2010), such as relaxation. Our findings contribute to a more in-depth understanding of all the components contributing to the rich Ganzfeld experiences.
14.11.2025 04:05 — 👍 0    🔁 0    💬 0    📌 0
Learning to Decompose: Human-Like Subgoal Preferences Emerge in Neural Networks Learning Graph Traversal AbstractCognitive scientists have discovered normative and heuristic principles that capture human subgoal preferences when partitioning problems into smaller ones. However, it remains unclear where such preferences come from and why they tend to be both effective and efficient. In this work, we study the processes through which these preferences may be implicitly encoded over learning as learners improve towards optimal traversals. We build on the graph-based environments from prior work and use neural networks as model learners to test if learning shortest-path traversal can lead to human-like path decomposition. We find that simple transformer models develop a preference for paths containing nodes that occur frequently on the shortest paths, consistent with human subgoal preferences found in prior work. This preference is observed when models solve shortest path traversals for unseen problems in both known graphs and new graphs, demonstrating that human-like subgoal preferences can arise without requiring explicit preference computation or exhaustively searching over all possible paths. The same preference does not emerge when models learn to perform random or Hamiltonian traversals. Our findings are robust across several transformer variants as well as recurrent neural networks, suggesting they depend more on the data distribution than the network architecture.
14.11.2025 04:05 — 👍 2    🔁 0    💬 0    📌 0
Linguistic Rule Generalisation Creates the Same Distributional Structure That Feeds It AbstractPart of language’s great expressivity comes from its users creating new forms by applying familiar rules to novel items. But linguistic rules aren’t all created equal—some are more readily generalisable than others. In this paper, we focus on how rule generalisation is affected by certain properties of frequency distributions. In an artificial language learning experiment that asks adult learners to generalise using one of two suffixes, we find that they probability-match their input but slightly prefer whichever suffix they encountered with more low-frequency stems. Then with an urn model of learning, we show that previous explanations of generalisation that focus only on a distribution’s type count or its skew fail to capture participants’ behaviour—only the low-frequency preference yields convergent results. We model learners’ behaviour in terms of rational Bayesian inference about how likely a rule is to apply to more word types than somebody has already encountered. Overall, we suggest that linguistic rule generalisation is a self-sustaining process: by creating novel and therefore low-frequency items, rule generalisation produces the very same distributional structure that feeds it.
14.11.2025 03:56 — 👍 0    🔁 0    💬 0    📌 0
The Multifaceted Ganzfeld at the Crossroad Between Visual Perception and Consciousness: Behavioral, Neural and Qualitative Aspects AbstractA Ganzfeld is a homogeneous visual field, devoid of any focal points. Such a stimulus has been used by researchers to study perceptual phenomena in the absence of changes in sensory structure. Others have used it to study altered states of consciousness (ASCs). Until now, these different facets have been studied separately with little attention for the emotional subjective experience. This study aimed to elucidate the perceptual, phenomenal, and emotional experience of the multifaceted Ganzfeld using a multi-method approach combining behavioral (eye-tracking) and neural (electroencephalography; EEG) measures, with qualitative (interviews) and quantitative (questionnaires) assessments. We show that Ganzfeld spaces induce ASCs and offer immersive, full-body experiences, including bodily effects. Our results pertaining to bodily sensations further prompted us to identify a perceptually grounded cognitive processing type with either an inward-directed or externally-directed focus. We also identified the presence of an abstract cognitive processing type characterized by an introspective focus and meditative experiences. At the behavioral level, decays were characterized by decreased eye movements. The lag in reporting decays and the subjective experience of decays point to the notion of mind blanking. At the neural level, we found increased theta activity preceding decays, further hinting at a potential interrelation between perceptual decays and mind blanking. Finally, decays were characterized by more alpha activity, a pattern often associated with attenuated sensory processing and states of reduced external engagement (Jensen & Mazaheri, 2010), such as relaxation. Our findings contribute to a more in-depth understanding of all the components contributing to the rich Ganzfeld experiences.
14.11.2025 03:56 — 👍 1    🔁 0    💬 0    📌 1
Learning to Decompose: Human-Like Subgoal Preferences Emerge in Neural Networks Learning Graph Traversal AbstractCognitive scientists have discovered normative and heuristic principles that capture human subgoal preferences when partitioning problems into smaller ones. However, it remains unclear where such preferences come from and why they tend to be both effective and efficient. In this work, we study the processes through which these preferences may be implicitly encoded over learning as learners improve towards optimal traversals. We build on the graph-based environments from prior work and use neural networks as model learners to test if learning shortest-path traversal can lead to human-like path decomposition. We find that simple transformer models develop a preference for paths containing nodes that occur frequently on the shortest paths, consistent with human subgoal preferences found in prior work. This preference is observed when models solve shortest path traversals for unseen problems in both known graphs and new graphs, demonstrating that human-like subgoal preferences can arise without requiring explicit preference computation or exhaustively searching over all possible paths. The same preference does not emerge when models learn to perform random or Hamiltonian traversals. Our findings are robust across several transformer variants as well as recurrent neural networks, suggesting they depend more on the data distribution than the network architecture.
14.11.2025 03:56 — 👍 2    🔁 0    💬 0    📌 0
Vowel- and Diphthong-Like Spectral Patterns in Sperm Whale Codas AbstractThe sperm whale communication system, consisting of groups of clicks called codas, has been primarily analyzed in terms of the number of clicks and their inter-click timing. This paper reports spectral properties in sperm whale vocalizations and demonstrates that spectral properties are highly structured, discretely distributed across codas, and uttered in dialogues, rather than being a physical artefact of whale movement. We report formant structure in whale codas and uncover previously unobserved spectral patterns. We argue that these spectral properties freely combine with the traditionally analyzed properties. We present a visualization technique that allows the description of several previously unobserved patterns. Codas are on many levels analogous to human vowels and diphthongs and can be conceptualized in terms of the source-filter theory: vowel duration and pitch correspond to the number of clicks and their timing (traditional coda types), while spectral properties of clicks correspond to formants in human vowels. We identify two recurrent and discrete coda-level spectral patterns that appear across individual sperm whales and across traditional coda types: the a- and i-coda vowels. We also report that sperm whales have diphthongal patterns on individual codas: with rising, falling, rising-falling and falling-rising formant patterns observed. These uncovered patterns suggest that spectral properties have the potential to add to the communicative complexity of codas independent of the traditionally analyzed properties and add a new dimension to the study of a cetacean communication system.
13.11.2025 04:19 — 👍 1    🔁 0    💬 0    📌 0
Delayed First Language Exposure Negatively Impacts Representation of Small Quantities: Evidence from Deaf and Hard-of-Hearing Children AbstractMost deaf and hard-of-hearing children are born to hearing parents, often delaying exposure to their first language. This negatively influences development of not only language, but also many other aspects of cognition, including exact representations of large quantities. The core knowledge view of numeracy predicts that delays in language exposure should not affect nonverbal representations of small quantities (1–3). This study is the first to investigate effects of language modality (spoken vs. signed) and timing of language experience (early, from birth vs. later) on the representation of small quantities of objects. We adapted the “Mr. Elephant” task (Shusterman et al., 2017) and examined whether children (age 3 to 7 years) succeeded on trials involving quantities 2 and 3. A logistic regression found that Timing and Socioeconomic Status significantly predicted Mr. Elephant performance, while Modality and Age did not. Early-exposed children were more likely to succeed on the task than Later-exposed children. For an exploratory follow-up, two measures of language were added into the analysis: Highest Count, which records children’s recitation of the count list, and Give-a-Number (‘Give-N’), which assesses children’s understanding of the cardinal principle (CP). This logistic regression found that Timing and Give-N performance significantly and independently predicted Mr. Elephant performance, but Socioeconomic Status and Highest Count did not. Children who were CP-knowers were more likely to succeed on Mr. Elephant than non-CP-knowers. These results suggest that the representation of small quantity representations is associated with the timing of children’s language exposure and their knowledge of the cardinal principle.
13.11.2025 04:19 — 👍 0    🔁 0    💬 0    📌 0
Information-Theoretic Measures of Metacognition: Bounds and Relation to Group Performance AbstractMetacognition comprises the ability to differentiate the accuracy of predictions about the world. This is often called Type 2 performance (with Type 1 performance being the overall accuracy). Typical measures of metacognition are based on signal detection theory and require the strong assumption of truncated normal noise underlying confidence ratings. To minimize distributional assumptions, measures based on classical information theory have been proposed. We further this approach by providing bounds on its key quantity, the transmitted information. We show that classifiers making predictions with a certain accuracy can transmit information only within a limited range, depending on the underlying noise distribution: The lowest transmitted information indicates the worst Type 2 performance and corresponds to binary noise; the highest transmitted information indicates the best Type 2 performance and corresponds to uniform noise. Because normal noise is only an intermediate case, traditional measures based on this assumption can bias interpretations of Type 2 performance. Based on these bounds, we suggest a new measure: Relative metainformation (RMI). RMI scales from 0 (lower bound) to 1 (upper bound) and therefore advances towards the much-needed decoupling of Type 2 from Type 1 performance measures. To demonstrate the strengths of RMI, we apply it to groups: In a setting where multiple independent group members with fixed accuracies combine their predictions in an optimal way, we show that the group performance depends directly on RMI: Group accuracy is best vs. worst if the group members have highest vs. lowest RMI values. Overall, our theoretical bounds allow to better evaluate measures of Type 2 and group performance.
13.11.2025 04:19 — 👍 4    🔁 0    💬 0    📌 0
The Curious U : Integrating Theories Linking Knowledge and Information-Seeking Behavior AbstractMany empirical studies have found a curvilinear (inverted-U) relationship between knowledge and curiosity, such that curiosity is induced when stimuli are neither unknown nor too familiar. While various theoretical accounts have been proposed to explain this phenomenon, no clear link between them have been delineated. In this Perspective, we review seven psychological accounts of the inverted-U relationship between knowledge and curiosity (“the U”) and provide a coherent framework integrating them. According to this framework, the U emerges as a consequence of the imperative to pursue learning progress and thus maximize knowledge. We show that some theories of curiosity address this issue by explicitly stipulating knowledge maximization as the computational objective, and learning-progress maximization as an optimal means of achieving it (i.e., normative theories). Other theories focus on psychological mechanisms or factors that drive curiosity (i.e., process theories). We propose that these process-theoretic mechanisms could also work in a manner that maximizes learning by signaling situations in which some relevant prior knowledge exists, but is incomplete. The implications of this framework for future theoretical work on curiosity and its connections to related phenomena are discussed.
13.11.2025 04:19 — 👍 8    🔁 2    💬 0    📌 0
Vowel- and Diphthong-Like Spectral Patterns in Sperm Whale Codas AbstractThe sperm whale communication system, consisting of groups of clicks called codas, has been primarily analyzed in terms of the number of clicks and their inter-click timing. This paper reports spectral properties in sperm whale vocalizations and demonstrates that spectral properties are highly structured, discretely distributed across codas, and uttered in dialogues, rather than being a physical artefact of whale movement. We report formant structure in whale codas and uncover previously unobserved spectral patterns. We argue that these spectral properties freely combine with the traditionally analyzed properties. We present a visualization technique that allows the description of several previously unobserved patterns. Codas are on many levels analogous to human vowels and diphthongs and can be conceptualized in terms of the source-filter theory: vowel duration and pitch correspond to the number of clicks and their timing (traditional coda types), while spectral properties of clicks correspond to formants in human vowels. We identify two recurrent and discrete coda-level spectral patterns that appear across individual sperm whales and across traditional coda types: the a- and i-coda vowels. We also report that sperm whales have diphthongal patterns on individual codas: with rising, falling, rising-falling and falling-rising formant patterns observed. These uncovered patterns suggest that spectral properties have the potential to add to the communicative complexity of codas independent of the traditionally analyzed properties and add a new dimension to the study of a cetacean communication system.
13.11.2025 04:11 — 👍 0    🔁 0    💬 0    📌 0
The Minds That Matter: How Robots’ Mental Capacities Shape Children’s Evaluations and Trust AbstractRobots express a great deal of diverse human-like capacities, ranging from communicating in natural languages to displaying emotions to responding to physical touch. Here we examined the role of different kinds of mental capacities on children’s evaluations of, and trust in, robots. We presented 6- to 9-year-olds with identical-looking humanoid robots described as having one (or none) of the following capacities: cognitive-perceptual, social-emotional, or physiological. Across three studies (N = 287), we found that children differentially evaluated (Studies 1A and 1B) and selectively trusted (Study 2) robots with different types of minds. The diverging evaluations (i.e., of benevolence, intelligence, affinity, and epistemic appeal) of robots with different minds emerged between ages 7 and 8 and became stronger with age. Moreover, these differences translated into selective trust choices: children trusted robots with cognitive-perceptual capacities over robots with social-emotional capacities in a factual, but not a social, context, and over robots with bodily capacities across both contexts. Altogether, these findings open avenues for future interdisciplinary research on children’s reasoning about emerging technologies.
29.10.2025 17:30 — 👍 1    🔁 0    💬 0    📌 0
The Minds That Matter: How Robots’ Mental Capacities Shape Children’s Evaluations and Trust AbstractRobots express a great deal of diverse human-like capacities, ranging from communicating in natural languages to displaying emotions to responding to physical touch. Here we examined the role of different kinds of mental capacities on children’s evaluations of, and trust in, robots. We presented 6- to 9-year-olds with identical-looking humanoid robots described as having one (or none) of the following capacities: cognitive-perceptual, social-emotional, or physiological. Across three studies (N = 287), we found that children differentially evaluated (Studies 1A and 1B) and selectively trusted (Study 2) robots with different types of minds. The diverging evaluations (i.e., of benevolence, intelligence, affinity, and epistemic appeal) of robots with different minds emerged between ages 7 and 8 and became stronger with age. Moreover, these differences translated into selective trust choices: children trusted robots with cognitive-perceptual capacities over robots with social-emotional capacities in a factual, but not a social, context, and over robots with bodily capacities across both contexts. Altogether, these findings open avenues for future interdisciplinary research on children’s reasoning about emerging technologies.
18.10.2025 03:14 — 👍 1    🔁 0    💬 0    📌 0
Initial Expectations and Confidence Affect the Formation of Novel Self—Beliefs and Their Revision AbstractHuman self-beliefs hinge on social feedback, but their formation and revision are not solely based on new information. Biases during learning, such as confirming initial expectations, can lead to inaccurate beliefs. This study uses computational modeling to explore how initial expectations about one’s own and others’ abilities and confidence in these beliefs affect processes of belief formation and belief revision in novel behavioral domains. In the first session, participants formed performance beliefs through trial-by-trial feedback. In the second session, feedback contingencies were reversed to promote a revision of beliefs. Results showed that people form and revise beliefs in a confirmatory manner, with lower initial expectations being linked to more negatively biased belief formation and revision, while growing confidence strengthened these beliefs over time. Once formed, these beliefs proved resistant to change even when faced with contradictory feedback. The findings suggest that newly formed beliefs become entrenched and resistant to new, contradictory information in a short period of time. Understanding how self-beliefs are formed, the role that confidence plays in this process, and why established beliefs are difficult to revise can inform the development of interventions aimed at promoting more adaptive learning in educational, clinical, and social contexts.
18.10.2025 03:14 — 👍 5    🔁 1    💬 0    📌 0
Epistemic Curiosity in Kea Parrots and Human Children AbstractBoth human children and animals seek information following a violation-of-expectation event, but little research suggests the latter do so for the sake of it. In this preregistered experiment, we compared epistemic curiosity—the pursuit of information for its own sake—in kea parrots (Nestor notabilis) and three-year-old human children (Homo sapiens) following a violation-of-expectation event. Subjects were trained to push a tool into an apparatus that produced a reward before the apparatus was surreptitiously made non-functional in following trials. In both functional and non-functional trials, after solving the task, subjects were rewarded and allowed to explore the apparatus for thirty seconds with the opportunity to peek into the side of the apparatus. We found that relatively more kea peeked than children, but the children and not the kea were significantly more likely to peek in the non-functional versus functional trials, particularly when the researcher was absent. While both species showed markers of curiosity in the experiment, we found expectancy-violation-induced epistemic curiosity only in the children and not the kea in this context.
18.10.2025 03:14 — 👍 2    🔁 1    💬 0    📌 0
The Missing Half of Language Learning in Current Developmental Language Models: Exogenous and Endogenous Linguistic Input AbstractDevelopmental language models (DLMs) aim to replicate the efficiency of child language acquisition but often focus solely on the estimation of exogenous linguistic input. We argue that a child’s linguistic growth is also critically shaped by endogenous processes, including (1) co-opting language in non-linguistic perception and cognition, (2) engaging in private and inner speech, and (3) benefiting from neural replay of linguistic information during sleep. These endogenous processes amplify and refine exogenous linguistic input in ways that current DLMs do not replicate. To align DLMs with child language acquisition, we propose redefining “linguistic exposure“ to encompass both exogenous and endogenous linguistic input. By integrating label feedback, self-generated speech, and sleep-like consolidation, researchers can narrow the gap between artificial and human learning. Collaborations across machine learning, psychology, and linguistics will be essential to ground models in empirical data on child behavior and build DLMs that truly reflect the marvel of language acquisition.
18.10.2025 03:14 — 👍 1    🔁 0    💬 0    📌 0
Semantic Anchors Facilitate Task Encoding in Continual Learning AbstractHumans are remarkably efficient at learning new tasks, in large part by relying on the integration of previously learned knowledge. However, research on task learning typically focuses on the learning of abstract task rules on minimalist stimuli, to study behavior independent of the learning history that humans come equipped with (i.e., semantic knowledge). In contrast, several theories suggest that the use of semantic knowledge and labels may help the learning of new task information. Here, we tested whether providing existing, semantically rich task embeddings and response labels allowed for more robust task rule encoding and less (catastrophic) forgetting and interference. Our results show that providing semantically rich task settings and response labels resulted in less task forgetting (Experiment 1), both when using pictorial symbols or words as labels (Experiment 2), or when contrasted with visually matched shape labels without inherent meaning (Experiment 4). Using a subsequent value-based decision-making task and reinforcement learning modeling (Experiment 3), we demonstrate how the learned embedding of novel stimuli in semantically rich, representations, further allowed for a more efficient, feature-specific processing when learning new task information. Finally, using artificial recurrent neural networks fitted to our participants’ task performance, we found that task separation during learning was more predictive of learning and task performance in the semantically rich conditions. Together, our findings show the benefit of using semantically rich task rules and response labels during novel task learning, thereby offering important insights into why humans excel in continual learning and are less susceptible to catastrophic forgetting compared to most artificial agents.
27.09.2025 02:50 — 👍 0    🔁 0    💬 0    📌 0
Predictive Structure Emerges During the Generalisation of Kin Terms to New Referents AbstractDespite cross-linguistic diversity in how kin relations map to terminology, there are constraints on which kin may be categorised together. But what are the constraints on kin term variation, and where do they come from? One proposed constraint is internal co-selection—an evolutionary process where terminological changes in one generation of kin co-occur with parallel changes in other generations. This results in kin categories which are predictable on the basis of other kin categories, a property we call predictive structure. To determine the strength of this constraint, we measured the predictive structure of kinship terminology systems from 731 languages. We found that kinship terminologies exhibit a significant degree of predictive structure, and we argue that its prevalence reflects a cognitive pressure for simplicity imposed during the generalisation of known kin categories to new referent types. We tested this claim using an artificial kin term generalisation task. Our results suggest that people do favour predictive structure when generalising from known kin categories to new referents, but that this preference faces interference from other pressures to distinguish kin by features like gender.
27.09.2025 02:50 — 👍 0    🔁 0    💬 0    📌 0
The Relative Contributions of Traits and Contexts on Social Network Learning AbstractNavigating the social world is guided by remembering which people know each other. Yet, different factors might influence how social relationships are remembered, where people’s shared attributes could distort a social network’s mnemonic representation. Here, we study whether dyadically shared contexts and personality traits impact how people remember relationships in social networks. Through varying levels of network topological complexity, we find the contexts where people know each other are most memorable and that better contextual retrieval predicts relationship recall. In contrast, shared personality traits affect relationship recall differently depending on social network complexity, where shared negatively valenced traits relate to worse relationship recall in the simple network. Subsequent modeling revealed that as networks become more complex, relationships between more centrally positioned individuals that share negatively valenced traits are better recalled compared to less well-connected individuals. These results suggest contextual memory can serve as a scaffold for remembering relationships in a social network, while affective traits’ impact on social network retrievability depends on emotional valence and the individuals involved. More generally, our findings give insight into how the same social network can be represented differently based on one’s past experience.
27.09.2025 02:50 — 👍 0    🔁 0    💬 0    📌 0
People Evaluate Agents Based on the Algorithms That Drive Their Behavior AbstractWhen people see an agent perform a task, do they care if the underlying algorithm driving it is ‘intelligent’ or not? More generally, when people intuitively evaluate the performance of others, do they value external performance metrics (intuitive behaviorism) or do they also take into account the underlying algorithm driving the agent’s behavior (intuitive cognitivism)? We propose 3 dimensions for examining this distinction: Action Efficiency, Representation Efficiency, and Generalization. Across 3 tasks (N = 598), we showed people pairs of maze-solving agents, together with the programs driving the agents’ behavior. Participants were asked to pick the ‘better’ of the two programs, based on a single example of the two programs, evaluated on the same maze. Each pair of programs varied along one of our 3 proposed dimensions. Our framework predicts people’s choice of program across the tasks, and the results support the idea that people are intuitive cognitivists.
13.09.2025 03:32 — 👍 0    🔁 0    💬 0    📌 0

@openmindjournal is following 1 prominent accounts