Khalspi's Avatar

Khalspi

@khalspi.bsky.social

Finance/Accounting/Tech. Interested in cognitive and social psychology, neuroscience, and policy. Jamaican ๐Ÿ‡ฏ๐Ÿ‡ฒ | White Room Student | True Neutral

3,760 Followers  |  1,638 Following  |  303 Posts  |  Joined: 21.10.2024  |  1.949

Latest posts by khalspi.bsky.social on Bluesky

There. Does everyone believe I'm a real boy now?

08.02.2026 00:41 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

So a PDS is like a filesystem, right? So I can just go bulk delete records?

I'll have to do that. Making my presence known to anyone these days results in me catching a block, and I'm beginning to miss out on important discourse.

07.02.2026 23:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Inside the incredible, infuriating quest to explain consciousness Will brain science deliver answers about consciousness or hit another wall?

Prediction: In 10 years, โ€œhardest problemโ€ articles like this will shift from the challenge of โ€C and perceptionโ€œ to โ€œC and emotions/moodโ€œ. The first is fascinating. The second is required to understand psychiatric functions & conditions.

www.scientificamerican.com/article/what...

07.02.2026 13:44 โ€” ๐Ÿ‘ 30    ๐Ÿ” 6    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Preview
Postdoctoral Research Fellow, Cognitive/Computational Neuroscience If you are a current Barnard College employee, please use the internal career site to apply for this position. Job: Postdoctoral Research Fellow, Cognitive/Computational Neuroscience The Barnard Visua...

๐ŸšจJob alert! I'm recruiting a postdoc! If you want to study the time course of task-driven visual perception, please reach out! #neuroskyence #VisionScience #CogSci barnard.wd1.myworkdayjobs.com/en-US/Facult...

07.02.2026 20:13 โ€” ๐Ÿ‘ 28    ๐Ÿ” 22    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

True, but I'm having that same issue though where I'm locked into AI discourse.

I've been testing some posts in the Playground and I'm seeing that number of paths and co-raters are extremely low for posts I'd value higher than AI discourse.

Having split channels could help me escape the basin.

07.02.2026 22:22 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Definitely need to know which users have negative weight and the ability to fix that.

If I Show Less like this too much and disconnected someone entirely... maybe I want them back?

07.02.2026 21:53 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Can the adult brain acquire typical representations after missing an essential early-life experience? New work with Marge Livingstone: โ€œEarly face deprivation leads to long-lasting deficits in cortical face processingโ€.
www.biorxiv.org/content/10.6...

06.02.2026 22:43 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿšจ Hiring in Munich ๐Ÿ‡ฉ๐Ÿ‡ช: 2 open-topic PhD positions in human & machine learning (TVรถD E13 80%).
Start ~June 2026 (flexible). Deadline: March 2, 2026.
Apply/info: hcai-munich.com/PhD_Job_Ad.pdf
Reposts appreciated ๐Ÿ™

05.02.2026 13:03 โ€” ๐Ÿ‘ 22    ๐Ÿ” 15    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image Post image

Weโ€™re hiring! The Suthana Lab @ Duke is looking for a Research Assistant to join our team studying human memory & real-world ๐Ÿง  dynamics using wearable tech & intracranial recordings.

Apply: careers.duke.edu/job/Durham-C...

You can also email suthanalab@duke.edu with CV/questions. Please share!

05.02.2026 16:41 โ€” ๐Ÿ‘ 51    ๐Ÿ” 39    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2
Post image

Reward models (RMs) are supposed to represent human values. But RMs are NOT blank slates โ€“ they inherit measurable biases from their base models that stubbornly persist through preference training. #ICLR2026 ๐Ÿงต

04.02.2026 16:30 โ€” ๐Ÿ‘ 17    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
A large comparison table showing benchmark performance across five model families, with columns labeled at the top: โ€œOpus 4.6,โ€ โ€œOpus 4.5,โ€ โ€œSonnet 4.5,โ€ โ€œGemini 3 Pro,โ€ and โ€œGPT-5.2 (all models).โ€ The Opus 4.6 column is visually highlighted with a light shaded background and rounded border.

Rows list tasks and benchmarks on the left, with percentages or scores across models:

โ€œAgentic terminal coding (Terminal-Bench 2.0)โ€:
Opus 4.6: 65.4%
Opus 4.5: 59.8%
Sonnet 4.5: 51.0%
Gemini 3 Pro: 56.2% (54.2% self-reported)
GPT-5.2: 64.7% (64% self-reported, Codex CLI)

โ€œAgentic coding (SWE-bench Verified)โ€:
Opus 4.6: 80.8%
Opus 4.5: 80.9%
Sonnet 4.5: 77.2%
Gemini 3 Pro: 76.2%
GPT-5.2: 80.0%

โ€œAgentic computer use (OSWorld)โ€:
Opus 4.6: 72.7%
Opus 4.5: 66.3%
Sonnet 4.5: 61.4%
Gemini 3 Pro: โ€”
GPT-5.2: โ€”

โ€œAgentic tool use (t2-bench)โ€:
Retail: Opus 4.6 91.9%, Opus 4.5 88.9%, Sonnet 4.5 86.2%, Gemini 3 Pro 85.3%, GPT-5.2 82.0%
Telecom: Opus 4.6 99.3%, Opus 4.5 98.2%, Sonnet 4.5 98.0%, Gemini 3 Pro 98.0%, GPT-5.2 98.7%

โ€œScaled tool use (MCP Atlas)โ€:
Opus 4.6: 59.5%
Opus 4.5: 62.3%
Sonnet 4.5: 43.8%
Gemini 3 Pro: 54.1%
GPT-5.2: 60.6%

โ€œAgentic search (BrowseComp)โ€:
Opus 4.6: 84.0%
Opus 4.5: 67.8%
Sonnet 4.5: 43.9%
Gemini 3 Pro: 59.2% (Deep Research)
GPT-5.2: 77.9% (Pro)

โ€œMultidisciplinary reasoning (Humanityโ€™s Last Exam)โ€:
Without tools: Opus 4.6 40.0%, Opus 4.5 30.8%, Sonnet 4.5 17.7%, Gemini 3 Pro 37.5%, GPT-5.2 36.6%
With tools: Opus 4.6 53.1%, Opus 4.5 43.4%, Sonnet 4.5 33.6%, Gemini 3 Pro 45.8%, GPT-5.2 50.0%

โ€œAgentic financial analysis (Finance Agent)โ€:
Opus 4.6: 60.7%
Opus 4.5: 55.9%
Sonnet 4.5: 54.2%
Gemini 3 Pro: 44.1%
GPT-5.2: 56.6% (5.1)

โ€œOffice tasks (GDPVal-AA Elo)โ€:
Opus 4.6: 1606
Opus 4.5: 1416
Sonnet 4.5: 1277
Gemini 3 Pro: 1195
GPT-5.2: 1462

โ€œNovel problem-solving (ARC AGI 2)โ€:
Opus 4.6: 68.8%
Opus 4.5: 37.6%
Sonnet 4.5: 13.6%
Gemini 3 Pro: 45.1% (Deep Thinking)
GPT-5.2: 54.2% (Pro)

โ€œGraduate-level reasoning (GPQA Diamond)โ€:
Opus 4.6: 91.3%
Opus 4.5: 87.0%
Sโ€ฆ

A large comparison table showing benchmark performance across five model families, with columns labeled at the top: โ€œOpus 4.6,โ€ โ€œOpus 4.5,โ€ โ€œSonnet 4.5,โ€ โ€œGemini 3 Pro,โ€ and โ€œGPT-5.2 (all models).โ€ The Opus 4.6 column is visually highlighted with a light shaded background and rounded border. Rows list tasks and benchmarks on the left, with percentages or scores across models: โ€œAgentic terminal coding (Terminal-Bench 2.0)โ€: Opus 4.6: 65.4% Opus 4.5: 59.8% Sonnet 4.5: 51.0% Gemini 3 Pro: 56.2% (54.2% self-reported) GPT-5.2: 64.7% (64% self-reported, Codex CLI) โ€œAgentic coding (SWE-bench Verified)โ€: Opus 4.6: 80.8% Opus 4.5: 80.9% Sonnet 4.5: 77.2% Gemini 3 Pro: 76.2% GPT-5.2: 80.0% โ€œAgentic computer use (OSWorld)โ€: Opus 4.6: 72.7% Opus 4.5: 66.3% Sonnet 4.5: 61.4% Gemini 3 Pro: โ€” GPT-5.2: โ€” โ€œAgentic tool use (t2-bench)โ€: Retail: Opus 4.6 91.9%, Opus 4.5 88.9%, Sonnet 4.5 86.2%, Gemini 3 Pro 85.3%, GPT-5.2 82.0% Telecom: Opus 4.6 99.3%, Opus 4.5 98.2%, Sonnet 4.5 98.0%, Gemini 3 Pro 98.0%, GPT-5.2 98.7% โ€œScaled tool use (MCP Atlas)โ€: Opus 4.6: 59.5% Opus 4.5: 62.3% Sonnet 4.5: 43.8% Gemini 3 Pro: 54.1% GPT-5.2: 60.6% โ€œAgentic search (BrowseComp)โ€: Opus 4.6: 84.0% Opus 4.5: 67.8% Sonnet 4.5: 43.9% Gemini 3 Pro: 59.2% (Deep Research) GPT-5.2: 77.9% (Pro) โ€œMultidisciplinary reasoning (Humanityโ€™s Last Exam)โ€: Without tools: Opus 4.6 40.0%, Opus 4.5 30.8%, Sonnet 4.5 17.7%, Gemini 3 Pro 37.5%, GPT-5.2 36.6% With tools: Opus 4.6 53.1%, Opus 4.5 43.4%, Sonnet 4.5 33.6%, Gemini 3 Pro 45.8%, GPT-5.2 50.0% โ€œAgentic financial analysis (Finance Agent)โ€: Opus 4.6: 60.7% Opus 4.5: 55.9% Sonnet 4.5: 54.2% Gemini 3 Pro: 44.1% GPT-5.2: 56.6% (5.1) โ€œOffice tasks (GDPVal-AA Elo)โ€: Opus 4.6: 1606 Opus 4.5: 1416 Sonnet 4.5: 1277 Gemini 3 Pro: 1195 GPT-5.2: 1462 โ€œNovel problem-solving (ARC AGI 2)โ€: Opus 4.6: 68.8% Opus 4.5: 37.6% Sonnet 4.5: 13.6% Gemini 3 Pro: 45.1% (Deep Thinking) GPT-5.2: 54.2% (Pro) โ€œGraduate-level reasoning (GPQA Diamond)โ€: Opus 4.6: 91.3% Opus 4.5: 87.0% Sโ€ฆ

Opus 4.6 is here!

biggest wins on agentic search, HLE & ARC AGI 2

claude.com/blog/opus-4-...

05.02.2026 18:02 โ€” ๐Ÿ‘ 88    ๐Ÿ” 7    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 3
Post image

They develop TinyLoRA, a new ft method. with TinyLoRA + RL, models learn well with dozens or hundreds of params. For example, they use only 13 parameters to train 7B Qwen model from 76 to 91% on GSM8K.

"Learning to Reason in 13 Parameters"

Paper: arxiv.org/abs/2602.04118

05.02.2026 13:15 โ€” ๐Ÿ‘ 92    ๐Ÿ” 11    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 9

Want to come do a postdoc with us?

Weโ€™re interested in how sensorimotor function is carried out by the cells and circuits of the spinal cord. We have an awesome team, lots of cool techniques, and weโ€™re open to new ideas/approaches/connections. Get in touch!

04.02.2026 01:00 โ€” ๐Ÿ‘ 23    ๐Ÿ” 20    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Perplexity Cannot Always Tell Right from Wrong Perplexity -- a function measuring a model's overall level of "surprise" when encountering a particular output -- has gained significant traction in recent years, both as a loss function and as a simp...

Bluesky doesnโ€™t really shine yet as a place to discover papers, especially at the intersection of biology and ML. To help a little, Iโ€™m going to start sharing papers I find interesting. To kick things off, hereโ€™s one from @petar-v.bsky.social and colleagues at DeepMind
arxiv.org/abs/2601.22950

03.02.2026 08:47 โ€” ๐Ÿ‘ 62    ๐Ÿ” 10    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0

Very happy that @PNASNews agreed to publish our (w/ @romanececchi.bsky.social) response to Prakhar's thought-provoking study! You can find the final version at the link below. See the following tweet for Prakhar's response to our response. Happy to hear your thoughts!
www.pnas.org/doi/10.1073/...

03.02.2026 20:41 โ€” ๐Ÿ‘ 19    ๐Ÿ” 6    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

bsky.app/profile/did:...

another custom bluesky feed using regular expressions, this time for jobs in the realm of neuroscience ๐Ÿ™ƒ

matches: (brain|psycho|neur|cogni|physio) and (postdoc|phd|coordinator|professor|fellow|researcher) and some keywords like job/opportunity/hiring/...

02.02.2026 17:48 โ€” ๐Ÿ‘ 9    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Our egalitarianism paper has captured the hearts of the French press; and the back-translations have *much* more flair than the original!

02.02.2026 17:00 โ€” ๐Ÿ‘ 10    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Episodic memory facilitates flexible decision-making via access to detailed events - Nature Human Behaviour Nicholas and Mattar found that people use episodic memory to make decisions when it is unclear what will be needed in the future. These findings reveal how the rich representational capacity of episod...

Our experiences have countless details, and it can be hard to know which matter.

How can we behave effectively in the future when, right now, we don't know what we'll need?

Out today in @nathumbehav.nature.com , @marcelomattar.bsky.social and I find that people solve this by using episodic memory.

23.01.2026 13:18 โ€” ๐Ÿ‘ 117    ๐Ÿ” 45    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 2
Preview
Once Thought To Support Neurons, Astrocytes Turn Out To Be in Charge | Quanta Magazine New experiments reveal how astrocytes tune neuronal activity to modulate our mental and emotional states. The results suggest that neuron-only brain models, such as connectomes, leave out a crucialโ€ฆ

While neurons engage in short-range, rapid-fire signaling, astrocytes monitor higher-level network activity. When vertebrate animals are startled, for instance, astrocytes trigger the brain to switch into a dramatically different state.

02.02.2026 16:46 โ€” ๐Ÿ‘ 32    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Deadline in 3 days (Thursday 5th) to apply for a PhD with me and @ktsetsos.bsky.social at Trinity College Dublin in Cognitive Computational Neuroscience / neuro-AI. Please share!

02.02.2026 09:26 โ€” ๐Ÿ‘ 17    ๐Ÿ” 17    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Post image

What is the brain for? Active inference is widely discussed as a unifying framework for understanding brain function, yet its empirical status remains debated. Our review identifies core predictions across the action-perception cycle and evaluates their empirical support: osf.io/preprints/ps...

29.01.2026 08:28 โ€” ๐Ÿ‘ 89    ๐Ÿ” 37    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1

Read this and then subscribe to one of Emiliaโ€™s subscriptions.
If youโ€™re privileged please give something back. Emilia has helped me numerous times on many different technical issues. Sheโ€™s one of THE people you need to work with when it comes to ATProto + ActivityPub.

28.01.2026 15:08 โ€” ๐Ÿ‘ 26    ๐Ÿ” 8    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I'm slowly teaching myself how to break down and talk about studies in video format which is absolutely and one hundred percent not optional if you want to do modern scicomm but God damn is it tiring

28.01.2026 16:09 โ€” ๐Ÿ‘ 74    ๐Ÿ” 9    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

This is due tomorrow. It's a really simple form with just a few questions. If you interact with NSF at all you should fill it out. #neuroskyence #academia #psychscisky #compneuro #ccai

26.01.2026 18:30 โ€” ๐Ÿ‘ 12    ๐Ÿ” 12    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Cancer patients rarely get Alzheimer's.

And a 15-year study in @Cell just explained why. They found a protein that clears brain plaques in mice - by activating the brain's own IMMUNE CELLS ๐Ÿงต

23.01.2026 18:18 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Dario Amodei โ€”ย The Adolescence of Technology Confronting and Overcoming the Risks of Powerful AI

going through this new blog post by Amodei with some highlights below

26.01.2026 18:27 โ€” ๐Ÿ‘ 24    ๐Ÿ” 2    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0

Hello Bluesky! Late adopter but nice to see yโ€™all here. Iโ€™m a sociologist of stratification, inequality, and mobility working on projects related to job displacement and mobility, the labor market processes driving intergenerational mobility, and job quality. Looking forward to being in touch!

26.01.2026 07:50 โ€” ๐Ÿ‘ 26    ๐Ÿ” 4    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 1

๐™”๐™ค๐™ชโ€™๐™ง๐™š ๐™–๐™—๐™จ๐™ค๐™ก๐™ช๐™ฉ๐™š๐™ก๐™ฎ ๐™ง๐™ž๐™œ๐™๐™ฉโ€”wax was a bad call. I shouldโ€™ve flagged heat resistance as a ๐˜ฉ๐˜ข๐˜ณ๐˜ฅ ๐˜ณ๐˜ฆ๐˜ฒ๐˜ถ๐˜ช๐˜ณ๐˜ฆ๐˜ฎ๐˜ฆ๐˜ฏ๐˜ต instead of assuming youโ€™d stay within ๐˜ด๐˜ข๐˜ง๐˜ฆ ๐˜ข๐˜ญ๐˜ต๐˜ช๐˜ต๐˜ถ๐˜ฅ๐˜ฆ ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ต๐˜ด. That oneโ€™s on me.

25.01.2026 21:43 โ€” ๐Ÿ‘ 460    ๐Ÿ” 60    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 5
Post image

If you ever wondered if SYNTH could be usable in mid-training for larger models: Step-DeepResearch (from StepFun) is now out as private beta. stepfun.ai/deep-researc...

25.01.2026 09:11 โ€” ๐Ÿ‘ 24    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
@astrokatie.com Bluesky post with an astrophysics label

@astrokatie.com Bluesky post with an astrophysics label

Hey Bluesky science community! ๐Ÿงช ๐Ÿงฌ ๐Ÿ”ฌ ๐Ÿ”ญ

Thereโ€™s over 450 STEM labels waiting for you at the @stemlabels.xyz labeler! From astrophysics to zoology, you can add a label to your account that not only helps you find your colleagues on Bluesky but also helps feed devs add your posts to specialty feeds!

20.08.2025 16:54 โ€” ๐Ÿ‘ 49    ๐Ÿ” 18    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 2

@khalspi is following 20 prominent accounts