Vlad Larin's Avatar

Vlad Larin

@vlarin.bsky.social

PhD // decentralized AI/LLM researcher // co-founder, CTO at @fortytwonetwork

311 Followers  |  1,317 Following  |  7 Posts  |  Joined: 27.11.2024  |  1.7227

Latest posts by vlarin.bsky.social on Bluesky

Yup, really believe that this is a way

10.12.2024 22:40 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Without this compass, a well-crafted prompt could steer the AI into actions that look safe on the surface yet ultimately harmful. By having a continuous human empathy stream in the loop, the model starts to worry about these subtleties, sensing when something might undermine human well-being.

10.12.2024 18:47 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This dynamic input ensures that each next-generation step leans into compassion and alignment โ€” enabling the AI to reflect on the context rather than just effectively โ€œerase weightsโ€ for potentially harmful prompts.

10.12.2024 18:46 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

The mechanism? Live RLHF. Consider a reward model that continuously evaluates ongoing and a number of predicted tokens. By embedding its judgments back into the main modelโ€™s pipeline, we give the AI a running commentary of human intent.

10.12.2024 18:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Abstract human-to-AI interaction

Abstract human-to-AI interaction

Iโ€™m starting to think that reflective human empathy might be key to genuine AI alignment. Instead of just filtering harmful outputs post-hoc, what if we integrate direct human feedback into the modelโ€™s reasoning process?

10.12.2024 18:45 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Humans learn sequentially โ€” each experience shaping the next. AI learns in parallel, absorbing vast knowledge simultaneously. This fundamental difference creates entirely new forms of cognition. Could AIโ€™s parallel consciousness unveil truths about reality that transcend human perception?

01.12.2024 06:07 โ€” ๐Ÿ‘ 32    ๐Ÿ” 4    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 1

Hi #Bluesky Really interested to see how it goes, glad to see how much attention the idea of real content ownership and full customization gets ๐Ÿš€

01.12.2024 05:45 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@vlarin is following 19 prominent accounts