Maria Gorinova

@mgorinova.bsky.social

Shaping the future of programming @tessl.io 🚀 | ex-@TwitterCortex @Birdwatch 💙 | PhD in probabilistic machine learning, loyal servant to a cat, collector of random variables, and lover of well-placed puns. https://mgorinova.github.io/

2,155 Followers  |  440 Following  |  135 Posts  |  Joined: 01.07.2023

Posts by Maria Gorinova (@mgorinova.bsky.social)

The Population Test Your Agent Must Pass | Maria Gorinova
YouTube video by AI Native Dev

Learn more about our work on abstraction adherence of coding agents! www.youtube.com/watch?v=nx5_...

#coding #aicoding #agenticai #codegen #devtools #cursor #claudecode

16.12.2025 13:52 — 👍 3    🔁 0    💬 0    📌 0

Had such a great chat with @sjmaple.bsky.social about abstraction adherence of coding agents, as well as what matters when it comes to evaluation. Check it out in the new @ainativedev.io podcast episode!

YouTube: ainativedev.co/ble
Apple Podcasts: ainativedev.co/exx
Spotify: ainativedev.co/2mn

09.12.2025 15:47 — 👍 1    🔁 0    💬 0    📌 0

At the risk of starting the flame war to end all flame wars...

Modern LLMs (GPT-5.1, Claude 4.5, Gemini 3) produce excellent code and can be a significant productivity boost to software engineers who take the time to learn how to effectively apply them - especially if used with coding agent tools

27.11.2025 19:55 — 👍 773    🔁 76    💬 90    📌 55
AI Lovable Hackathon London: Community Tools Edition · Luma
🧠 AI Hackathon: Community Tools Edition. Hosted by Led by Community. Powered by Lovable & Tessl. Join us for a hands-on hackathon to build AI-powered tools that…

Build AI tools for community growth at our hackathon with Lovable & Led By Community

One day to collaborate with developers, designers, and community builders. Focus on real problems: onboarding, engagement, moderation, and events.

📅 Dec 3rd, 9 AM-5 PM
📍 Tessl HQ, London

luma.com/ai-hackathon...

26.11.2025 12:03 — 👍 6    🔁 3    💬 1    📌 0
A Proposed Evaluation Framework for Coding Agents: Specs Enhance Proper Use of Public APIs by ~35%
This article proposes an evaluation framework highlighting how specifications enhance coding agents' effective use of public APIs.

Super excited to share what we've been doing at @tessl.io to improve the quality of code generated by AI agents! 🤖

We introduce a new way to measure abstraction adherence and show how Tessl's usage specs significantly boost it.

Check out the full article!

tessl.io/blog/propose...

20.11.2025 15:18 — 👍 5    🔁 0    💬 0    📌 0

It's funny to me how British digital newspapers are crying out loud what an invasion of privacy a state-issued digital ID is, but force me to accept tracking cookies or pay a fee.

26.09.2025 13:34 — 👍 4    🔁 0    💬 0    📌 0
Alt: a raccoon laying on a bed with a bowl of blueberries in its mouth; labelled "bluwbewwy"

"I really need a tool that helps me count the number of letters in 'blueberry'. It will change my life."

... said no one ever.

10.08.2025 08:38 — 👍 2    🔁 0    💬 0    📌 0
Alt: a cat with a pirate hat with a skull and crossbones and a sword on it

03.08.2025 14:22 — 👍 1    🔁 0    💬 0    📌 0

I stole this for the linkedin post 😂 Gratitude 🫡

03.08.2025 13:42 — 👍 1    🔁 0    💬 1    📌 0

Aaaaaaa! I'm regretting my choices 🥲

03.08.2025 09:06 — 👍 2    🔁 0    💬 0    📌 0

I actually like the em dash. Am I an LLM? 🤔

03.08.2025 08:55 — 👍 18    🔁 0    💬 2    📌 2
A chatgpt generated take on the iconic kiss on the Berlin wall. But with Elon Musk kissing the Twitter blue bird. The text reads "My God, help me survive this tweet love"

A few years late but I was finally able to generate this

26.05.2025 17:46 — 👍 6    🔁 0    💬 0    📌 0
Alt: a man is holding a piece of bread over a woman's face and asking "what are you?"

What I imagine OpenAI feels like after reading that s1 paper

11.02.2025 00:17 — 👍 2    🔁 0    💬 0    📌 0

Wait... So s1 cost only $50 to fine-tune and beat o1-preview. And the secret sauce is... forcing the model to generate "Wait" instead of end-of-sequence???

Ahahhahahhaha this is so cool

11.02.2025 00:16 — 👍 2    🔁 0    💬 1    📌 0

🚨 MAJOR ALERT 🚨

The people wanted verification badges on Bluesky and @guan.dk and I have teamed up to bring you verification badges.

Behold! It is the Official Verified Labeller!
bsky.app/profile/veri...

26.11.2024 04:24 — 👍 2776    🔁 635    💬 242    📌 148

Strong characters sounds amazing + I think there is a lot to learn from that time if portrayed well. Thanks for the recommendation, I will give it a go!

25.11.2024 00:54 — 👍 2    🔁 0    💬 0    📌 0

Hi John,

That’s what all the bots say 😳😂

Not really my area of expertise, I know of the very obvious ones like input sanitation, regular finetuning on problematic examples, etc. The folks at lakera.ai have been on it for some time and regularly host hacking challenges, which I enjoy following

25.11.2024 00:51 — 👍 2    🔁 0    💬 2    📌 0

I love Silo and For All Mankind. Maybe I should try Halt & Catch Fire next!

25.11.2024 00:40 — 👍 1    🔁 0    💬 1    📌 0

Thank you! Indeed, what a debacle 😂 🍿

24.11.2024 00:20 — 👍 1    🔁 0    💬 0    📌 0

What's the drama with Byzantine? Did someone raise it during reviewing in the past? Is there a public discussion? 🍿

23.11.2024 07:10 — 👍 1    🔁 0    💬 1    📌 0

Bluesky's firehose is a treasure trove of public data for researchers and developers, and it's completely free. Check out our developer docs: docs.bsky.app

23.11.2024 05:54 — 👍 7904    🔁 1526    💬 319    📌 166
Book outline

Over the past decade, embeddings — numerical representations of machine learning features used as input to deep learning models — have become a foundational data structure in industrial machine learning systems. TF-IDF, PCA, and one-hot encoding have always been key tools in machine learning systems as ways to compress and make sense of large amounts of textual data. However, traditional approaches were limited in the amount of context they could reason about with increasing amounts of data. As the volume, velocity, and variety of data captured by modern applications has exploded, creating approaches specifically tailored to scale has become increasingly important. Google’s Word2Vec paper made an important step in moving from simple statistical representations to semantic meaning of words. The subsequent rise of the Transformer architecture and transfer learning, as well as the latest surge in generative methods has enabled the growth of embeddings as a foundational machine learning data structure. This survey paper aims to provide a deep dive into what embeddings are, their history, and usage patterns in industry.

Cover image

Just realized BlueSky allows sharing valuable stuff cause it doesn't punish links. 🤩

Let's start with "What are embeddings" by @vickiboykis.com

The book is a great summary of embeddings, from history to modern approaches.

The best part: it's free.

Link: vickiboykis.com/what_are_emb...

22.11.2024 11:13 — 👍 653    🔁 101    💬 22    📌 6

💯 no fake science please 😬😬😬 This sort of accounts can also be used to build up a following base and then activate as part of an influence campaign

22.11.2024 09:09 — 👍 4    🔁 0    💬 1    📌 0

Personally, I like Quiet Posters: it surfaces posts I wouldn't have seen otherwise!

22.11.2024 01:32 — 👍 0    🔁 0    💬 0    📌 0

By the authority vested in me by the decentralised gods (💅🏻) I accept this proposition.

(💅🏻 Sarcasm. I possess no such authority. The decentralised gods would never grant it to me (💅🏻))

20.11.2024 09:09 — 👍 1    🔁 1    💬 1    📌 0

Maybe we need a special sarcasm tag here on bsky

20.11.2024 08:51 — 👍 1    🔁 0    💬 1    📌 0

My question was a joke!

I know what the word means 🤣

20.11.2024 08:50 — 👍 2    🔁 0    💬 2    📌 0

What do you mean "once"? 🤨

20.11.2024 08:32 — 👍 1    🔁 0    💬 1    📌 0