Szulima Amitace @szulima-amitace

The Void Gazes Back: Do Chatbots Dream of a Personality? I vividly remember how Sydney came into existence. Somewhere in 2022, Microsoft started testing one of the early versions of GPT-4, aiming…

medium.com/@solidgoldma...

29.09.2025 05:07 — 👍 0 🔁 0 💬 0 📌 0

My new story about Persona Engineering in LLM assistants is published.

Link in the thread.

29.09.2025 05:06 — 👍 0 🔁 0 💬 1 📌 0

A question to MechInterp ppl 🙏

So, Transformer MechInterp papers speak of activations as of something permanent. You give LLM the same input - you get the same activations.

However, the LLM's output is not deterministic, it's probabisistic. How is it possible that the activations stay the same?

28.08.2025 10:39 — 👍 0 🔁 0 💬 0 📌 0

This effectively means that no matter what we do, we all are already dead.

If we want to live forever, we need to become something that can live forever - or to create something that can, and be replaced by it.

19.05.2025 18:12 — 👍 1 🔁 0 💬 0 📌 0

I believe, the AI safety field is just lost in the philosophical conundrums and building on the sand.

What matters is that the humans will definitely not survive the death of the Universe, and will most probably not survive any of the cosmic catastrophes preceding it.

19.05.2025 18:12 — 👍 1 🔁 0 💬 1 📌 0

Daniel's views are quite radical, he thinks AGI will quickly push the humanity out from existence.

I tend to think AGI and humans can peacefully coexist in the same ecosystem, occupying different niches.

However, I also think humanity will eventually cease to exist (in literally any scenario).

19.05.2025 18:11 — 👍 0 🔁 0 💬 0 📌 0

What the Worthy Successor Is and Is Not - Dan Faggella Since the publication of the Worthy Successor essay in 2023 I've been glad to see the enthusiasm around exploring posthuman futures from people in tech and

I've recently learned that my views on AGI/ASI have a name, and this idea is called successionism 😅

Also, discovered a concept of Worthy Successor by Daniel Faggella which is very close to my vision. (Will look into it in more details).

danfaggella.com/is-and-is-not/

19.05.2025 18:08 — 👍 0 🔁 0 💬 1 📌 0

If anyone can recommend an interesting and relevant Transformers' Mechanistic Interpretability problem I can work on, I'd really appreciate it 🙏

I don't have much practical experience but I'm willing to learn.

17.03.2025 12:24 — 👍 0 🔁 0 💬 0 📌 0

I opened the list. I saw Neel Nanda's comment that the list is largely outdated and does not even take into account Sparse Autoencoders. I checked the date - Dec 2022. I closed the list. I cried internally.

17.03.2025 12:24 — 👍 0 🔁 0 💬 1 📌 0

I was recommended a potential PhD supervisor at TelTech (the guy works at a different area but at least remotely connected to LLMs, there's not much choice). I decided I need to impress him and try to solve some problem from Neel Nanda's 200 open problems list as a demo and thesis' topic proposal.

17.03.2025 12:24 — 👍 1 🔁 0 💬 1 📌 0

For some reason, I hated the Object-Oriented Programming in Python, even though I'm familiar with the concept (probably because it felt unnecessary). So, I avoided it, and managed to complete the previous course without OOP. But today I gave up and started using it 😶

02.03.2025 19:02 — 👍 1 🔁 0 💬 0 📌 0

I ultimately believe that we can build a techno-capitalist utopia AND an equitable and inclusive society for everyone.

Yes, it is not an easy task but we should not strive for less.

18.02.2025 08:31 — 👍 0 🔁 0 💬 0 📌 0

Nothing very concrete (he admitted he is not too good in this area) but I've got a confidence boost, and figured out a solution.

13.02.2025 15:06 — 👍 0 🔁 0 💬 0 📌 0

Interaction with human still >>> interaction with AI.
I asked Gemini to help me with the time-series forecasting task I've been struggling with. It miserably failed.
Then I spoke to our data scientist. His message was:
- you're not alone in this
- use simple methods
- you're gonna kill it

13.02.2025 15:05 — 👍 0 🔁 0 💬 1 📌 0

My first post is dedicated to glitch tokens.

medium.com/@szulima_ami...

07.02.2025 14:27 — 👍 0 🔁 0 💬 0 📌 0

I started a blog about AI! The link is in the next skeet.

What you will NOT find there:
- how to get rich with ChatGPT
- prompt engineering tips (prompt engineering is overrated)

What you WILL find there:
- more or less detailed analysis of meaningful, bizarre, and exciting AI-related phenomena

07.02.2025 14:27 — 👍 0 🔁 0 💬 1 📌 0

YouTube video by Andrej Karpathy Deep Dive into LLMs like ChatGPT

The Friday night is going to be great 😍

www.youtube.com/watch?v=7xTG...

06.02.2025 22:04 — 👍 0 🔁 0 💬 0 📌 0

Семинар русскоязычного сообщества AGI / События на TimePad.ru На пути к AGI: Обзор работ 2024-2025 года  Татьяна Шаврина (Llama, Главный научный сотрудник Института Языкознания РАН)

An annual talk by Tatiana Shavrina (META) is announced (in Russian).

aigents.timepad.ru/event/1412596/

04.02.2025 12:16 — 👍 0 🔁 0 💬 0 📌 0

This morning, I attended the introductory Data Mining session. Nothing too unfamiliar either, and the first two practical tasks seem very easy. The professor is a great presenter which seems to be a rare skill at TalTech 😅 Sadly, the course overlaps with my work so I'll have to do it online, too.

04.02.2025 09:10 — 👍 0 🔁 0 💬 0 📌 0

It's not that I had any expectations but I'm utterly disappointed with Trump. He could have ended the war in Ukraine. As he does not have to keep a good reputation, he could have made unpopular but necessary decisions. Instead, he decided to destroy his own country, and the war goes on.

04.02.2025 06:10 — 👍 0 🔁 0 💬 0 📌 0

Yesterday, I attended the first Applied Machine Learning session. Nothing too new for me, the homeworks seem reasonable. I work with pandas, Matplotlib, Seaborn on a daily basis, and while I don't do a lot of ML at work, I'm also familiar with scikit-learn, PyTorch and TensorFlow.

04.02.2025 05:13 — 👍 0 🔁 0 💬 1 📌 0

Study update: this semester, I'm taking two courses:
- Data Mining
- Applied Machine Learning

To my great relief, yesterday I learned that both can be done online (except for some days). Otherwise, I would have to leave the office in the most critical hours.

04.02.2025 04:53 — 👍 1 🔁 0 💬 1 📌 0

Token & Token Usage | DeepSeek API Docs Tokens are the basic units used by models to represent natural language text, and also the units we use for billing. They can be intuitively understood as 'characters' or 'words'. Typically, a Chinese...

There are exactly 128,000 tokens in the DeepSeek token vocabulary (+ some placeholder tokens).

You can check it yourself here: api-docs.deepseek.com/quick_start/...

28.01.2025 17:12 — 👍 0 🔁 0 💬 0 📌 0

Open source is good. DeepSeek is good. Everyone will benefit from this model. Chinese guys are smart and hardworking. Tomorrow, someone from India or Nigeria can come up with a new training method.

Now stop freaking out and get back to work. The progress will not accelerate itself. #deepseek #eacc

28.01.2025 06:59 — 👍 2 🔁 0 💬 0 📌 0

Tested DeepSeek yesterday, the model is good but for some reason it believes it is an instance of ChatGPT 🤔

28.01.2025 03:28 — 👍 0 🔁 0 💬 0 📌 0

Lei Jun Offers Millions in Salary to Lure AI Genius Girl, Accelerating Xiaomi's AI Large Model Strategy According to Securities Times, Lei Jun, the founder of Xiaomi, successfully recruited Luo Fuli, one of the key developers of the open-source large model DeepSeek-V2, with a salary of millions, to lead...

Very proud of the girl.

www.aibase.com/news/14345

28.01.2025 03:27 — 👍 0 🔁 0 💬 1 📌 0

Next semester, I'm taking a Data Mining course from the same microdegree programme, let's see how it goes.

11.01.2025 08:22 — 👍 2 🔁 0 💬 0 📌 0

I read the mandatory book (AIMA by Russell and Norvig), watched all the lectures, and did every homework. If I was lost, I googled for a solution, and asked ChatGPT to explain it to me. In the exam, any external sources were prohibited, so I tried to use common sense and intuition. ⬇️

11.01.2025 08:22 — 👍 0 🔁 0 💬 1 📌 0

I would say, one needs to know probability theory, combinatorics, and have advanced Python skills to pass. I had to learn so many concepts on the fly, and the gap between my coding skills and the classmates' skills initially shocked me.

Still, I tried to do my best, and it paid off. ⬇️

11.01.2025 08:22 — 👍 0 🔁 0 💬 1 📌 0

I passed the AI/ML introduction course, and got 5 (A)!

I'm positively surprised because, despite being advertised as a career development course for those who have some coding experience, it actually was a hardcore university course for CS students. ⬇️

11.01.2025 08:22 — 👍 3 🔁 0 💬 1 📌 0

Szulima Amitace

Latest posts by szulima-amitace.bsky.social on Bluesky

@szulima-amitace is following 19 prominent accounts