@samy234.bsky.social

28 Followers  |  186 Following  |  35 Posts  |  Joined: 17.11.2024  |  1.9356

Latest posts by samy234.bsky.social on Bluesky

How potentially practical is the theoretical performance?

27.06.2025 16:23 | 👍 0    🔁 0    💬 0    📌 0
Jorja Smith - Blue Lights (Macca & Loz Contreras Bootleg)
YouTube video by Liquiform

Run when you hear the sirens coming ...
www.youtube.com/watch?v=sanH...

12.06.2025 23:18 | 👍 0    🔁 0    💬 0    📌 0

Why is it not negative? Do they think Trump will come to his senses and change his policy to prevent a recession?

29.04.2025 13:45 | 👍 2    🔁 0    💬 3    📌 0
Shared Content

A Plea for Ideological Laziness in an Age of Crisis

chatgpt.com/s/dr_680cd86...

26.04.2025 13:27 | 👍 1    🔁 0    💬 0    📌 0

I agree it's a hard problem and you need smarter people in power to tackle it.

05.04.2025 16:43 | 👍 0    🔁 0    💬 0    📌 0

How to finance the increased costs of manufacturing in the US? Where is the labour coming from, if you still have high employment? What about supply chains in a trade war with reciprocal tariffs?

05.04.2025 15:45 | 👍 0    🔁 0    💬 1    📌 0

Google Gemini 2.5 is the first public AI model to definitively beat the performance of human PhDs with access to Google on hard multiple choice problems inside their field of expertise (around 81%).

All AI tests are flawed, but GPQA Diamond has been a pretty good one.
& conducted independently.

03.04.2025 10:46 | 👍 72    🔁 13    💬 4    📌 2

Anyone else experiencing a sort of cognitive-emotional lag when working with AI? Like your emotional system isn't adapted to the productivity and switching so fast between such complex tasks? Leaving you with a feeling of being overwhelmed even though the tasks only take a few minutes?

03.04.2025 13:00 | 👍 1    🔁 0    💬 0    📌 0

What do companies get out of this process? I believe there are only 2 things you have to check:
1. Is the chemistry right with the team?
2. Is the person competent enough to solve a task and communicate the solution to you?

You don't need geniuses to work a corporate job.

31.03.2025 11:01 | 👍 0    🔁 0    💬 0    📌 0

And they're beating OpenAI in price and features.

26.03.2025 15:14 | 👍 1    🔁 0    💬 0    📌 0

DeepSeek documented the changes to some extent.

Source: api-docs.deepseek.com/updates

25.03.2025 03:04 | 👍 10    🔁 2    💬 0    📌 0

πŸ₯Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. πŸ₯‡

25.03.2025 17:25 | 👍 215    🔁 65    💬 34    📌 11
Gemini 2.5: Our most intelligent AI model
Gemini 2.5 is our most intelligent AI model, now with thinking.

Gemini 2.5 Pro is out, and you can test it for free right now (5 RPM).

blog.google/technology/g...

25.03.2025 19:10 | 👍 1    🔁 0    💬 0    📌 0

DeepSeek's new V3 model (0324) is a major update in performance and license. The MIT license will make it hugely impactful for research and open building. Many are confused, though, about whether it is a "reasoning" model. It is contrasted with their R1 model, which is a reasoning-only model.

25.03.2025 15:15 | 👍 19    🔁 2    💬 2    📌 0

Don't get high on your own supply, Tay Musk

23.03.2025 09:20 | 👍 0    🔁 0    💬 0    📌 0

Tencent Hunyuan-T1, their AI reasoning model. Powered by Hunyuan TurboS, it's built for speed, accuracy, and efficiency.

✅ Hybrid-Mamba-Transformer MoE Architecture – The first of its kind for ultra-large-scale reasoning
✅ Strong Logic & Concise Writing – Precise following of complex instructions

21.03.2025 16:35 | 👍 10    🔁 3    💬 1    📌 1

I think people have to distinguish between the unfounded promises of superhuman AGI and the actual usability revolution that is happening through LLMs. Now everyone can access complex computing through natural language; we haven't adjusted to this paradigm change yet.

21.03.2025 07:22 | 👍 3    🔁 0    💬 0    📌 0

It is! After testing the current Gemini 2.0 models on my RAG tasks, as well as the current top, pro, or preview models from several vendors, I'm not sure how those vendors will compete with Google in the long run given these prices.

21.03.2025 02:24 | 👍 0    🔁 0    💬 0    📌 0

Google's Gemini 2.0 models are underrated for RAG applications; first benchmarks look really promising.

20.03.2025 15:23 | 👍 0    🔁 0    💬 0    📌 0
French scientist denied US entry after phone messages critical of Trump found
France's research minister said the scientist was traveling to Houston for a conference when his phone was searched

Don't go to the free speech for white and orange nazis land if you have private messages critical of the regime on your phone.

www.theguardian.com/us-news/2025...

20.03.2025 07:05 | 👍 0    🔁 0    💬 0    📌 0

This feels a bit like the stock market during the 2008 financial crisis, but instead of waiting for political signals on which bank is being saved, you wait for the orange man to wake up at 15:30 CET and spew his crazy declarations on his "social" media platform.

18.03.2025 10:01 | 👍 1    🔁 0    💬 0    📌 0

A very exciting day for open-source AI! We're releasing our biggest open source model yet -- OLMo 2 32B -- and it beats the latest GPT 3.5, GPT 4o mini, and leading open weight models like Qwen and Mistral. As usual, all data, weights, code, etc. are available.

13.03.2025 18:16 | 👍 141    🔁 37    💬 5    📌 3

"too incompetent to realize that they're incompetent"

11.03.2025 13:20 | 👍 0    🔁 0    💬 0    📌 0

Total shitshow of incompetence.

You all know immediately who I mean.

10.03.2025 07:08 | 👍 0    🔁 0    💬 0    📌 0

In a shocking turn of events, many of the people who said they wanted "free speech" actually just wanted to be the ones in charge of which speech is free.

10.03.2025 02:44 | 👍 13030    🔁 2345    💬 244    📌 64
Ice arrests Palestinian activist who helped lead Columbia protests, lawyer says
Mahmoud Khalil's arrest comes as Trump vows to deport foreign students involved in protests against Israel's war

Free speech?

www.theguardian.com/us-news/2025...

09.03.2025 19:04 | 👍 0    🔁 0    💬 0    📌 0

Makes intuitive sense: how would the model know which tokens to pay attention to while embedding the needle in advance? "Noise" is therefore equally important during indexing. 4k-token embeddings would only make sense if the needle is so complex that it takes up 4k tokens.

08.03.2025 08:23 | 👍 1    🔁 0    💬 0    📌 0
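
The chunk-size argument in the 08.03.2025 post can be illustrated with a toy sketch. It assumes a chunk embedding is a plain mean over token vectors, that needle tokens point exactly in the query direction, and that all other tokens are random noise. These are simplifying assumptions for illustration only, not how any particular embedding model works, and `chunk_similarity` and its parameters are hypothetical names.

```python
import math
import random


def mean_pool(vectors):
    """Average a list of equal-length vectors component by component."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def chunk_similarity(chunk_tokens, needle_tokens=50, dim=64, seed=0):
    """Similarity between a query and a mean-pooled chunk that contains the needle.

    Assumption: needle tokens equal the query direction; the remaining
    tokens are Gaussian noise with roughly unit length.
    """
    rng = random.Random(seed)
    query = [1.0] + [0.0] * (dim - 1)
    needle = [query] * needle_tokens
    scale = 1.0 / math.sqrt(dim)
    noise = [
        [rng.gauss(0.0, scale) for _ in range(dim)]
        for _ in range(chunk_tokens - needle_tokens)
    ]
    return cosine(mean_pool(needle + noise), query)


# Same 50-token needle, embedded inside chunks of increasing size.
sims = {n: chunk_similarity(n) for n in (50, 500, 4000)}
for n, s in sims.items():
    print(f"{n:>5} tokens: cosine to query = {s:.3f}")
```

Under these assumptions, the similarity to the query falls as the chunk grows, because the pooled vector has to represent the surrounding noise as well as the needle.
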
Rudimental, Skepsis - Vex (Official Audio) ft. MIST, Popcaan
YouTube video by RudimentalVEVO

vibe www.youtube.com/watch?v=x8Va...

06.03.2025 09:22 | 👍 0    🔁 0    💬 0    📌 0
QwQ-32B: Embracing the Power of Reinforcement Learning
Scaling Reinforcement Learning (RL) has the potential to enhance model performance beyond conventional pretraining and post-training methods. Recent stud...

QwQ-32B is on par with R1:671B on math and coding tasks

They used two stages of RL: first math and coding, then general tasks.

Maybe this is the next coding model?

qwenlm.github.io/blog/qwq-32b/

05.03.2025 19:59 | 👍 11    🔁 2    💬 3    📌 1

Newsletter: Microsoft pulled back on over a gigawatt of planned data center capacity, suggesting that they do not think there is a growth future in generative AI. Meanwhile, SoftBank, the only company that can afford to fund OpenAI, has to take out loans to do so.
www.wheresyoured.at/power-cut/

03.03.2025 17:17 | 👍 2611    🔁 585    💬 50    📌 91

@samy234 is following 20 prominent accounts