When you need to tune for your domain, the parameters give you meaningful handles to turn. The interpretability is genuinely valuable."
arpitbhayani.me/blogs/bm25
@sungkim.bsky.social
A business analyst at heart who enjoys delving into AI, ML, data engineering, data science, data analytics, and modeling. My views are my own. You can also find me at threads: @sung.kim.mw
When you need to tune for your domain, the parameters give you meaningful handles to turn. The interpretability is genuinely valuable."
arpitbhayani.me/blogs/bm25
BM25 by Arpit Bhayani
"What makes BM25 worth understanding is not just that it works. It is that it works for knowable reasons. Every part of the formula has a clear interpretation. When a result is surprising, you can trace why.
A word of wisdom to live by - do not let your luxury possession possess you.
06.03.2026 06:16 β π 8 π 2 π¬ 2 π 0So true.
05.03.2026 23:34 β π 20 π 1 π¬ 1 π 1
My thoughts on gpt-5.4 high on Codex CLI
I have no idea if it is better than gpt-5.3-codex or even gpt-5.2, but it devours tokens like a competitive eater at a Las Vegas buffet.
Intel Panther Lake Die Shot
Why does it look like Impressionist painting? BSPDN.
FYI
05.03.2026 19:31 β π 5 π 1 π¬ 1 π 0
Speculative Speculative Decoding (SSD)
It's up to 2x faster than the strongest inference engines in the world, but you need H100 or better GPUs.
Paper: arxiv.org/abs/2603.03251
Repo: github.com/tanishqkumar...
PyTorch's FlexAttention also supports FlashAttention-4 backend.
PyTorch now auto-generates CuTeDSL score/mask modifications and JIT-instantiates FlashAttention-4 for your custom attention variant
The result: 1.2Γ to 3.2Γ speedups over Triton on compute-bound workloads.
pytorch.org/blog/flexatt...
- Paper: github.com/Dao-AILab/fl...
- Code: github.com/Dao-AILab/fl...
- Blogposts:
together.ai/blog/flashat...
tridao.me/blog/2026/fl...
research.colfax-intl.com/flashattenti...
FlashAttention-4
I hope it is not pain to work with. It changes the algorithm & pipeline so that softmax & SMEM bandwidth no longer dictate speed. Attn reaches ~1600 TFLOPs, pretty much at matmul speed!
You can always go to other platforms and browse through 100s or 1,000s of postings and view posts by the original authors.
05.03.2026 16:22 β π 0 π 0 π¬ 0 π 0
OpenAI's Symphony
A Linear Board for agents.
github.com/openai/symph...
Teaching LLMs to reason like Bayesians
By training models to mimic optimal probabilistic inference, they improved their ability to update their predictions and generalize across new domains.
research.google/blog/teachin...
No. You can safely assume that I havenβt tried most of the things I post about.
05.03.2026 05:41 β π 0 π 0 π¬ 1 π 0Question: Is ColBERT worth it? I am seeing like ~10X increase in latency and ~30X increase in storage, as compared to dense/sparse vectors.
05.03.2026 02:59 β π 2 π 0 π¬ 1 π 0I liked it when mkbhd said "If you are watching my video on MacBook Neo then a new MacBook Neo is not a computer for you".
05.03.2026 01:49 β π 4 π 0 π¬ 0 π 0What is gog?
05.03.2026 00:44 β π 0 π 0 π¬ 1 π 0
Google Workspace CLI
github.com/googleworksp...
- Coach and redirect agents quickly and effectively as they surface for help
- Conduct thorough code reviews of completed work
- Intervene manually when needed to ship well-tested features"
www.tolans.com/relay/why-th...
Why The Best AI Engineers Are Former Managers by Quinten Farmer
"We evaluate Agent Engineering Managers on their ability to:
- Break down ambiguous product problems into well-scoped tasks
delegate those tasks with appropriate milestones and planned checkpoints
Source: x.com/hacker_/stat...
04.03.2026 23:13 β π 0 π 0 π¬ 0 π 0Claude "the hacker"
04.03.2026 23:13 β π 3 π 0 π¬ 1 π 0Sure. For the record, Iβve been consistent in my views and very transparent about them.
04.03.2026 21:23 β π 1 π 0 π¬ 0 π 0
Heβs an opportunistic politician, but itβs still notable to hear a mainstream politician break from AIPAC and call Israel an βapartheid state.β
www.politico.com/news/2026/03...
Interesting developmentβ¦ I guess Alexandr Wang is on the way out. That was a bit quicker than I expected. I would have thought heβd be given at least a year of runway.
timesofindia.indiatimes.com/technology/t...
I would defer to the community.
04.03.2026 19:28 β π 1 π 0 π¬ 1 π 0
Full weights (16bit/4bit), code, technical report & training details β all free for the community.
github.com/Yuan-lab-LLM...
Inspur's Yuan Lab released Yuan 3.0 Ultra - their flagship multimodal MoE foundation model, built for stronger intelligence and unrivaled efficiency.
- Efficiency Redefined: 1010B total / 68.8B activated params
- Smarter, Not Longer Thinking
- Enterprise-Grade Agent Engine
FYI: I'm not affiliated with either Meta nor FFmpeg.
04.03.2026 18:55 β π 0 π 0 π¬ 0 π 0