Gennady Gorin's Avatar

Gennady Gorin

@goringennady.bsky.social

πŸ¦ πŸ§¬πŸ“Šbioinformatics, statistics, and stochastic processes.

301 Followers  |  100 Following  |  351 Posts  |  Joined: 18.11.2024  |  2.0314

Latest posts by goringennady.bsky.social on Bluesky

Maybe it's a Meta issue, maybe it's about incentives, maybe it's not representative of the tech. But after about a hundred garbled Far Sides the excuses become tedious and unconvincing. Especially in such a simple and visible use case (cf. Google Search snippets)

30.10.2025 02:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

My information sources are that my Facebook feed was, for a period of three months, saturated with emetic Far Side upscales. Sometimes it's about direct experience rather than theory.

(Preserved at www.reddit.com/r/FarSideCom...)

30.10.2025 02:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

I know I posted this on Twitter ages ago, but I came across it again last night and thought some may not have seen it yet.
(From the Not the Nine O'Clock News spin-off book, 1980.)

19.10.2025 10:59 β€” πŸ‘ 42    πŸ” 7    πŸ’¬ 1    πŸ“Œ 0

Anyway, this has little to do with testing the training set, which is to say the conclusion that performance is far worse on datasets/packages that were not in the original velocity papers. This means the methods don't generalize.

28.10.2025 16:18 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

You could say that. I would put it a different wayβ€” that the embeddings and neighborhoods are remarkably unstable and arbitrary (see Figs. 6, 7, 10 in our paper). By the way, did you happen to check calibration of the p-value distribution?

28.10.2025 16:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I have found the candles

28.10.2025 01:45 β€” πŸ‘ 44    πŸ” 6    πŸ’¬ 2    πŸ“Œ 0

Programming skill is just a set of rules you've inferred via personal suffering

27.10.2025 22:36 β€” πŸ‘ 44    πŸ” 5    πŸ’¬ 5    πŸ“Œ 1

To achieve good velocity embeddings, test on training data

27.10.2025 05:49 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

iCloud has a fun Easter egg where it deletes all your books from the local device at random intervals so you can't use Apple Books when you get on a plane

21.10.2025 06:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sure, I got that, and I completely agree. There are plenty of sources that make this exact point. This should be an encouragement to understand, characterize, and exploit the noise, rather than to elide it as ultimately deterministic, especially since it is in part controlled by the genome itself.

18.10.2025 22:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I would prefer to strongly push back on the assumption, because I do not see how it follows. And the end game could be either; statistical modeling helps both, and insisting on a deterministic formulation hurts both.

18.10.2025 21:45 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Those are the conceptual and practical issues. But there's no need to go that far. I have read enough papers that torture their data to fit deterministic models to realize this approach is harmful.

18.10.2025 21:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Not really! The goal is to analyze, and having a model likelihood, limited as it may be, helps more than having an intractable hairball of unknown and unknowable interactions. I also don't see why we ought to assume those processes are deterministic.

18.10.2025 21:03 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

I would say it is not only parsimonious, it is practically mandatory as a starting point to identify signatures of unmodeled factors. You have to have some kind of null hypothesis or working model.

18.10.2025 20:37 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Because a seed is an initial condition for a specific deterministic process, and I don't really see how it helps define stochastic processes, even by analogy.

18.10.2025 20:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

hard no.

18.10.2025 20:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yes, because as soon as variability is treated as random, we can use the usual suite of statistical and probabilistic tools to quantify and evaluate that assumption.

18.10.2025 20:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In a manner of speaking. Because it makes using probabilistic methods and defining ergodic limits needlessly hard or impossible.

18.10.2025 08:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

We could, but there are good reasons people do not.

18.10.2025 01:23 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I appreciate the aleatoric/epistemic uncertainty distinction as much as anyone. But to throw out biological noise is to throw out a century of really good tools for no benefit at all. (I also don't see why the quantum effects would disappear at the level of molecules.)

17.10.2025 23:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In my experience, treating universal and pervasive biochemical noise as somehow fake hurts more than it helps.

17.10.2025 23:38 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Intuitively yes, ranks are unstable, and one should hope that providing different data should yield different results! And yet I am not even sure about that (e.g. if the methods are ultimately relying on "uninteresting" robust signals, and throw out all other information)

17.10.2025 07:23 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

By "optimistic" I mean there may be a sensible way to identify, resolve, explain, and account for biases from different technologies, but this may be both impossible (because of data access) and futile (because every analysis is bespoke) for downstream pipelines

17.10.2025 07:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A question I have often wondered about but have not yet managed to formulate in a quantitative way. By what measure?
I think the pipeline question is far too optimistic because source studies use a truly staggering variety of pipelines, references, and hyperparameters

17.10.2025 06:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Congratulations! Any speculations on whether activation in males uses similar mechanisms?

16.10.2025 02:33 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

How do the people at Google and various other AI users feel about the page-specific search snippets being poorly AI-paraphrased, and what implications do they draw, or avoid drawing, from this fact

12.10.2025 03:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Preview
Hungarian writer LΓ‘szlΓ³ Krasznahorkai wins the Nobel Prize in literature Hungarian writer LΓ‘szlΓ³ Krasznahorkai has won the Nobel Prize in literature for what the Nobel committee called his compelling and visionary work.

Hungarian writer LΓ‘szlΓ³ Krasznahorkai, known for philosophical and bleakly funny novels, has won the Nobel Prize in literature.

09.10.2025 11:31 β€” πŸ‘ 134    πŸ” 32    πŸ’¬ 1    πŸ“Œ 2

Amazing! None of the answers are even in the same ballpark as the physics usage (neutral net+symbolic regression) or my usage (neutral net parametrizing some physical model)

09.10.2025 00:54 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

"We're going to create superintelligence" How about making outlook search work first. How about that

03.10.2025 17:18 β€” πŸ‘ 10299    πŸ” 1729    πŸ’¬ 103    πŸ“Œ 78

The optimistic point of view, then πŸ™ƒ

02.10.2025 12:57 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@goringennady is following 20 prominent accounts