Yoav Goldberg's Avatar

Yoav Goldberg

@yoavgo.bsky.social

6,401 Followers  |  930 Following  |  1,130 Posts  |  Joined: 27.04.2023  |  2.1907

Latest posts by yoavgo.bsky.social on Bluesky

wdym by "enable manipulation"? the metaphors i think are only "nice to have" are the "projections between spaces" kind of things.

04.07.2025 18:28 — 👍 0    🔁 0    💬 1    📌 0

also, there ARE benefits to taking algebra in undergrad. you learn stuff, and the notions of proofs and abstraction are important to acquire. it just not central to DL.

04.07.2025 18:26 — 👍 1    🔁 0    💬 0    📌 0

and there is also this:

bsky.app/profile/yoav...

04.07.2025 18:02 — 👍 0    🔁 0    💬 1    📌 0

currently it is pre-req also here. but ideally there will be removed and replaced with a more suitable class.

04.07.2025 18:01 — 👍 0    🔁 0    💬 1    📌 0

the algebraic terminology is here to stay unfortunately, and we should use it. it doesnt mean we DL is "built on" linear algebra, nor that a linear algebra class should be pre-req.

04.07.2025 17:54 — 👍 0    🔁 0    💬 1    📌 0

if you REALLY want to understand DL, you should start by honing your Category Theory skills, as almost everything in DL at its core can be mapped to a functor or an endofunctor.

04.07.2025 10:39 — 👍 2    🔁 0    💬 0    📌 0

OR you could do what people actually do in ML these days and associate each symbolic token with a random list of numbers, and let the optimization take care of it.

04.07.2025 10:23 — 👍 1    🔁 0    💬 0    📌 0

ah, likely. idk

04.07.2025 10:10 — 👍 1    🔁 0    💬 0    📌 0

("any" is a bit extreme because the purists will come and say "ohh but you use commutativity and associativity of addition! thats group theory!!". but i agree with you of course)

04.07.2025 09:52 — 👍 0    🔁 0    💬 0    📌 0

it seems to me that a large chunk of ML can actually be characterized as doing dim reduction numerically

04.07.2025 09:47 — 👍 0    🔁 0    💬 0    📌 0

ok, but where is PCA important as a building block in ML?

04.07.2025 09:45 — 👍 0    🔁 0    💬 2    📌 0

idk the sociology of it. tensors are popular because hardware/software support them, and its convenient to implement batches this way i guess.

04.07.2025 09:45 — 👍 1    🔁 0    💬 1    📌 0

where is it used that is central / important?

04.07.2025 09:37 — 👍 0    🔁 0    💬 1    📌 0

btw, re the hardware example: the hatdware is not built to perform "norm". it is built to approximate norm over a floating point representation of real numbers. this is highly specific and doesnt enjoy the generality of algebra at all. i think it may not even be commutative and associative.

04.07.2025 09:25 — 👍 0    🔁 0    💬 1    📌 0

bsky.app/profile/yoav...

04.07.2025 09:14 — 👍 4    🔁 0    💬 0    📌 0

taking it a step further, I'd say in many cases using the algebra jargon is harmful to understanding, and its better to just describe whats really going on. ie, "we add an L2 penalty term" --> want the sum of squares to be small. "project to vocab space" --> compute similarity to each vocab item.

04.07.2025 09:14 — 👍 6    🔁 0    💬 1    📌 2

(and the useful parts of the metaphor is also much more geometric than algebric, imo)

04.07.2025 07:52 — 👍 0    🔁 0    💬 1    📌 0

i think these are nice metaphors, and i use them daily. i am not at all convinced that they are essential, except for very few select places.

04.07.2025 07:50 — 👍 0    🔁 0    💬 1    📌 0

because when we use a term such as "a norm" we get the definition, which is nice, but also a bunch of properties that hold for items of this kind. and if we dont actually rely on these properties later on, then its a waste in terms of what we communicated.

04.07.2025 07:40 — 👍 0    🔁 0    💬 1    📌 0

so i would argue that efficiency of computation is nice but also somewhat accidental and not that important. but lets focus on efficiency of communication: my point is kinda exactly around this. in a field that is actually built around algebra, the communication efficiency would be MUCH higher.

04.07.2025 07:37 — 👍 0    🔁 0    💬 1    📌 0

טוב גם הטענה "משין לרנינג בנויה על תורת הקבוצות" היא הזויה בעיני. אנחנו כנראה לא מסכימים על ההגדרה של "בנויה על".

04.07.2025 07:33 — 👍 0    🔁 0    💬 0    📌 0

אם הטענה היא שאין כלום במתמטיקה בלי אלגברה אז אוקיי, אבל זה קצת טיעון ריק בעיני

04.07.2025 07:03 — 👍 0    🔁 0    💬 1    📌 0

איפה אנחנו משתמשים בקונספט הזה? (ואנחנו קצת דוחקים פה את ההגדרה של אלגברה לדעתי, כי אני גם יכול להגיד כך גם שכל אנליזה פונקציונאלית זה בעצם אלגברה, אבל בוא נזניח את זה לרגע)

04.07.2025 06:59 — 👍 0    🔁 0    💬 1    📌 0

תרחיב על לפרמל ייצוג וקירוב? למה זה מעניין אותנו ומה האספקטים האלגבריים שם?

04.07.2025 06:54 — 👍 0    🔁 0    💬 1    📌 0

what efficiency are we discussing here? efficiency of communication, or efficiency of computation?

04.07.2025 06:48 — 👍 0    🔁 0    💬 1    📌 0

סליחה התכוונתי '' "משין לרנינג על אופטימיזציה" זו טענה שאני בכיף ''

04.07.2025 06:47 — 👍 0    🔁 0    💬 1    📌 0

"משין לרנינג בנוי על אופטימיזציה" זו טענה שאני מקבל בכייף

04.07.2025 06:43 — 👍 1    🔁 0    💬 1    📌 0

i agree that the language is being used. but the language is pretty much the only part that is being meaningfully used.

04.07.2025 06:35 — 👍 0    🔁 0    💬 1    📌 0

linear algebra is also concerned with solving systems of linear equations, the representations of linear equations as matrices/vectors, and related objects like spans and bases.

04.07.2025 06:19 — 👍 1    🔁 0    💬 2    📌 0

for me, the "algebra" part is the realization that the real numbers and addition/multiplication over them are just a special case of a "group" or a "field", and that many other kinds of groups and fields exist, and can be manipulated similarly, and share many properties.

04.07.2025 06:19 — 👍 1    🔁 0    💬 1    📌 0

@yoavgo is following 20 prominent accounts