Sangwhan Moon's Avatar

Sangwhan Moon

@sxm.bsky.social

Nothing to see here

54 Followers  |  34 Following  |  5 Posts  |  Joined: 03.07.2023  |  1.4051

Latest posts by sxm.bsky.social on Bluesky

Preview
Jamo-Level Subword Tokenization in Low-Resource Korean Machine Translation Junyoung Lee, Marco Cognetta, Sangwhan Moon, Naoaki Okazaki. Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025). 2025.

Looks like NAACL is up on ACL Anthology, and so too is our ( Junyoung Lee + @sxm.bsky.social + Naoaki Okazaki) paper on Jamo-level Subword Tokenization for Korean Machine Translation (from LoResMT).

#tokenization #korean #nlp

28.04.2025 23:29 โ€” ๐Ÿ‘ 3    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Vivaldi Mail really is intolerant to fat fingers - wonder if I should file a feature request for a manual "send all in draft" button...

09.07.2023 11:17 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Oh yes, you do. I should have read more carefully. :-)

09.07.2023 07:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

There is also the negger variant of that pattern, "X, Y, and Z has this - and if you don't implement it we have no choice but to switch".

09.07.2023 04:48 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Been sifting through common crawl a bit due to a side project. Fresh reminder that the internet is full of spam and garbage...

I'm fairly confident I can get any model that has seen common crawl to write online casino and viagra spam in Korean

07.07.2023 16:47 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Starting to post here as Threads made a conscious decision to not have a web client. Let's see which one sticks. (I like the concept of Mastodon, but the clients frankly are sort of crap.)

07.07.2023 15:04 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@sxm is following 20 prominent accounts