Rishu Kumar

@rishuk.bsky.social

39 Followers  |  93 Following  |  47 Posts  |  Joined: 26.09.2023  |  1.6631

Latest posts by rishuk.bsky.social on Bluesky

I love MT. :D

20.04.2025 19:45 - 👍 1    🔁 0    💬 0    📌 0

Must they torture me like this? :(

31.03.2025 10:48 - 👍 0    🔁 0    💬 0    📌 0

WAITER!!! WAITER!!!!
MORE CHAIN OF THOUGHT SURVEYS PLEASE!!!!

26.03.2025 03:57 - 👍 7    🔁 3    💬 0    📌 0

NOOOOOOOOOOOOO!!!

14.03.2025 10:08 - 👍 0    🔁 0    💬 0    📌 0
Screenshot of interaction with Gemini, converting my image to a random image.

Yes, of course that's totally me. 😭😭 Am I using something wrong?

14.03.2025 09:58 - 👍 0    🔁 0    💬 0    📌 1

Narrator: He definitely let that slide. :D

13.03.2025 16:07 - 👍 6    🔁 0    💬 0    📌 0

Petition to change "Human evaluation" to "vibe evaluation" in summarization.

13.03.2025 15:45 - 👍 1    🔁 0    💬 0    📌 0

Off By One xkcd.com/3062

13.03.2025 02:41 - 👍 7097    🔁 499    💬 108    📌 46

Each time I tried vibe-coding something for my nlp-related side quests, my respect for the companies and people shilling "Coding LLMs" dropped by 20%.

12.03.2025 14:04 - 👍 2    🔁 0    💬 0    📌 0
Rishu Kumar - Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics - Cited by 2,157 - Natural Language Processing - Linguistics

I will believe we have AGI when my Google Scholar profile gets fixed automatically and I am not shown as an Agricultural science researcher who has occasionally worked on NLP stuff. 😭
scholar.google.co.in/citations?us...

05.03.2025 22:14 - 👍 1    🔁 0    💬 0    📌 0

The Factory must grow

04.03.2025 22:44 - 👍 0    🔁 0    💬 0    📌 0

A few obvious questions: Is the upper limit of ~30% hallucination acceptable? And how much work does it add for a lawyer to ensure there are no hallucinations at all in the generated document?

(Haven't read the entire paper yet)

04.03.2025 20:44 - 👍 2    🔁 0    💬 0    📌 0

Honest answer: I don't read anything from the enormous stream of papers unless they are *critical* to my current work or I am deeply interested in the topic.

28.02.2025 16:51 - 👍 5    🔁 1    💬 0    📌 0

Come present your #NLP work in Leuven!

25.02.2025 07:49 - 👍 4    🔁 4    💬 0    📌 0

Okay, automated cross-posting has its quirks. This was the original tweet: x.com/OpenAI/statu...

19.02.2025 00:31 - 👍 0    🔁 0    💬 0    📌 0

Reminds me of this from @xkcd


Though the real question is: how long before everyone races to optimize for this "benchmark?"

19.02.2025 00:30 - 👍 0    🔁 0    💬 1    📌 0

WHY ARE THERE 5 DIFF ELECTRON VERSIONS ON MY PC? :/

17.02.2025 14:29 - 👍 0    🔁 0    💬 0    📌 0

Sometimes VSCodium feels so slow that I just want to go back to jupyterlab and vim.

17.02.2025 14:26 - 👍 0    🔁 0    💬 0    📌 0

Next year we will reach NeurIPS level at this rate.

17.02.2025 01:57 - 👍 2    🔁 0    💬 0    📌 0
Twitter screenshot of https://x.com/ChombaBupe/status/1891070696832278614 which reads:

Language models are still failing on trivial simple tasks, like following basic instructions, summarization, etc. Another screenshot of Ars Technica saying 'Over half of LLM-written news summaries have "significant issues" - BBC Analysis'

Honorary mention of arxiv.org/abs/2309.09558

17.02.2025 01:54 - 👍 0    🔁 0    💬 0    📌 0

If we start calling them anything else, it doesn't anthropomorphise the models enough. :)

15.02.2025 22:56 - 👍 1    🔁 0    💬 0    📌 0

I still think about my application for an internship in 2022 where I got through the first screening, forgot about it, got another email 6 months later in 2023, went ahead & did two(?) interviews, absolutely nailed them, and then got an email 6 months later saying the position was canned. :D

14.02.2025 12:12 - 👍 0    🔁 0    💬 0    📌 0
Amazon product name for Oral B Genius X electric toothbrush which has AI in it.

Does your brush even have AI, bro?

WHYYYYYYYYYY??

10.02.2025 10:51 - 👍 0    🔁 0    💬 0    📌 0

There must be a term for when someone takes your code and modifies the logic so unnecessarily and badly that even the second double espresso isn't enough to fix your mood.

07.02.2025 09:18 - 👍 0    🔁 0    💬 0    📌 0

I don't think any of the "foundation model" orgs will ever talk about contamination, because that makes scores look bad. At this point a lot of these models seem like clowns in a circus, only there to keep the public (investors?) at large excited about the circus that is "AI research".

28.01.2025 21:19 - 👍 1    🔁 0    💬 0    📌 0

This might be the reminder I needed to fix my webpage. Side-quest for the day acquired. :p

22.01.2025 08:44 - 👍 1    🔁 0    💬 0    📌 0
ACL 2025 Ling theory & Cognitive modeling track emergency reviewer volunteer form The Linguistic Theories, Cognitive Modeling, and Psycholinguistics track at ACL 2025 is looking for emergency reviewers. The emergency reviews will take place between 18th to 26th of March, 2025. Thes...

Repost appreciated! 🙏

ACL 2025 Ling theory & Cognitive modeling track is looking for emergency reviewers. The emergency review period is between 3/18-26, and these reviewers will be excluded from the ARR cycle. If you're interested, please sign up here! docs.google.com/forms/d/1fH7...

18.12.2024 15:37 - 👍 4    🔁 11    💬 0    📌 0

I had this experience in my thesis and a research project that I did. Maybe I should publish that project on arxiv.

17.12.2024 13:50 - 👍 1    🔁 0    💬 0    📌 0

Watching a small model train on a consumer-grade GPU is where the fun is at.

10.12.2024 13:47 - 👍 0    🔁 0    💬 0    📌 0

Didn't realise it's an imposter account. :D

08.12.2024 21:47 - 👍 0    🔁 0    💬 0    📌 0
