I love MT. :D
20.04.2025 19:45
@rishuk.bsky.social
Must they torture me like this? :(
31.03.2025 10:48
WAITER!!! WAITER!!!!
MORE CHAIN OF THOUGHT SURVEYS PLEASE!!!!
NOOOOOOOOOOOOO!!!
14.03.2025 10:08
Screenshot of interaction with Gemini, converting my image to a random image.
Yes, of course that's totally me. 😭😭 Am I using something wrong?
14.03.2025 09:58
Narrator: He definitely let that slide. :D
13.03.2025 16:07
Petition to change "Human evaluation" to "vibe evaluation" in summarization.
13.03.2025 15:45
Off By One xkcd.com/3062
13.03.2025 02:41
Each time I tried vibe-coding something for my NLP-related side quests, my respect for the companies and people shilling "Coding LLMs" dropped by 20%.
12.03.2025 14:04
I will believe we have AGI when my Google Scholar profile gets fixed automatically and I am not shown as an Agricultural science researcher who has occasionally worked on NLP stuff. 😭
scholar.google.co.in/citations?us...
The Factory must grow
04.03.2025 22:44
A few obvious questions: Is an upper limit of ~30% hallucination acceptable? And how much work does it add for a lawyer to ensure there are no hallucinations at all in the generated document?
(Haven't read the entire paper yet)
Honest answer: I don't read anything from the enormous stream of papers unless they are *critical* to my current work or I am deeply interested in the topic.
28.02.2025 16:51
Come present your #NLP work in Leuven!
25.02.2025 07:49
Okay, automated cross-posting has its quirks. This was the original tweet: x.com/OpenAI/statu...
19.02.2025 00:31
Reminds me of this from @xkcd
Though the real question is: how long before everyone races to optimize for this "benchmark?"
WHY ARE THERE 5 DIFF ELECTRON VERSIONS ON MY PC? :/
17.02.2025 14:29
Sometimes VSCodium feels so slow that I just want to go back to jupyterlab and vim.
17.02.2025 14:26
Next year we will reach NeurIPS level at this rate.
17.02.2025 01:57
Twitter screenshot of https://x.com/ChombaBupe/status/1891070696832278614 which reads: Language models are still failing on trivial simple tasks, like following basic instructions, summarization, etc. Another screenshot of Ars Technica saying 'Over half of LLM-written news summaries have "significant issues" - BBC Analysis'
Honorary mention of arxiv.org/abs/2309.09558
17.02.2025 01:54
If we start calling them anything else, it doesn't anthropomorphise the models enough. :)
15.02.2025 22:56
I still think about my application for an internship in 2022: I got through the first screening, forgot about it, got another email 6 months later in 2023, went ahead & did two(?) interviews, absolutely nailed them, and got an email 6 months later saying the position was canned. :D
14.02.2025 12:12
Amazon product name for Oral B Genius X electric toothbrush which has AI in it.
Does your brush even have AI, bro?
WHYYYYYYYYYY??
There must be a term for when someone takes your code and modifies the logic so unnecessarily and badly that even the second double espresso isn't enough to fix your mood.
07.02.2025 09:18
I don't think any of the "foundation model" orgs will ever talk about contamination, because that makes scores look bad. At this point a lot of these models seem like clowns in a circus, only there to keep the public (investors?) at large excited about the circus that is "AI research".
28.01.2025 21:19
This might be the reminder I needed to fix my webpage. Side-quest for the day acquired. :p
22.01.2025 08:44
Repost appreciated!
ACL 2025 Ling theory & Cognitive modeling track is looking for emergency reviewers. The emergency review period is between 3/18-26, and these reviewers will be excluded from the ARR cycle. If you're interested, please sign up here! docs.google.com/forms/d/1fH7...
I had this experience in my thesis and a research project that I did. Maybe I should publish that project on arxiv.
17.12.2024 13:50
Watching a small model train on a consumer-grade GPU is where the fun is at.
10.12.2024 13:47
Didn't realise it's an imposter account. :D
08.12.2024 21:47