Yonatan Bisk @ybisk.me - Bluesky Profile

We are getting closer to having agents operating in the real physical world. However, can we trust frontier models to make embodied decisions 🎮 aligned with human norms 👩‍⚖️ ?

With EgoNormia, a 1.8k ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging!

04.03.2025 04:32 — 👍 23 🔁 9 💬 1 📌 1

How many (checks calendar) decades do people keep around backups of data from their thesis? Am I a digital hoarder?

05.01.2025 05:00 — 👍 10 🔁 0 💬 4 📌 0

Recently, papers have been published in prestigious journals (Nature Human Behaviour, PNAS) claiming that large language models (e.g., ChatGPT) solve the "false belief" task (a task requiring Theory of Mind abilities).

What is the false belief task? ->

17.12.2024 08:36 — 👍 6 🔁 2 💬 1 📌 0

I think Bisky :)

25.11.2024 20:12 — 👍 2 🔁 0 💬 0 📌 0

When I first started reading papers in ~2007 we would wait for someone who attended the conference to bring back the conference booklet and tell us what to read. We’d read them. And then spend the next three months reading old papers or working cuz 🤷 It was a great way to grow up.

24.11.2024 14:52 — 👍 9 🔁 0 💬 0 📌 0

👋 Time is a weird thing :) maybe I should try and convince the department to force you to give a talk sometime — though ideally with @spandanagella.bsky.social too ;)

24.11.2024 02:30 — 👍 1 🔁 0 💬 1 📌 0

Just use your own domain name?

24.11.2024 02:13 — 👍 0 🔁 0 💬 1 📌 0

This article really spoke to me; all the science I've enjoyed and that I thought came out well has been done with a colleague that I was talking to every day and almost every couple of hours

17.11.2024 14:32 — 👍 42 🔁 3 💬 1 📌 0

Hello, computational linguistics/NLP world on Bluesky! We're creating the same accounts here that we have on other social media platforms! #NLProc

14.11.2024 00:17 — 👍 133 🔁 31 💬 4 📌 5

I am trying to create a robotics and ai starter pack on bluesky: go.bsky.app/DfAoaJ1

Very incomplete, so please comment with suggestions (or just if you're missing and want to be added!)

11.11.2024 15:01 — 👍 111 🔁 38 💬 77 📌 4

3. How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models
^^ includes @skgabrie.bsky.social who is just starting up her lab at UCLA!

10.11.2024 18:34 — 👍 4 🔁 0 💬 0 📌 0

2. Gradient Localization Improves Lifelong Pretraining of Language Models
TL;DR - Gradient norms tell you where your knowledge is stored and if it conflicts with what you already know.

10.11.2024 18:34 — 👍 2 🔁 0 💬 1 📌 0

#EMNLP2024

1. Tools Fail: Detecting Silent Errors in Faulty Tools

Are you using tools with your LLMs? Are you assuming your tools are perfect? Assuming the LLM can just handle any errors for you? 😬
Danger… 🚨 Models trust tools over their own “knowledge” even for simple and well trained cases.

10.11.2024 18:34 — 👍 19 🔁 0 💬 1 📌 0

TRI Women and Allies breakfast

A robot standoff. One with wheels and one with legs.

Su debugging her robot teleop system

Vidhi presenting her work on robot audio

Hi from CoRL 👋

08.11.2024 10:27 — 👍 15 🔁 2 💬 1 📌 0
