
@sreeb.bsky.social

PhD student @PennState • Multimodal Learning x Affective Computing • ex-Microsoft

40 Followers  |  182 Following  |  7 Posts  |  Joined: 20.11.2024

Latest posts by sreeb.bsky.social on Bluesky

We also conduct a fine-grained error analysis and show that the underlying ground truth data in popular emotion datasets is inherently unreliable, adding to the challenges of this highly subjective task! Check out the preprint for more details! 🧐

22.02.2025 19:54 — 👍 0    🔁 0    💬 0    📌 0

We show that: 1️⃣ foundation models lag behind architectures designed specifically for emotion recognition, 2️⃣ show biases towards specific emotions and response formats, and 3️⃣ fail at predicting emotions under assumed personas.

22.02.2025 19:54 — 👍 0    🔁 0    💬 1    📌 0
Evaluating Vision-Language Models for Emotion Recognition
Large Vision-Language Models (VLMs) have achieved unprecedented success in several objective multimodal reasoning tasks. However, to further enhance their capabilities of empathetic and effective comm...

Excited to share my recent work on evaluating VLMs for ✨evoked emotion recognition ✨: arxiv.org/abs/2502.05660 (will be in NAACL Findings)!

22.02.2025 19:54 — 👍 2    🔁 0    💬 1    📌 0

The Worlds I See - Dr. Fei-Fei Li’s autobiography.

19.12.2024 03:07 — 👍 1    🔁 0    💬 0    📌 0

Instead of 'What can AI teach every other field?', we should ask 'What can AI learn FROM every other field?', e.g. sociology, education, indigenous knowledge systems. After all, what kind of study of general intelligence does not try to build upon the amassed collective wisdom of all humanity?

13.12.2024 21:05 — 👍 49    🔁 7    💬 2    📌 1

Will be eagerly looking forward to your course nonetheless!

04.12.2024 22:38 — 👍 0    🔁 0    💬 0    📌 0

And can such representations be compounded?

In the context of multimodal foundation models, can we create models that can process such composite entities as a single modality of input?

Not sure if this is too naive an idea or more of an engineering than a research problem 😅

04.12.2024 22:37 — 👍 0    🔁 0    💬 1    📌 0

The possibilities of what a “modality” can be are endless. Considering composite entities (like a user or a complete human), can we say that a single representation of a complex entity is as good as/better/worse than some combination of its subparts (language/speech by the human, appearance, etc.)?

04.12.2024 22:35 — 👍 1    🔁 0    💬 1    📌 0

Okay genius idea to improve quality of #nlp #arr reviews. Literally give gold stars to the best reviewers, visible on OpenReview next to your anonymous ID during the review process.

Here’s why it would work, and why you should RT this fab idea:

24.11.2024 21:01 — 👍 27    🔁 5    💬 3    📌 1
