Shaokai Ye's Avatar

Shaokai Ye

@shaokaiye.bsky.social

I am a 5th year PhD student from Mackenzie Mathis's lab at EPFL. I am on the job market and looking for positions that build multi-modal agentic systems that help understand the real / augmented world and that analyze self / others' behavior.

32 Followers  |  7 Following  |  1 Posts  |  Joined: 25.03.2025  |  1.3028

Latest posts by shaokaiye.bsky.social on Bluesky

LLaVAction: Video Action Recognition LLaVAction: evaluating and training multi-modal large language models for action recognition

โœจ Introducing a new #SOTA action recognition large multimodal language model: #LLaVAction!

By @shaokaiye.bsky.social Haozhe Qi, @trackingskills.bsky.social and me!

๐Ÿ“ arxiv.org/abs/2503.18712

๐Ÿค– mmathislab.github.io/llavaction/

1/n

25.03.2025 08:46 โ€” ๐Ÿ‘ 43    ๐Ÿ” 17    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Thanks so much!

25.03.2025 08:48 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@shaokaiye is following 7 prominent accounts