Prem Seetharaman

@pseeth.bsky.social

Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern. https://pseeth.github.io/

313 Followers  |  1,035 Following  |  8 Posts  |  Joined: 15.11.2024

Latest posts by pseeth.bsky.social on Bluesky

Fun model to work on! More fun stuff to come!

02.04.2025 18:40 - 👍 1    🔁 0    💬 0    📌 0

good story from someone completely unrelated to me i swear

21.12.2024 11:02 - 👍 3    🔁 0    💬 0    📌 0

👀

14.12.2024 14:23 - 👍 3    🔁 0    💬 0    📌 0

neat - i think all these spaces are basically a linear layer / permutation away from each other. with one codebook (or a vae setup) you could maybe just solve it with the embedding matrices directly, no audio needed

12.12.2024 22:42 - 👍 1    🔁 0    💬 0    📌 0

Great work from @hugofloresgarcia.bsky.social’s internship at Adobe - turn your voice into basically anything!

12.12.2024 16:28 - 👍 3    🔁 1    💬 0    📌 0

Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🎉

⌨️ Make a typewriter sound like a piano 🎹
🐱 Make a cat meow like a lion roars! 🦁
⏱️ Perfectly time existing SFX 💥 to a video

Link to research in comments:
by Adobe Research

27.11.2024 05:11 - 👍 41    🔁 5    💬 2    📌 0

Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!

27.11.2024 03:00 - 👍 10    🔁 2    💬 0    📌 0

A nifty application of depth estimation, creating a mockup of a digital design on real-world objects: sniklaus.com/mockup

26.11.2024 18:44 - 👍 9    🔁 2    💬 0    📌 0

Here's one that seems to catch a bit more "thread-like" content, sorts by recency instead of likes, and drops arxiv bots: bsky.app/profile/psee.... Seems to work ok for now, and catches some non-ML threads too

24.11.2024 21:48 - 👍 1    🔁 0    💬 0    📌 0

Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...

24.11.2024 04:01 - 👍 7    🔁 2    💬 0    📌 1

For those of you who haven't yet, give scholar-inbox.com a try! It's a free personal paper recommender which helps you stay up-to-date by sending daily/weekly paper digests directly to your inbox. Your votes train your own classifier, and you can have a peek at its feature words. Here are mine!

24.11.2024 16:09 - 👍 17    🔁 6    💬 2    📌 2

I initiated a starter pack for Audio ML. Let me know if you'd like to be added/removed.
go.bsky.app/LGmct4z

18.11.2024 04:46 - 👍 67    🔁 22    💬 47    📌 1