Fun model to work on! More fun stuff to come!
02.04.2025 18:40
@pseeth.bsky.social
Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern. https://pseeth.github.io/
good story from someone completely unrelated to me i swear
21.12.2024 11:02
neat - i think all these spaces are basically a linear layer / permutation away from each other. with one codebook (or a vae setup) you could maybe just solve it with the embedding matrices directly, no audio needed
12.12.2024 22:42
Great work from @hugofloresgarcia.bsky.social's internship at Adobe - turn your voice into basically anything!
12.12.2024 16:28
Introducing MultiFoley, a video-aware audio generation method with multimodal controls!
⌨️ Make a typewriter sound like a piano 🎹
🐱 Make a cat meow like a lion roars! 🦁
⏱️ Perfectly time existing SFX to a video
Link to research in comments:
by Adobe Research
Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!
27.11.2024 03:00
A nifty application of depth estimation, creating a mockup of a digital design on real-world objects: sniklaus.com/mockup
26.11.2024 18:44
Here's one that seems to catch a bit more "thread-like" content, sorts by recency instead of likes, and drops arxiv bots: bsky.app/profile/psee.... Seems to work ok for now, and catches some non-ML threads too
24.11.2024 21:48
Made a feed that tries to index paper threads only: bsky.app/profile/psee.... To get into the feed, make a post with "arxiv.org" in the post somewhere + don't be a bot. My tiny contribution to the recent migration! Built w/ @skyfeed.app. Planning on some paper threads of my own soon...
24.11.2024 04:01
For those of you who haven't yet, give scholar-inbox.com a try! It's a free personal paper recommender which helps you stay up-to-date by sending daily/weekly paper digests directly to your inbox. Your votes train your own classifier, and you can have a peek at its feature words. Here are mine!
24.11.2024 16:09
I initiated a starter pack for Audio ML. Let me know if you'd like to be added/removed.
go.bsky.app/LGmct4z