DAn Ellis's Avatar

DAn Ellis

@dpwe.bsky.social

Research Scientist at Google DeepMind: Enivironmental sound understanding

23 Followers  |  42 Following  |  1 Posts  |  Joined: 10.09.2025  |  1.3647

Latest posts by dpwe.bsky.social on Bluesky

Preview
Recomposer: Event-roll-guided generative audio editing Editing complex real-world sound scenes is difficult because individual sound sources overlap in time. Generative models can fill-in missing or corrupted details based on their strong prior understand...

🔊New paper! Recomposer allows editing sound events within complex scenes based on textual descriptions and event roll representations. And we discuss the details that matter!

Work by the Sound Understanding folks
@GoogleDeepMind

arxiv.org/abs/2509.05256

11.09.2025 19:38 — 👍 7    🔁 1    💬 1    📌 1

@dpwe is following 20 prominent accounts