DAn Ellis @dpwe - Bluesky Profile

Recomposer: Event-roll-guided generative audio editing Editing complex real-world sound scenes is difficult because individual sound sources overlap in time. Generative models can fill-in missing or corrupted details based on their strong prior understand...

🔊New paper! Recomposer allows editing sound events within complex scenes based on textual descriptions and event roll representations. And we discuss the details that matter!

Work by the Sound Understanding folks
@GoogleDeepMind

arxiv.org/abs/2509.05256

11.09.2025 19:38 — 👍 7 🔁 1 💬 1 📌 1

@dpwe is following 20 prominent accounts

Heiga Zen (全炳河)
@heigazen

Principal Scientist (Director) at Google DeepMind in Japan. 波瀬小⇒一志中⇒鈴鹿高専⇒名工大 (IBM T.J. Watson Research intern)⇒東芝欧州研究所⇒Google (Speech🇬🇧⇒Brain🇯🇵) ⇒Google DeepMind. 3rd generation Korean in Japan.

Vincent Lostanlen
@lostanlen

Scientist at CNRS. https://audio.ls2n.fr A science game to test your musical memory: https://tunetwins.app

Steve Renals
@srenals

Once was speech technologist - Water of Leith, Edinburgh - Born 320.23 ppm

Eric Fosler-Lussier
@ericfos

Professor/Admin @ Ohio State. All opinions expressed on this channel are my personal opinions and do not represent that of my employer.

arXiv Sound
@arxiv-sound

Automated posting of sound-related articles uploaded to arxiv.org (eess.AS + cs.SD) Source: https://github.com/dsuedholt/bsky-paperbot-sound/ Inspired by @paperposterbot.bsky.social and https://twitter.com/ArxivSound

Andrew Owens
@andrewowens

Associate professor @ Cornell Tech

Johanna Devaney
@jcdevaney

Canadian in NYC (she/her) teaching music and data analysis at Brooklyn College and the Graduate Center, CUNY. Co-Editor-in-Chief of Journal of New Music Research.

Carl Vondrick
@cvondrick

Professor at Columbia. Computer Vision and Machine Learning

Jesse Engel
@jesseengel

Guitarist, Researcher Google DeepMind. Opinions are my own.

Romain Serizel
@rserizel

Professor at Université de Lorraine/Loria/Mines Nancy. Doing research is speech and audio processing.

Emmanouil Benetos
@emmanouilb

Reader in Machine Listening, @qmuleecs.bsky.social Queen Mary University of London - research on AI for audio. Website: https://www.seresearch.qmul.ac.uk/cmai/people/ebenetos/

Oriol (Uri) Nieto
@urinieto

Researcher at Adobe Research. Machine learning on audio. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈 www.urinieto.com

Joan Serrà
@serrjoa

Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine. https://serrjoa.github.io/

Justin Salamon
@justinsalamon

Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him. www.justinsalamon.com

Salah Zaiem
@salahzaiem

Research Scientist at Google Deepmind working on audio/speech generation.

DCASE Challenge
@dcase-challenge

Challenge on Detection and Classification of Acoustic Scenes and Events. https://dcase.community/

Jordi Pons
@jordiponsdotme

Music and artificial intelligence. Researcher at Stability AI. Musician at BRNRT Collective. Previously at Dolby and Universitat Pompeu Fabra. artintech.substack.com www.jordipons.me

Matthias Mauch
@matthiasmauch

I lead music ML research for Music. Flexitalian.

Peyman Milanfar
@docmilanfar

Distinguished Scientist at Google. Computational Imaging, Machine Learning, and Vision. Posts are personal opinions. May change or disappear over time. http://milanfar.org

Shinji Watanabe
@shinjiw

I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.

Latest posts by dpwe.bsky.social on Bluesky

@dpwe is following 20 prominent accounts