We've heard you! Time after ICASSP is feeling tight for many, and thanks to a very strong reviewer pool, we can reduce the review load and shorten the review period.
We are thus happy to announce a 1 week extension๐ค
New #WASPAA2025 deadlines:
April 30: First submission
May 7: Final submission
19.04.2025 20:12 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 1
Titans for Titans when @urinieto.bsky.social ๐คฃ
14.01.2025 18:11 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
new paper! ๐ฃ๏ธSketch2Sound๐ฅ
Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.
paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound
12.12.2024 14:43 โ ๐ 23 ๐ 9 ๐ฌ 2 ๐ 5
Diffusion Meets Flow Matching
Flow matching and diffusion models are two popular frameworks in generative modeling. Despite seeming similar, there is some confusion in the community about their exact connection. In this post, we a...
Blog post link: diffusionflow.github.io/
Despite seeming similar, there is some confusion in the community about the exact connection between the two frameworks. We aim to clear up the confusion by showing how to convert one framework to another, for both training and sampling.
02.12.2024 18:45 โ ๐ 37 ๐ 8 ๐ฌ 1 ๐ 0
We just created a Bluesky starter pack featuring people and groups working at the intersection of AI and music, covering both symbolic and audio approaches. Let us know if you'd like to be added or removed!
go.bsky.app/PBvFCxa
28.11.2024 03:20 โ ๐ 13 ๐ 3 ๐ฌ 2 ๐ 0
this is sick! would love to be added, as a controllable + accelerated diffusion fan (mostly for audio/music) ๐ธ
20.11.2024 23:11 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
This is awesome! Could I be added?
20.11.2024 23:07 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
#HCI Assistant Prof. @FIU @FIUSCIS | Prev: @ucsd_cse @DesignLabUCSD @MSFTResearch @AdobeResearch @S3DatCMU @cmuhcii @UniofNottingham
Research Leader @ Sony CSL Paris
Ph.D. student on generative models and domain adaptation for Earth observation ๐ฐ
Previously intern @SonyCSL, @Ircam, @Inria
๐ Personal website: https://lebellig.github.io/
Professor and Associate Dean for Research, School of Information Sciences, University of Illinois.
Digital Humanities-Music Information Retrieval-Library and Information Science
Sound effects, audio & video | PhD at @ISPAMM, @Sapienza | Former @C4DM, @QMUL
Senior AI Research Scientist @irisaudiotech | PhD in CS @SapienzaRoma | Former @CaFoscari, @SonyCSL, @Dolby and @c4dm
Musician and Music AI researcher @ MALer Lab, Sogang Univ. / base0
Postdoc researcher @telecomparis. Previously @CNRS/LS2N @c4dm. Machine learning for audio. https://changhongw.github.io/
PhD researcher in AI & Music at C4DM | QMUL. Previously: Sony CSL Paris, Sony Tokyo, AXD Imperial College, Blackstar Amps
NLP PhD student @ UCSD | NSF CS grad fellow
recommender systems, retrieval, ML ๋ง์ง
ashleyshin.org her/she
CS PhD student at UT Austin in #NLP
Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models!
Other interests: math, philosophy, cinema
https://www.juandiego-rodriguez.com/
PhD candidate at Carnegie Mellon University
Senior Applied Scientist at Microsoft
๐ https://soham97.github.io
๐ https://github.com/soham97
๐ https://scholar.google.com/citations?user=MasiEogAAAAJ&hl=en
AI, sociotechnical systems, social purpose. Research director at Google DeepMind. Cofounder and Chair at Deep Learning Indaba. FAccT2025 co-program chair. shakirm.com
MS student @ Music and Audio Computing lab, KAIST. Controllable Audio Gen, Multimodal (audio, visual, text)
https://jnwnlee.github.io
1st-year PhD at UCSD | RL, NLP, HCI
Research: Agents๐ค, Reasoning๐ง , Games๐พ
Misc: Piano๐น, Composing๐ผ, Singing๐ค, Climbing๐งโโ๏ธ, (fiction) Writingโ๏ธ
๐คA CAPYBARA lover๐ค
getting a Music Tech PhD at NYU