Zachary Novack

@zacknovack.bsky.social

Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion

257 Followers  |  97 Following  |  4 Posts  |  Joined: 17.11.2024  |  1.472

Latest posts by zacknovack.bsky.social on Bluesky

We've heard you! Time after ICASSP is feeling tight for many, and thanks to a very strong reviewer pool, we can reduce the review load and shorten the review period.
We are thus happy to announce a 1 week extension 🤗
New #WASPAA2025 deadlines:
April 30: First submission
May 7: Final submission

19.04.2025 20:12 | 👍 1    🔁 1    💬 0    📌 1

Titans for Titans when @urinieto.bsky.social 🤣

14.01.2025 18:11 | 👍 2    🔁 0    💬 0    📌 0
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing The recent explosion of generative AI-Music systems has raised numerous concerns over data copyright, licensing music from musicians, and the conflict between open-source AI and large prestige compani...

Hyped that 3/3 papers w/the folks
@ucsd-musaic.bsky.social
are accepted at #ICASSP2025!

PDMX: Public Domain Symbolic Music arxiv.org/abs/2409.10831
CoLLAP: Long-Context CLAP (~5 min) arxiv.org/abs/2410.02271
FUTGA-MIR: long music understanding for MIR tasks (arxiv soon)

Next stop, India! 🇮🇳

20.12.2024 23:04 | 👍 6    🔁 1    💬 0    📌 0

new paper! 🗣️Sketch2Sound💥

Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.

paper: arxiv.org/abs/2412.08550
web: hugofloresgarcia.art/sketch2sound

12.12.2024 14:43 | 👍 23    🔁 9    💬 2    📌 5
Diffusion Meets Flow Matching Flow matching and diffusion models are two popular frameworks in generative modeling. Despite seeming similar, there is some confusion in the community about their exact connection. In this post, we a...

Blog post link: diffusionflow.github.io/

Despite seeming similar, there is some confusion in the community about the exact connection between the two frameworks. We aim to clear up the confusion by showing how to convert one framework to another, for both training and sampling.

02.12.2024 18:45 | 👍 37    🔁 8    💬 1    📌 0

We just created a Bluesky starter pack featuring people and groups working at the intersection of AI and music, covering both symbolic and audio approaches. Let us know if you'd like to be added or removed!

go.bsky.app/PBvFCxa

28.11.2024 03:20 | 👍 13    🔁 3    💬 2    📌 0

this is sick! would love to be added, as a controllable + accelerated diffusion fan (mostly for audio/music) 🎸

20.11.2024 23:11 | 👍 1    🔁 0    💬 0    📌 0

This is awesome! Could I be added?

20.11.2024 23:07 | 👍 1    🔁 0    💬 0    📌 0
