John Vinyard's Avatar

John Vinyard

@johnvinyard.bsky.social

Working on sparse, interpretable, and easy-to-manipulate models of audio (especially music). https://github.com/JohnVinyard

8 Followers  |  14 Following  |  16 Posts  |  Joined: 17.03.2025  |  2.1633

Latest posts by johnvinyard.bsky.social on Bluesky

Extracting Playable Instrument Models from Short Audio Examples

You can play with this example yourself here: blog.cochlea.xyz/resonancemod...

13.11.2025 21:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Here's me scraping unskillfully on a "violin".

13.11.2025 20:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

In this micro-experiment, I try to extract playable instrument models from a single audio recording. You can play with a handful of the models in your browser!

blog.cochlea.xyz/resonancemod...

12.11.2025 16:58 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Hand-Controlled WebAudio Instrument

You can play with it too: blog.cochlea.xyz/conv-instrum...

18.10.2025 16:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Text-to-music is amazing, but it sucks all the fun out for me, as a musician. I'm using machine learning to "extract" models that can be controlled in real-time. My goal is to give them all the subtlety and richness that you'd get from a real acoustic instrument.

18.10.2025 15:57 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

A fun little WebAudio + MediaPipe toy for October πŸ›Έ 🧟

blog.cochlea.xyz/theremin.html

16.10.2025 16:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

blog.cochlea.xyz/rnn-instrume...

07.09.2025 20:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Toward a Sparse and Interpretable Audio Codec Most widely-used modern audio codecs, such as Ogg Vorbis and MP3, as well as more recent "neural" codecs like Meta's Encodec or the Descript Audio Codec are based on block-coding; audio is divided int...

arxiv.org/abs/2505.05654

12.05.2025 11:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I have received an endorsement! Feel free to read the paper and provide feedback if you're so inclined!

08.05.2025 16:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Toward a Sparse Interpretable Audio Codec

Supplemental material with example audio reconstructions is here:

blog.cochlea.xyz/sparse-inter...

08.05.2025 15:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I wrote a paper summarizing the work I've been pursuing toward a sparse audio codec!

I'd like to post this to the CS -> Sound section of arxiv, if possible, so if anyone would be willing to "endorse" me, shoot me a message and I'll share the code!

blog.cochlea.xyz/toward-a-spa...

08.05.2025 15:16 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Post image

Here's the "dashboard" for a model that's training right now. Event times are represented bottom-left, and event vectors, which encode everything about the sound _except_ for time-of-occurrence, on the bottom-right.

27.03.2025 14:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Toward a Sparse Interpretable Audio Codec

To put it into context, an encoder model is trained to incrementally decompose a segment of audio into some number of event vectors and their times-of-occurrence. You can read more about the training process here:

blog.cochlea.xyz/sparse-inter...

27.03.2025 14:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

This play-able scatterplot (clicking on points plays a sound) maps many points from an audio event decoder onto a 2D plane using t-SNE. I find it both fun to play with and pretty to look at!

blog.cochlea.xyz/scatter.html

27.03.2025 14:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Video thumbnail

I've added accelerometer support to the spring-mass synthesis toy. Try it out on your phone!

blog.cochlea.xyz/string.html

25.03.2025 20:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

I believe that one key to sparse audio representations will be baking "resonance priors" into models; even many synth sounds are aping some real-world system. As such, I've gotten interested in physical modeling, and built this fun little Web Audio toy to learn more!

blog.cochlea.xyz/string.html

17.03.2025 20:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@johnvinyard is following 14 prominent accounts