John Vinyard @johnvinyard - Bluesky Profile

Extracting Playable Instrument Models from Short Audio Examples

You can play with this example yourself here: blog.cochlea.xyz/resonancemod...

13.11.2025 21:11 — 👍 0 🔁 0 💬 0 📌 0

Here's me scraping unskillfully on a "violin".

13.11.2025 20:27 — 👍 0 🔁 0 💬 1 📌 0

In this micro-experiment, I try to extract playable instrument models from a single audio recording. You can play with a handful of the models in your browser!

blog.cochlea.xyz/resonancemod...

12.11.2025 16:58 — 👍 0 🔁 0 💬 0 📌 0

Hand-Controlled WebAudio Instrument

You can play with it too: blog.cochlea.xyz/conv-instrum...

18.10.2025 16:00 — 👍 0 🔁 0 💬 0 📌 0

Text-to-music is amazing, but it sucks all the fun out for me, as a musician. I'm using machine learning to "extract" models that can be controlled in real-time. My goal is to give them all the subtlety and richness that you'd get from a real acoustic instrument.

18.10.2025 15:57 — 👍 1 🔁 0 💬 1 📌 0

A fun little WebAudio + MediaPipe toy for October 🛸 🧟

blog.cochlea.xyz/theremin.html

16.10.2025 16:47 — 👍 0 🔁 0 💬 0 📌 0

blog.cochlea.xyz/rnn-instrume...

07.09.2025 20:15 — 👍 0 🔁 0 💬 0 📌 0

Toward a Sparse and Interpretable Audio Codec Most widely-used modern audio codecs, such as Ogg Vorbis and MP3, as well as more recent "neural" codecs like Meta's Encodec or the Descript Audio Codec are based on block-coding; audio is divided int...

arxiv.org/abs/2505.05654

12.05.2025 11:59 — 👍 0 🔁 0 💬 0 📌 0

I have received an endorsement! Feel free to read the paper and provide feedback if you're so inclined!

08.05.2025 16:23 — 👍 0 🔁 0 💬 0 📌 0

Toward a Sparse Interpretable Audio Codec

Supplemental material with example audio reconstructions is here:

blog.cochlea.xyz/sparse-inter...

08.05.2025 15:17 — 👍 0 🔁 0 💬 0 📌 0

I wrote a paper summarizing the work I've been pursuing toward a sparse audio codec!

I'd like to post this to the CS -> Sound section of arxiv, if possible, so if anyone would be willing to "endorse" me, shoot me a message and I'll share the code!

blog.cochlea.xyz/toward-a-spa...

08.05.2025 15:16 — 👍 0 🔁 0 💬 2 📌 0

Here's the "dashboard" for a model that's training right now. Event times are represented bottom-left, and event vectors, which encode everything about the sound _except_ for time-of-occurrence, on the bottom-right.

27.03.2025 14:09 — 👍 0 🔁 0 💬 0 📌 0

Toward a Sparse Interpretable Audio Codec

To put it into context, an encoder model is trained to incrementally decompose a segment of audio into some number of event vectors and their times-of-occurrence. You can read more about the training process here:

blog.cochlea.xyz/sparse-inter...

27.03.2025 14:05 — 👍 0 🔁 0 💬 0 📌 0

This play-able scatterplot (clicking on points plays a sound) maps many points from an audio event decoder onto a 2D plane using t-SNE. I find it both fun to play with and pretty to look at!

blog.cochlea.xyz/scatter.html

27.03.2025 14:02 — 👍 0 🔁 0 💬 2 📌 0

I've added accelerometer support to the spring-mass synthesis toy. Try it out on your phone!

blog.cochlea.xyz/string.html

25.03.2025 20:05 — 👍 0 🔁 0 💬 0 📌 0

I believe that one key to sparse audio representations will be baking "resonance priors" into models; even many synth sounds are aping some real-world system. As such, I've gotten interested in physical modeling, and built this fun little Web Audio toy to learn more!

blog.cochlea.xyz/string.html

17.03.2025 20:25 — 👍 1 🔁 0 💬 0 📌 0

John Vinyard

Latest posts by johnvinyard.bsky.social on Bluesky

@johnvinyard is following 14 prominent accounts