You can play with this example yourself here: blog.cochlea.xyz/resonancemod...
13.11.2025 21:11 β π 0 π 0 π¬ 0 π 0@johnvinyard.bsky.social
Working on sparse, interpretable, and easy-to-manipulate models of audio (especially music). https://github.com/JohnVinyard
You can play with this example yourself here: blog.cochlea.xyz/resonancemod...
13.11.2025 21:11 β π 0 π 0 π¬ 0 π 0Here's me scraping unskillfully on a "violin".
13.11.2025 20:27 β π 0 π 0 π¬ 1 π 0In this micro-experiment, I try to extract playable instrument models from a single audio recording. You can play with a handful of the models in your browser!
blog.cochlea.xyz/resonancemod...
You can play with it too: blog.cochlea.xyz/conv-instrum...
18.10.2025 16:00 β π 0 π 0 π¬ 0 π 0Text-to-music is amazing, but it sucks all the fun out for me, as a musician. I'm using machine learning to "extract" models that can be controlled in real-time. My goal is to give them all the subtlety and richness that you'd get from a real acoustic instrument.
18.10.2025 15:57 β π 1 π 0 π¬ 1 π 0A fun little WebAudio + MediaPipe toy for October πΈ π§
blog.cochlea.xyz/theremin.html
blog.cochlea.xyz/rnn-instrume...
07.09.2025 20:15 β π 0 π 0 π¬ 0 π 0I have received an endorsement! Feel free to read the paper and provide feedback if you're so inclined!
08.05.2025 16:23 β π 0 π 0 π¬ 0 π 0Supplemental material with example audio reconstructions is here:
blog.cochlea.xyz/sparse-inter...
I wrote a paper summarizing the work I've been pursuing toward a sparse audio codec!
I'd like to post this to the CS -> Sound section of arxiv, if possible, so if anyone would be willing to "endorse" me, shoot me a message and I'll share the code!
blog.cochlea.xyz/toward-a-spa...
Here's the "dashboard" for a model that's training right now. Event times are represented bottom-left, and event vectors, which encode everything about the sound _except_ for time-of-occurrence, on the bottom-right.
27.03.2025 14:09 β π 0 π 0 π¬ 0 π 0To put it into context, an encoder model is trained to incrementally decompose a segment of audio into some number of event vectors and their times-of-occurrence. You can read more about the training process here:
blog.cochlea.xyz/sparse-inter...
This play-able scatterplot (clicking on points plays a sound) maps many points from an audio event decoder onto a 2D plane using t-SNE. I find it both fun to play with and pretty to look at!
blog.cochlea.xyz/scatter.html
I've added accelerometer support to the spring-mass synthesis toy. Try it out on your phone!
blog.cochlea.xyz/string.html
I believe that one key to sparse audio representations will be baking "resonance priors" into models; even many synth sounds are aping some real-world system. As such, I've gotten interested in physical modeling, and built this fun little Web Audio toy to learn more!
blog.cochlea.xyz/string.html