Yet another pre-Christmas release!! ๐
๐
Here is ๐๐ญ๐๐๐ฅ๐-๐๐๐ which generates sound effects from silent video frames showing semantic and temporal alignment.
๐ถ๐ฅ๐๏ธ
Huge thanks to @riccardofosco.bsky.social Christian Marinoni and all co-authors ๐ค
23.12.2024 14:01 โ ๐ 3 ๐ 1 ๐ฌ 0 ๐ 0
Super interesting work on #GenAI #Video2Audio with impressive results from my friends @riccardofosco.bsky.social @Christian Marinoni together with @emilianpos.bsky.social @mcomunita.bsky.social Luca Cosmo, Joshua Reiss and @dacom.bsky.social !
๐ Go check it out!
20.12.2024 18:37 โ ๐ 4 ๐ 1 ๐ฌ 0 ๐ 0
A great work with Christian Marinoni, @emilianpos.bsky.social, @mcomunita.bsky.social, Luca Cosmo, Joshua D. Reiss and @dacom.bsky.social
20.12.2024 18:20 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
This project explores how to generate realistic sound effects for a silent video. Our model combines:
๐น Video-based RMS envelope prediction, and
๐น Audio synthesis with Stable Audio and ControlNet, enabling high-quality sound design synchronized to the visual input.
20.12.2024 18:19 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Stable-V2A: Synchronized Sound Effects Synthesis
Stable-V2A is a two-stage model for synthesizing synchronized sound effects with support for temporal and semantic controls.
๐ Excited to Share Our Latest Work! ๐ฅ๐ถ
Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls
arxiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A
20.12.2024 18:18 โ ๐ 5 ๐ 2 ๐ฌ 1 ๐ 2
AI & Music Data Scientist at @Music.AI | prev. @c4dm
Researcher in bioacoustics and AI ๐ฆ๐ค
Norwegian Institute for Nature Research (NINA)
https://www.nina.no/english/TABMON
PhD Student | Works on Explainable AI | https://donatellagenovese.github.io/
Mostly: ML for music production workflows.
Professor of Physics & Senior Data Fellow at Belmont University, Nashville TN
Head of Research for Hyperstate Music AI.
Teacher of audio engineers, Opinions my own.
Explainer blog: https://drscotthawley.github.io
Research scientist at Google DeepMind working on music โข DJ ๐ถ
https://ilariamanco.com/
Studying language in biological brains and artificial ones at the Kempner Institute at Harvard University.
www.tuckute.com
Guitarist, Researcher Google DeepMind. Opinions are my own.
Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern.
https://pseeth.github.io/
I created pyannote open source toolkit.
Co-founder and CSO at pyannoteAI
Scientist at CNRS.
https://audio.ls2n.fr
A science game to test your musical memory: https://tunetwins.app
Once was speech technologist - Water of Leith, Edinburgh - Born 320.23 ppm
KUโ็ฐ่พบๅโRโSOKENDAIโ้ไฟกไผ็คพN
Outlier detection / Anomaly detection / Kernel methods / Robust statistics / Statistical depth / Information geometry / Hopfield Networks
Auditory Signal Processing/Objective Metrics/Hearing Assistive Technologies. โจTwitter: @kyama0321โจWEB: https://sites.google.com/site/kyama0321/en
้ณๅฃฐใฎ็ ็ฉถใใใฆใใพใ