Yet another pre-Christmas release!
Here is Stable-V2A, which generates sound effects from silent video frames that are semantically and temporally aligned with the video.
Huge thanks to @riccardofosco.bsky.social, Christian Marinoni, and all co-authors.
23.12.2024 14:01
Super interesting work on #GenAI #Video2Audio, with impressive results, from my friends @riccardofosco.bsky.social and Christian Marinoni, together with @emilianpos.bsky.social, @mcomunita.bsky.social, Luca Cosmo, Joshua Reiss, and @dacom.bsky.social!
Go check it out!
20.12.2024 18:37
Great work with Christian Marinoni, @emilianpos.bsky.social, @mcomunita.bsky.social, Luca Cosmo, Joshua D. Reiss, and @dacom.bsky.social.
20.12.2024 18:20
This project explores how to generate realistic sound effects for silent video. Our model combines:
- video-based RMS envelope prediction, and
- audio synthesis with Stable Audio and ControlNet,
enabling high-quality sound design synchronized to the visual input.
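For anyone unfamiliar with the term: an RMS envelope is a coarse loudness curve, one root-mean-square value per short frame of audio. The sketch below only illustrates that concept by computing an envelope from an existing signal; it is not the Stable-V2A code. In the model the envelope is predicted from video instead. The function name, frame length, hop length, and the toy test signal are all assumptions chosen for illustration.

```python
import math


def rms_envelope(samples, frame_length=512, hop_length=128):
    """Return one RMS value per frame of a mono signal (a list of floats).

    Illustrative helper, not part of Stable-V2A. Only frames that fit
    entirely inside the signal are used; a signal shorter than one frame
    is treated as a single shorter frame.
    """
    if frame_length <= 0 or hop_length <= 0:
        raise ValueError("frame_length and hop_length must be positive")
    envelope = []
    last_start = max(len(samples) - frame_length, 0)
    for start in range(0, last_start + 1, hop_length):
        frame = samples[start:start + frame_length]
        if not frame:  # empty input: nothing to measure
            break
        mean_square = sum(x * x for x in frame) / len(frame)
        envelope.append(math.sqrt(mean_square))
    return envelope


# Toy signal (assumed values): 1000 samples of silence, then a 440 Hz tone
# at amplitude 0.5, sampled at 16 kHz.
sample_rate = 16000
signal = [0.0] * 1000 + [
    0.5 * math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(1000)
]
env = rms_envelope(signal, frame_length=200, hop_length=100)
```

The envelope stays at zero during the silence and rises once the tone starts, which is the kind of onset and intensity information a temporal control signal carries.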
20.12.2024 18:19
Stable-V2A: Synchronized Sound Effects Synthesis
Stable-V2A is a two-stage model for synthesizing synchronized sound effects with support for temporal and semantic controls.
Excited to share our latest work!
Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls
arXiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A
20.12.2024 18:18