ADIFF: Explaining audio difference using natural language
Soham Deshmukh, Shuo Han, Rita Singh, Bhiksha Raj
Two new datasets were created, and a prefix-tuning baseline was compared with ADIFF, which uses a cross-projection module and position captioning. ADIFF showed significant improvements in both objective and human evaluation.
10.02.2025 07:07
Great opportunity to work with an amazing set of people!
09.12.2024 21:41
Hi @jonathanleroux.bsky.social, could you please add me to the list as well? Thank you in advance!
09.12.2024 02:43
universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA.
Abstract deadline: April 23, 2025 (23:59 AOE)
Paper deadline: April 30, 2025 (23:59
https://kyutai.org/ Open-Science AI Research Lab based in Paris
Principal Scientist at Naver Labs Europe && Professor at University Grenoble Alpes
#NLP #AI #LLMs
Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI
The Thirty-Eighth Annual Conference on Neural Information Processing Systems will be held in Vancouver Convention Center, on Tuesday, Dec 10 through Sunday, Dec 15.
https://neurips.cc/
Señor swesearcher @ Google DeepMind, adjunct prof at Université de Montréal and Mila. Musician. From 🇪🇨 living in 🇨🇦.
https://psc-g.github.io/
AI x storytelling
AI Engineering: https://amazon.com/dp/1098166302
Designing ML Systems: http://amazon.com/dp/1098107969
@chipro
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare
Co-CEO, Yutori. Join the waitlist at yutori.com
Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian.
Anon feedback: https://admonymous.co/giffmana
Zürich, Suisse http://lucasb.eyer.be
AI @ OpenAI, Tesla, Stanford
Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com
PhD-ing at UMD. Knows a little about multimodal generative models. Check out my website to know more - https://somepago.github.io/
Associate Professor in EECS at MIT. Neural nets, generative models, representation learning, computer vision, robotics, cog sci, AI.
https://web.mit.edu/phillipi/