An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment
13.12.2024 20:00 — 👍 6 🔁 1 💬 0 📌 0
If you want to learn more about audio-visual alignment and how to use it to give audio abilities to your VLM, stop by our @NeurIPSConf poster #3602 (East exhibit hall A-C) today at 11am!
13.12.2024 18:30 — 👍 2 🔁 0 💬 0 📌 1
Hi! Could you please add me?
28.11.2024 16:24 — 👍 2 🔁 0 💬 1 📌 0
Amazing! Would love to be added!
26.11.2024 14:09 — 👍 1 🔁 0 💬 0 📌 0
I’m a PhD student in University of Illinois Urbana-Champaign working on audio inverse problems.
My website: https://xzwy.github.io/alanweiyang.github.io/
PhD student @ Telecom Paris & Orosound
🗣️ Personalized speech enhancement
ELLIS PhD Fellow @belongielab.org | @aicentre.dk | University of Copenhagen | @amsterdamnlp.bsky.social | @ellis.eu
Multi-modal ML | Alignment | Culture | Evaluations & Safety| AI & Society
Web: https://www.srishti.dev/
PhD Student @ Telecom Paris
Doctoral Student at IIT Hyderabad, India
🎓 PHD student @ Télécom Paris - ADASP Team
👨💻 Building SSL foundation models for audio and music
Manchester Centre for AI FUNdamentals | UoM | Alumn UCL, DeepMind, U Alberta, PUCP | Deep Thinker | Posts/reposts might be non-deep | Carpe espresso ☕
Research Scientist @SonyAI
PhD from Seoul National University
Previous intern @MERL, @Sony, and @Supertone
Deep learning for audio signal processing and acoustics at Bang&Olufsen
francesclluis.com
Math Assoc. Prof. (On leave, Aix-Marseille, France)
Teaching Project (non-profit): https://highcolle.com/
universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io
Current: MA Music Tech @ McGill, Input Devices and Music Interaction Lab. Harvard, CS + Music. Research interests: drum resynthesis, embedded audio, expressive controllers.
Research Director @ Inria, Grenoble
Mostly: ML for music production workflows.
Professor of Physics & Senior Data Fellow at Belmont University, Nashville TN
Head of Research for Hyperstate Music AI.
Teacher of audio engineers, Opinions my own.
Explainer blog: https://drscotthawley.github.io