Yet another pre-Christmas release!!
Here is Stable-V2A, which generates semantically and temporally aligned sound effects from silent video frames.
Huge thanks to @riccardofosco.bsky.social, Christian Marinoni, and all co-authors.
23.12.2024 14:01
We released GRAM, a method for learning and aligning any n modalities simultaneously!
GRAM may pave the way for several ML tasks in different fields of application.
Project page: https://ispamm.github.io/GRAM/
Immensely proud of Giordano Cicchetti, @eleonoragrassucci.bsky.social, Luigi Sigillo
19.12.2024 12:21
The link between diffusion models and optimal transport is still a bit of an enigma to me.
One thing that's clear: different diffusion models trained on similar datasets tend to recover similar mappings. If these are generally not OT, in what sense are they optimal instead?
30.11.2024 12:56
IJCNN 2025 in Rome, June 30 to July 5, 2025
https://2025.ijcnn.org
We are very glad to invite you all to contribute to the International Joint Conference on Neural Networks #IJCNN2025
Looking forward to welcoming you to IJCNN 2025 in Rome!
#INNS #NeuralNetworks #AI #MachineLearning
#AIConf
18.11.2024 19:29
Postdoc @Harvard | Topological Signal Processing, Deep Learning, AI for Health and Climate, Stochastic Optimization | Ex Visiting Associate @PennEngineers
cbattiloro.com
PhD in ICT @SapienzaRoma | Generative Deep Learning | http://luigisigillo.github.io
MSCA PhD student at AMLab with Erik Bekkers, interested in geometric deep learning, generative models and their intersection
bit.ly/olga-zaghen
I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
https://serrjoa.github.io/
Sound effects, audio & video | PhD at @ISPAMM, @Sapienza | Former @C4DM, @QMUL
PhD in ML/AI | Researching Efficient ML/AI (vision & language) & Interpretability | @SapienzaRoma @EdinburghNLP | https://alessiodevoto.github.io/ | ex @NVIDIA
Research scientist at Google DeepMind working on music • DJ
https://ilariamanco.com/
Machine learning researcher @Stanford. https://petersen.ai/
Assistant Professor @Sapienza, Rome.
Generative AI, Multimodal Learning, Generative Semantic Communication
Professor at NYU; Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
http://yann.lecun.com
AI for Music • Research Scientist @ Suno
Music and artificial intelligence.
Researcher at Stability AI.
Musician at BRNRT Collective.
Previously at Dolby and Universitat Pompeu Fabra.
artintech.substack.com
www.jordipons.me
Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com
CEO of Fairly Trained / Composer. Working towards fairer training data practices in generative AI.
International Conference on Learning Representations https://iclr.cc/
Group Leader, Generative AI | NeurIPS 2024 Program Chair | Principal Scientist & Director | Founder of Amsterdam AI Solutions
Speech and audio research scientist @MERL. saneworkshop.org co-founder. IguanaTex developer.
jonathanleroux.org
github.com/Jonathan-LeRoux/
scholar.google.com/citations?user=aUpxty8AAAAJ&hl=en