The ICCV 2025 discussion phase will close soon!
As a reviewer, it's your job to:
- Read author rebuttals
- Engage in discussions
- Submit your final rating & justification
Your responsible engagement is critical for a fair review process!
Deadline: May 27!
26.05.2025 15:12
My solidarity with Harvard colleagues and my respect for maintaining dignity during troubled times. Their leadership makes their standing on the global stage well deserved.
23.05.2025 22:50
An owl which is the mascot of Rice is formed on the foam of a cappuccino
Cappuccino served at the 50th year celebration of the School of Engineering and Computing at Rice.
09.04.2025 14:30
Group picture with my PhD students #studio-ghibli
28.03.2025 02:59
Entrepreneurs 4 NSF - Formstack
Adding here an example of what I was going for: entrepreneurs4nsf.formstack.com/forms/petiti...
26.03.2025 01:44
I wish more authors of papers I have missed citing would email me. At the same time, I have only done this sparingly, and mostly with people I already know first hand or have had some kind of interaction with in the past, and in no case do I believe the omission was deliberate. It is just hard to keep up sometimes.
24.03.2025 17:11
Thanks CVPR for bringing the conversation here
28.02.2025 03:37
We just have to believe
28.02.2025 03:37
I remember when Twitter was this small and familiar. I had/have some followers on Twitter who are now low-key celebrities but probably followed me way back, before they became low-key celebrities.
28.02.2025 03:37
By popular demand, we are extending #CVPR2025 coverage to Bluesky. Stay tuned!
27.02.2025 21:07
But if it does happen in the open, my hope is that it's a concerted effort that includes academics as much as a larger coalition of people who agree this issue is important.
14.02.2025 09:17
There should be pushback publicly but it should not be only self-serving. If people only express concerns when it directly affects them, that's a sad state of things.
14.02.2025 09:07
Indeed I don't feel that way. I think there should be pushback but I also think the general public should be as concerned. Complaining and pushing back will happen but maybe not in the open.
14.02.2025 09:02
That said, there's no reason not to keep using our institutional channels to champion science and education.
14.02.2025 04:50
One thing we can do going forward is to work on convincing the general community that funding science and having strong research institutions is a good thing. This time we might have to learn from our mistakes.
14.02.2025 04:50
There are so many more things to be outraged about at the moment than federal funding for science. Especially when coming from academics, complaining about it is a bit too self-serving at this moment. Yes, it is bad and the effects will be long lasting, but so will a dozen other things.
14.02.2025 04:50
Great times for innovation ahead!
29.01.2025 02:13
These are models that can run perfectly well on most cheap hardware, unlike the full large R1. But I wouldn't be surprised if we see R1 quality running on more accessible hardware.
29.01.2025 02:13
Everyone concentrates on o1 and R1 but even the base 7B or 1.5B models seem better than the very first public version of ChatGPT (3.5-turbo) that took the world by surprise.
29.01.2025 02:13
I think it's exciting to see LLMs that are open source and on par with the top models accessible only through APIs. DeepSeek and before that Llama-3.
29.01.2025 02:13
Can pretrained diffusion models be connected for cross-modal generation?
Introducing AV-Link
Bridging unimodal diffusion models in one self-contained framework to enable:
- Video-to-Audio generation.
- Audio-to-Video generation.
snap-research.github.io/AVLink/
Results
14.01.2025 18:13
Check out this recent work by my PhD student Moayed. He has been doing amazing work on Generative AI for images, video and audio. We introduce AV-Link, a unified approach for audio-video generation. Our generated audio is the best in terms of synchronization with video actions. Check the thread below.
14.01.2025 18:23
I still don't feel much more productive in the era of LLMs. There are a few things I can do better, but it is far from what I hear in anecdotes. I wonder what would be the one low-hanging fruit I should be delegating to LLMs.
14.01.2025 03:52
Happy New Year Everyone!
09.01.2025 19:33
I remember watching this video circa 2009-2010.
19.12.2024 14:22
End of year celebration with the great Moshe Vardi @myvardi.bsky.social is one of the perks of Rice. Hopefully he will be active on Bluesky soon.
14.12.2024 02:31
news.rice.edu/news/2024/ri...
09.12.2024 02:21
Job alert! (a bit special but anyway)
If you are (or know someone who is) a PhD student about to graduate or a recent graduate who needs to bridge some time in between, I currently have a postdoc position that could run for 6 months.
Needed: Experience publishing at top (A-ranked) conferences.
Just ping me.
07.12.2024 13:08
The aviation and space nut at @microsiervos.bsky.social, plus a few other things. Shy almost to an unhealthy degree. I try to be nice almost all the time. Not to make you jealous (or maybe yes), but I watched NASA's last space shuttle launch.
Principal Scientist (Director) at Google DeepMind in Japan. Haze Elementary School → junior high school → Suzuka National College of Technology → Nagoya Institute of Technology (IBM T.J. Watson Research intern) → Toshiba Research Europe → Google (Speech 🇬🇧 → Brain 🇯🇵) → Google DeepMind. 3rd generation Korean in Japan.
Principal research scientist at Naver Labs Europe, I am interested in most aspects of computer vision, including 3D scene reconstruction and understanding, visual localization, image-text joint representation, embodied AI, ...
Teacher, Researcher, Distributed Systems, Cloud Computing, Big Data, Professor @ ESPOL, CS @ Illinois alumni, Fulbrighter
ELLIS PhD Fellow @belongielab.org | @aicentre.dk | University of Copenhagen | @amsterdamnlp.bsky.social | @ellis.eu
Multi-modal ML | Alignment | Culture | Evaluations & Safety | AI & Society
Web: https://www.srishti.dev/
Professor. Sociologist. NYTimes Opinion Columnist. Books: THICK, LowerEd. Forthcoming: 1) Black Mothering & Daughtering and 2) Mama Bears.
Beliefs: C.R.E.A.M. + the internet ruined everything good + bring back shame.
"I'm just here so I don't get fined."
Professor for AI at Hasso Plattner Institute and University of Potsdam
Berlin (prev. Rutgers NJ USA, Tsinghua Beijing, Berkeley)
http://gerard.demelo.org
ML Researcher, Qualcomm AI Research
Postdoc, University of British Columbia
Vector Institute
PhD, TU Darmstadt
Official account for the IEEE/CVF International Conference on Computer Vision. #ICCV2025 Honolulu 🇺🇸 Co-hosted by @natanielruiz @antoninofurnari @yaelvinker @CSProfKGD
Professor, University of Tübingen @unituebingen.bsky.social.
Head of Department of Computer Science.
Faculty, Tübingen AI Center 🇩🇪 @tuebingen-ai.bsky.social.
ELLIS Fellow, Founding Board Member 🇪🇺 @ellis.eu.
CV, ML, Self-Driving, NLP
Research Scientist at DeepMind. Opinions my own. Inventor of GANs. Lead author of http://www.deeplearningbook.org . Founding chairman of www.publichealthactionnetwork.org
PhD @RiceUniversity | Research Intern @Snap
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Journalist, currently at The New York Times. I cover privacy, technology, A.I., and the strange times we live in. Named after the Led Zeppelin song. Author of YOUR FACE BELONGS TO US. (Yes, in my head it will always be All Your Face Are Belong To Us)
Associate Professor & Co-Founder - Dynamical Deep Learning
professor at WashU; computer vision, remote sensing, etc.
Associate Professor in CS @ Georgia Tech
NLP/ML researcher
https://cocoxu.github.io/
Researcher in ML/NLP at the University of Edinburgh (faculty at Informatics and EdinburghNLP), Co-Founder/CTO at www.miniml.ai, ELLIS (@ELLIS.eu) Scholar, Generative AI Lab (GAIL, https://gail.ed.ac.uk/) Fellow -- www.neuralnoise.com, he/they