Haiwen Huang @haiwen-huang - Bluesky Profile

Happy to find that I've been selected as an Outstanding Reviewer for CVPR 2025!

11.05.2025 12:44 — 👍 3 🔁 0 💬 0 📌 0

📢 New paper CVPR 25!
Can meshes capture fuzzy geometry? Volumetric Surfaces uses adaptive textured shells to model hair, fur without the splatting / volume overhead. It’s fast, looks great, and runs in real time even on budget phones.
🔗 autonomousvision.github.io/volsurfs/
📄 arxiv.org/pdf/2409.02482

05.05.2025 13:00 — 👍 28 🔁 20 💬 1 📌 1

⏰ Heads up! The deadline for two #CVPR2025 Autonomous Grand Challenge tracks is May 10th, 2025:

1️⃣ NAVSIM v2 Challenge: huggingface.co/spaces/AGC20...

2️⃣ World Model Challenge by 1X: huggingface.co/spaces/1x-te...

28.04.2025 09:41 — 👍 9 🔁 6 💬 1 📌 0

Introducing CaRL: Learning Scalable Planning Policies with Simple Rewards
We show how simple rewards enable scaling up PPO for planning.
CaRL outperforms all prior learning-based approaches on nuPlan Val14 and CARLA longest6 v2, using less inference compute.
arxiv.org/abs/2504.17838

28.04.2025 15:17 — 👍 25 🔁 14 💬 0 📌 1

Sometimes you choose aesthetics over aligned maximum at all axes 😂

27.04.2025 03:01 — 👍 0 🔁 0 💬 1 📌 0

Loft🆙 Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models. We achieve SotA upsampling results for DINOv2. Paper and code:
andrehuang.github.io/loftup-site/

26.04.2025 14:47 — 👍 28 🔁 3 💬 2 📌 0

Sharing another video showing how LoftUp significantly improves DINOv2 features! Works like a charm!

Try it out:
Code: github.com/andrehuang/l...
Paper: arxiv.org/abs/2504.14032

26.04.2025 07:52 — 👍 9 🔁 2 💬 0 📌 0

Excited to introduce LoftUp!

A strong (than ever) and lightweight feature upsampler for vision encoders that can boost performance on dense prediction tasks by 20%–100%!

Easy to plug into models like DINOv2, CLIP, SigLIP — simple design, big gains. Try it out!

github.com/andrehuang/l...

22.04.2025 07:55 — 👍 19 🔁 5 💬 0 📌 0

How much 3D do visual foundation models (VFMs) know?

Previous work requires 3D data for probing → expensive to collect!

#Feat2GS @cvprconference.bsky.social 2025 - our idea is to read out 3D Gaussains from VFMs features, thus probe 3D with novel view synthesis.

🔗Page: fanegg.github.io/Feat2GS

31.03.2025 16:06 — 👍 24 🔁 7 💬 1 📌 1

🦣Easi3R: 4D Reconstruction Without Training!

Limited 4D datasets? Take it easy.

#Easi3R adapts #DUSt3R for 4D reconstruction by disentangling and repurposing its attention maps → make 4D reconstruction easier than ever!

🔗Page: easi3r.github.io

01.04.2025 15:21 — 👍 22 🔁 3 💬 2 📌 4

🐎 Centaur, our first foray into test-time training for end-to-end driving. No retraining needed, just plug-and-play at deployment given a trained model. Also, theoretically nearly no overhead in latency with some clever use of buffers. Surprising how effective this is! arxiv.org/abs/2503.11650

17.03.2025 11:03 — 👍 12 🔁 7 💬 1 📌 1

🚀 Names matter! We show that better class names in open-vocabulary segmentation benchmarks greatly improve dataset quality and boost model performance. RENOVATE your dataset labels with our automatic framework! #AI #ComputerVision #NeurIPS24
andrehuang.github.io/renovate/

26.02.2025 14:45 — 👍 32 🔁 6 💬 1 📌 0

Synchronization is ubiquitous in nature and a key mechanism for information processing in the brain. We introduce AKOrN as a dynamical alternative to threshold units, which can be combined with MLPs, CNNs or Transformers. ICLR'25 Oral. Project page: takerum.github.io/akorn_projec...

12.02.2025 14:07 — 👍 47 🔁 11 💬 2 📌 2

This week we had our winter retreat jointly with Daniel Cremer's group in Montafon, Austria. 46 talks, 100 Km of slopes and night sledding with some occasionally lost and found. It has been fun!

16.01.2025 17:49 — 👍 72 🔁 11 💬 0 📌 1

Haiwen Huang

Latest posts by haiwen-huang.bsky.social on Bluesky

@haiwen-huang is following 20 prominent accounts