Jack (in SF) Langerman's Avatar

Jack (in SF) Langerman

@jacklangerman.bsky.social

63 Followers  |  52 Following  |  55 Posts  |  Joined: 26.09.2023  |  2.3507

Latest posts by jacklangerman.bsky.social on Bluesky

Very proud that our paper has been accepted to
@iccv.bsky.social !!!

See you in Hawaii!
bsky.app/profile/duch...

25.06.2025 20:52 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸŽ‰ Thrilled to share our CVPR 2025 Award Candidate & Oral paper:

πŸ”Ή GlobustVP
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World

🧱 Global optimality
πŸ’₯ Tolerates up to 70% outliers
⚑ Fast runtime

πŸ“„ Paper: arxiv.org/abs/2505.04788

πŸ’» Code: github.com/WU-CVGL/GlobustVP

1/

09.05.2025 21:15 β€” πŸ‘ 29    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

#ICCV2025 reviews are out and being sent via email to authors! They will also be available on OpenReview later. 11,152 active submissions all have at least 3 reviews. Authors have the opportunity to submit a rebuttal by May 16 2025 11:59 PM HST.

09.05.2025 20:49 β€” πŸ‘ 20    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

Please reshare, tell you friends, give it a try, and don't hesitate to reach out to the team with any questions!

bsky.app/profile/jack...

06.05.2025 23:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
2nd Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics CVPR 2025 Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics.

🌍 Workshop + other challenges at:
usm3d.github.io
Let’s make SfM more structured.
Good luck, teams!

(7/7)

06.05.2025 23:28 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
usm3d/handcrafted_submission_2025 Β· Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

πŸ“Œ Want a clear reference?
Check out the example baseline submission:
huggingface.co/usm3d/handcr...

(6/7)

06.05.2025 23:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Explaining Human Preferences via Metrics for Structured 3D Reconstruction "What cannot be measured cannot be improved" while likely never uttered by Lord Kelvin, summarizes effectively the purpose of this work. This paper presents a detailed evaluation of automated metrics ...

πŸ“Š NEW Evaluation Metric
Say goodbye to WED.
Hello to a human-aligned combo of:
β€’ Vertex F1
β€’ Edge IoU
Backed by this paper:

πŸ‘‰ arxiv.org/abs/2503.08208

06.05.2025 23:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - s23dr/hoho2025: Tools and utilities for the S23DR-2025 competition and HoHo25k Dataset Tools and utilities for the S23DR-2025 competition and HoHo25k Dataset - s23dr/hoho2025

βš™οΈ All helper functions + data loaders:
pip install hoho2025
πŸ› οΈ Codebase: github.com/s23dr/hoho2025

(4/7)

06.05.2025 23:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
usm3d/hoho25k Β· Datasets at Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

πŸ“¦ New dataset:
hoho25k β€” 5x bigger than 2024!
β€’ 25,000 scenes
β€’ 8 images per scene
β€’ 200,000 total images
πŸ“₯ Get it here: huggingface.co/datasets/usm...

(3/7)

06.05.2025 23:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
S23DR2025 - a Hugging Face Space by usm3d This application allows you to view competition details, manage your submissions, and check leaderboards. You can access rules, dataset information, and view your own submissions and scores.

🧠 The task:
Turn multiview inputs (semantic segs, monocular depth) + SfM outputs (point clouds + camera poses) into sparse, geometrically accurate wireframes.
AKA: β€œMore Structured Structure-from-Motion” 😎

Challenge page πŸ‘‰ huggingface.co/spaces/usm3d...

(2/7)

06.05.2025 23:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🚨 Just one month left to submit your solutions for The Structured Semantic 3D Reconstruction (S23DR-2025) Challenge!!! It is not too late to join!

Comp on @hf.co, part of the Workshop on Urban Scene Modeling at @cvprconference.bsky.social 2025

πŸ”₯$25,000 prize pool. Deadline: June 5, 2025.

🧡 (1/7)

06.05.2025 23:28 β€” πŸ‘ 5    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1
LoRA+: Efficient Low Rank Adaptation of Large Models

Soufiane Hayou, Nikhil Ghosh, Bin Yu
In this paper, we show that Low Rank Adaptation (LoRA) as originally introduced in Hu et al. (2021) leads to suboptimal finetuning of models with large width (embedding dimension). This is due to the fact that adapter matrices A and B in LoRA are updated with the same learning rate. Using scaling arguments for large width networks, we demonstrate that using the same learning rate for A and B does not allow efficient feature learning. We then show that this suboptimality of LoRA can be corrected simply by setting different learning rates for the LoRA adapter matrices A and B with a well-chosen ratio. We call this proposed algorithm LoRA+. In our extensive experiments, LoRA+ improves performance (1-2 % improvements) and finetuning speed (up to ∼ 2X SpeedUp), at the same computational cost as LoRA.

LoRA+: Efficient Low Rank Adaptation of Large Models Soufiane Hayou, Nikhil Ghosh, Bin Yu In this paper, we show that Low Rank Adaptation (LoRA) as originally introduced in Hu et al. (2021) leads to suboptimal finetuning of models with large width (embedding dimension). This is due to the fact that adapter matrices A and B in LoRA are updated with the same learning rate. Using scaling arguments for large width networks, we demonstrate that using the same learning rate for A and B does not allow efficient feature learning. We then show that this suboptimality of LoRA can be corrected simply by setting different learning rates for the LoRA adapter matrices A and B with a well-chosen ratio. We call this proposed algorithm LoRA+. In our extensive experiments, LoRA+ improves performance (1-2 % improvements) and finetuning speed (up to ∼ 2X SpeedUp), at the same computational cost as LoRA.

hey @cloneofsimo.bsky.social (or anyone else) did you ever try LoRA+ style LR split between down/up LoRA projection matrices on diffusion models?

29.04.2025 23:46 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Logo for MIB: A Mechanistic Interpretability Benchmark

Logo for MIB: A Mechanistic Interpretability Benchmark

Lots of progress in mech interp (MI) lately! But how can we measure when new mech interp methods yield real improvements over prior work?

We propose 😎 π— π—œπ—•: a 𝗠echanistic π—œnterpretability 𝗕enchmark!

23.04.2025 18:15 β€” πŸ‘ 49    πŸ” 15    πŸ’¬ 1    πŸ“Œ 6

When I was working at Bell Labs in Murray Hill we used to specifically go to 3rd floor for coffee because the couch in that particular coffee room was (supposedly) Dennis Ritchie's.

was also a room with a whiteboard that may have been where Shannon did a lot of the info theory work.

good times

27.03.2025 15:37 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

do you think this is a matter of semantic disipline or genuine beliefs about qualia?

ie "with certain inputs models can produce outputs that contain patterns typically associated with anxiety in humans. asking them to perform mindfulness exercises can mitigate this behavior" vs "they feel anxious"?

18.03.2025 20:40 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

1 week to USM3D deadline

17.03.2025 17:03 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

more blending between rings i think (~long range dependence)

15.03.2025 22:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image Post image

Explaining Human Preferences via Metrics for Structured 3D Reconstruction

@jacklangerman.bsky.social Denys Rozumnyi, Yuzhong Huang, @ducha-aiki.bsky.social

tl;dr: we asked 3D modelers to rank wireframe reconstructions & compared it to ranking by metrics. Observations🧡
1/
arxiv.org/abs/2503.08208

12.03.2025 08:28 β€” πŸ‘ 23    πŸ” 8    πŸ’¬ 1    πŸ“Œ 4
Post image Post image

2nd Building3D #CVPR2025 challenge at #USM3D workshop is open!
Task: point cloud to wireframe.
Prize pool: $10k
Competition deadline: May 25 2025.
Website: huggingface.co/spaces/Build...
@cvprconference.bsky.social

10.03.2025 07:39 β€” πŸ‘ 12    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Post image

We made a new keypoint detector named DaD, paper isn't up yet, but code and weights are:
github.com/Parskatt/dad

10.03.2025 07:53 β€” πŸ‘ 44    πŸ” 8    πŸ’¬ 7    πŸ“Œ 0
Post image

Those, who work in structured (images/pcl to CAD) reconstruction - USM3D #CVPR2025 workshop submissions are open.
Deadline: March 24 2025
Both full papers (8 pages) and extended abstracts (4 pages) are OK

usm3d.github.io
#USM3D
@cvprconference.bsky.social @jacklangerman.bsky.social

10.03.2025 07:32 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1
Post image

We have extended the deadline for paper submission for Image Matching Workshop. Anything image matching or 3D reconstruction related is welcomed
Now it is March 17
@cvprconference.bsky.social
#CVPR2025
image-matching-workshop.github.io

09.03.2025 17:08 β€” πŸ‘ 17    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0
Preview
MPS operator coverage tracking issue (2.6+ version) Β· Issue #141287 Β· pytorch/pytorch πŸ› Describe the bug This issue is to have a centralized place to list and track work on adding support to new ops for the MPS backend. PyTorch MPS Ops Project : Project to track all the ops for MPS ...

github.com/pytorch/pyto...

06.03.2025 23:56 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

yeah `PYTORCH_ENABLE_MPS_FALLBACK=1 python myscript.py` did it.

06.03.2025 23:53 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

bsky.app/profile/jack...

06.03.2025 23:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
:_( 
operation not implemented on MPS
woe is me...

:_( operation not implemented on MPS woe is me...

oh no....

06.03.2025 23:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

what is the best intro I can send people who suddenly want to know "what is AI?" and "what is a transformer based model?" but arent really going to invest tons of time and maybe dont have any math background?

my default is 3b1b, but does anyone have another suggestion?

31.01.2025 03:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
ChatGPT and Image Matching – Wide baseline stereo meets deep learning Are we done yet?

Image matching and ChatGPT - new post in the wide baseline stereo blog.

tl;dr: it is good, even feels like human, but not perfect.
ducha-aiki.github.io/wide-baselin...

02.01.2025 21:01 β€” πŸ‘ 34    πŸ” 8    πŸ’¬ 2    πŸ“Œ 1

ive said for a long time this is one of the things I love most about ML: get to play in everyones' playgrounds :-)

29.12.2024 22:44 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

o u know - jc

27.12.2024 16:52 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@jacklangerman is following 20 prominent accounts