Jia-Bin Huang's Avatar

Jia-Bin Huang

@jbhuang0604.bsky.social

Associate Professor at UMD CS. YouTube: https://youtube.com/@jbhuang0604 Interested in how computers can learn and see.

2,620 Followers  |  32 Following  |  175 Posts  |  Joined: 18.11.2024  |  1.9786

Latest posts by jbhuang0604.bsky.social on Bluesky

Post image

In an era of billion-parameter models everywhere, it's incredibly refreshing to see how a fundamental question can be formulated and solved with simple, beautiful math.

- How should we orient a solar panel β˜€οΈπŸ”‹? -

Zero AI! If you enjoy math, you'll love this!

Video: www.youtube.com/watch?v=ZKzL...

16.07.2025 14:25 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

*Slides without slide titles*

When I first tried presenting WITHOUT slide titles, everything flowed so much better! (totally validated ... by me)!

Give it a shot! Once you try it, you’ll never want to go back.

08.07.2025 11:26 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

*Empty initial slides*

What’s a better starting point than that default slide layout?

A completely blank slide.

It helps you explore the design space and focus on delivering a clear, compelling story.

08.07.2025 11:26 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

*Bullet points*

The second thing the layout prompts you to do?
("Click to add text").

Start a bullet list.

Among so many creative forms of presenting your ideas, it nudges you toward the most boring one: a list. πŸ”’

08.07.2025 11:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

*Slide title*

The first thing this layout does is to ask you to add a slide title.

Seems reasonable, right? visuals, this encourages you to
1) lead your presentation with text instead of visuals and
2) cram in many titles in a talk, making it harder to maintain a narrative flow.

08.07.2025 11:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Why is the "Title and Content" slide layout BAD?

Most people prepare their presentation from this default layout. I used it for years without questioning it.

BUT, this essentially guides you toward developing poor presentation. Why? πŸ€”

08.07.2025 11:26 β€” πŸ‘ 20    πŸ” 2    πŸ’¬ 5    πŸ“Œ 0

Thanks! Yup, I hope to cover some fun computer vision applications. Stay tuned!

02.07.2025 07:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Kids’ summer camp just kicked off, and that means...
I finally have time to make new videos!

What topics are you most interested in right now?

01.07.2025 09:51 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Why More Researchers Should be Content Creators

Just trying something new! I recorded one of my recent talks, sharing what I learned from starting as a small content creator.

youtu.be/0W_7tJtGcMI

We all benefit when there are more content creators!

24.06.2025 21:58 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

Fresh out of the oven! 🍞 @jbhuang0604.bsky.social breaks down Mean Flow from Kaiming’s group in his latest video.

Video: youtu.be/swKdn-qT47Q?...

19.06.2025 22:24 β€” πŸ‘ 18    πŸ” 2    πŸ’¬ 0    πŸ“Œ 1
Policy Gradient in One Minute
YouTube video by Jia-Bin Huang Policy Gradient in One Minute

No time? I’ve got your back!

Check out Policy Gradient in One Minute!
youtu.be/p9k9YUdnNlk

Have fun!

20.06.2025 23:08 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Policy gradient methods rock!

These are the core techniques for making your transformer "chat" and "reason", a robot that manipulates objects, and a drone that maneuvers in a complex environment.

BUT, how do we learn all the developments in the past 30+ years?

20.06.2025 23:08 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
One Step, Big Leap: The Simple Idea Transforming Generative AI
YouTube video by Jia-Bin Huang One Step, Big Leap: The Simple Idea Transforming Generative AI

Check out the video to learn this new, elegant formulation of generative models!

youtu.be/swKdn-qT47Q

20.06.2025 16:09 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Awesome! 🀩

So glad to hear the authors enjoyed the video, totally made my day!

20.06.2025 16:09 β€” πŸ‘ 13    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We had a blast at CVPR2025!

There was so much to learn! I am particularly excited to meet many new friends and reconnect with old ones.

I feel energized. Already looking forward to the next one!

17.06.2025 14:38 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thanks a lot!

04.06.2025 20:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Kullback–Leibler (KL) divergence is a cornerstone of machine learning.

We use it everywhere, from training classifiers and distilling knowledge from models, to learning generative models and aligning LLMs.

BUT, what does it mean, and how do we (actually) compute it?

Video: youtu.be/tXE23653JrU

04.06.2025 14:58 β€” πŸ‘ 30    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1

My X/Twitter account has been hacked... Please don't believe what they said!

Trying to get it back in the meantime. Sorry for the inconvenience!

03.06.2025 18:11 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

How LLMs Learn to Reason with Reinforcement Learning

Full video: www.youtube.com/watch?v=mg-i...

21.05.2025 18:32 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Ha! Yes, Seungjae insisted that we call this IVE.

21.05.2025 17:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

RL is so back!

Reinforcement learning is a key driver in aligning LLMs and enhancing their reasoning capabilities.

BUT, it’s a tricky topic to wrap your head around (at least for myself πŸ˜΅β€πŸ’«).

So, I put up a video breaking down the basics in a way that clicked for me. I hope it helps you, too!

21.05.2025 17:14 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I find TRPO's idea of learning from others' experiences fascinating.

So, I started running TRPO for my group, making all (previously individual) feedback on experiments, writing, rebuttals, and presentations public.

Now everyone gets to learn from each other’s trajectories!

19.05.2025 14:29 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Indeed!!

14.05.2025 17:18 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ₯Ί

14.05.2025 13:41 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models IVE: Imagine, Verify, Execute: Agentic Exploration with Vision-Language Models

Brought to you by our amazing students Seungjae Lee, Daniel Ekpo, Haowen Liu, and faculty Furong Huang and Abhinav Shrivastava

Learn more at ive-robot.github.io

14.05.2025 13:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

IVE leverages VLMs to
β€’ extract semantic scene graphs,
β€’ imagine novel scenes,
β€’ predict their physical plausibility, and
β€’ generate executable sequences.

IVE is a memory-guided agentic exploration framework that operates fully automatically, enabling more diverse and meaningful exploration.

14.05.2025 13:33 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Exploration is key for robots to generalize, especially in open-ended environments with vague goals and sparse rewards.

BUT, how do we go beyond random poking? Wouldn't it be great to have a robot that explores an environment just like a kid?

Introducing Imagine, Verify, Execute (IVE)!

14.05.2025 13:33 β€” πŸ‘ 10    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0

Yup, it’s so much fun!

26.04.2025 22:57 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Solving high-impact real-world problems with multimodal foundation models

26.04.2025 16:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

You are constantly leveling up your presentation!!

23.04.2025 12:39 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@jbhuang0604 is following 20 prominent accounts