David Chuan-En Lin's Avatar

David Chuan-En Lin

@davidlin.io.bsky.social

PhD student @cmuhcii.bsky.social. Building human๐ŸคAI design tools.

21 Followers  |  42 Following  |  11 Posts  |  Joined: 20.11.2024  |  1.6455

Latest posts by davidlin.io on Bluesky

Preview
Visual transcripts: lecture notes from blackboard-style lecture videos: ACM Transactions on Graphics: Vol 34, No 6 Blackboard-style lecture videos are popular, but learning using existing video player interfaces can be challenging. Viewers cannot consume the lecture material at their own pace, and the content is also difficult to search or skim. For these reasons, ...

โ€ข Rubyslippers dl.acm.org/doi/10.1145...
โ€ข QuickCut dl.acm.org/doi/10.1145...
โ€ข Visual transcripts dl.acm.org/doi/abs/10....

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
MixT | Proceedings of the 25th annual ACM symposium on User interface software and technology

A few related HCI works I am inspired by
โ€ข Video digests dl.acm.org/doi/10.1145...
โ€ข MixT dl.acm.org/doi/10.1145...

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

So, I built this quick tool to "unroll" a video (which is temporal and sequential in nature) into a "flattened" article. The article is segmented into steps and has demonstrative video clips that align with the steps.

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I don't want to just read a recipe article because I want to see how specific techniques are performed. For example, kneading pizza dough.

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I've been learning new food recipes.

However, it's challenging to watch a cooking video (play/pause/go back/jump forward) while actively cooking at the same time.

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Unroll a video.

instructional video โ†’ step-by-step animated guide

23.02.2025 05:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Interpolate abstract concepts using analogies.

peaceful โ†’ dove
aggressive โ†’ falcon

08.02.2025 14:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Multimodal interpolation with text โ†” image โ†” audio.

01.02.2025 14:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Interpolate concepts in latent space

24.01.2025 22:35 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Transforming between modalities could be interesting.

text โ†” image โ†” video

text โ†’ image: image generation
image โ†’ video: video generation
video โ†’ image: highlight detection
image โ†’ text: image captioning

07.12.2024 16:53 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

๐Ÿค Semantic pinching

What if you can pinch your screen to transform an article ๐Ÿ“ into an emoji ๐Ÿ’ช and reverse!

Here is a simple prototype that uses LLM + gestures to transform text between different levels of abstractions:
emoji โ†” word โ†” sentence โ†” paragraph โ†” article

๐ŸŒ semanticpinching.vercel.app

07.12.2024 16:53 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@davidlin.io is following 20 prominent accounts