โข Rubyslippers dl.acm.org/doi/10.1145...
โข QuickCut dl.acm.org/doi/10.1145...
โข Visual transcripts dl.acm.org/doi/abs/10....
@davidlin.io.bsky.social
PhD student @cmuhcii.bsky.social. Building human๐คAI design tools.
โข Rubyslippers dl.acm.org/doi/10.1145...
โข QuickCut dl.acm.org/doi/10.1145...
โข Visual transcripts dl.acm.org/doi/abs/10....
A few related HCI works I am inspired by
โข Video digests dl.acm.org/doi/10.1145...
โข MixT dl.acm.org/doi/10.1145...
So, I built this quick tool to "unroll" a video (which is temporal and sequential in nature) into a "flattened" article. The article is segmented into steps and has demonstrative video clips that align with the steps.
23.02.2025 05:51 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0I don't want to just read a recipe article because I want to see how specific techniques are performed. For example, kneading pizza dough.
23.02.2025 05:51 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0I've been learning new food recipes.
However, it's challenging to watch a cooking video (play/pause/go back/jump forward) while actively cooking at the same time.
Unroll a video.
instructional video โ step-by-step animated guide
Interpolate abstract concepts using analogies.
peaceful โ dove
aggressive โ falcon
Multimodal interpolation with text โ image โ audio.
01.02.2025 14:15 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Interpolate concepts in latent space
24.01.2025 22:35 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Transforming between modalities could be interesting.
text โ image โ video
text โ image: image generation
image โ video: video generation
video โ image: highlight detection
image โ text: image captioning
๐ค Semantic pinching
What if you can pinch your screen to transform an article ๐ into an emoji ๐ช and reverse!
Here is a simple prototype that uses LLM + gestures to transform text between different levels of abstractions:
emoji โ word โ sentence โ paragraph โ article
๐ semanticpinching.vercel.app