FM(-OT) paths are not necessarily straight when you map to anything more than a single point (i.e. more than a delta peak).
Doing naive distillation with noise/image pairs will give straight paths (on any diffusion/FM model).
After training, applying Reflow (from rectified flow) will indeed also straighten the paths.
03.12.2024 21:35
Finally, something that is not obvious at all and requires some digging and equation rewriting: SD3's (arxiv.org/abs/2403.03206) flow matching weighting is very similar to EDM's weighting (arxiv.org/abs/2206.00364).
03.12.2024 13:26
The FM paths above do not actually look that straight, even though straightness is an often-claimed feature of FM. What's going on?
- The FM schedule (the "OT" is often omitted) is straight to a single point (possibly with tiny noise).
- Unfortunately, that does not guarantee straightness between distributions.
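A small 1D sketch of the two bullets above, under assumptions not stated in the thread: time runs from t=1 (noise) to t=0 (data), the path is z_t = (1 - t) x + t eps with eps ~ N(0, 1), and the target is either a single point at +1 or two equally likely points at -1 and +1. In these toy cases the exact posterior mean of x given z_t has a closed form, so the exact marginal velocity u = (z_t - E[x | z_t]) / t can be integrated directly. All function names are hypothetical.

```python
import numpy as np

def x_hat_single(z, t):
    """E[x | z_t = z] when the target is a single point at +1 (assumed toy target)."""
    return 1.0

def x_hat_two_points(z, t):
    """E[x | z_t = z] when x is uniform on {-1, +1} and eps ~ N(0, 1)."""
    return np.tanh(z * (1.0 - t) / t**2)

def integrate(x_hat_fn, z1, n_steps=1000, t_min=1e-3):
    """Euler-integrate dz/dt = (z - x_hat) / t from t=1 down to t=t_min."""
    ts = np.linspace(1.0, t_min, n_steps + 1)
    zs = [z1]
    for t, s in zip(ts[:-1], ts[1:]):
        z = zs[-1]
        u = (z - x_hat_fn(z, t)) / t      # marginal FM velocity at (z, t)
        zs.append(z + (s - t) * u)
    return ts, np.array(zs)

def max_deviation_from_chord(ts, zs):
    """Largest gap between the trajectory and the straight line joining its endpoints."""
    chord = zs[0] + (zs[-1] - zs[0]) * (ts - ts[0]) / (ts[-1] - ts[0])
    return float(np.max(np.abs(zs - chord)))

z1 = 0.3  # same starting noise sample for both targets
dev_single = max_deviation_from_chord(*integrate(x_hat_single, z1))
dev_two = max_deviation_from_chord(*integrate(x_hat_two_points, z1))
print(f"single point: {dev_single:.2e}, two points: {dev_two:.2e}")
```

With the single-point target the trajectory is a straight line in t up to floating-point error. With two points it first drifts toward the data mean and only later commits to one mode, so it bends away from the chord.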
03.12.2024 13:26
DDIM vs Flow Matching.
Note that whatever schedule (alpha/sigma, VP, VE, FM) we pick, DDIM always ends up at the same spot:
- DDIM is invariant to alpha/sigma rescalings.
- With the FM schedule, sampling with either DDIM or Euler (= what FM uses) is the same.
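Both bullets can be checked numerically for a single update step. This is a minimal sketch, assuming the convention z_t = alpha_t x + sigma_t eps with the FM schedule alpha_t = 1 - t, sigma_t = t, and a random vector standing in for a network's clean-data prediction. The function names and the scale factors c_t, c_s are hypothetical.

```python
import numpy as np

def ddim_step(z_t, x_hat, alpha_t, sigma_t, alpha_s, sigma_s):
    """One deterministic DDIM update from time t to time s."""
    eps_hat = (z_t - alpha_t * x_hat) / sigma_t   # noise implied by z_t and x_hat
    return alpha_s * x_hat + sigma_s * eps_hat

def euler_step_fm(z_t, x_hat, t, s):
    """One Euler step on dz/dt = u, with u = eps_hat - x_hat (FM schedule)."""
    eps_hat = (z_t - (1.0 - t) * x_hat) / t
    u = eps_hat - x_hat
    return z_t + (s - t) * u

rng = np.random.default_rng(0)
z_t = rng.normal(size=4)
x_hat = rng.normal(size=4)   # stand-in for a network's clean-data prediction
t, s = 0.8, 0.5

# FM schedule: the DDIM update and the Euler update coincide.
z_ddim = ddim_step(z_t, x_hat, 1.0 - t, t, 1.0 - s, s)
z_euler = euler_step_fm(z_t, x_hat, t, s)

# Rescale alpha and sigma by arbitrary factors c_t, c_s (assumed values):
# the DDIM output is the same point, multiplied by the known factor c_s.
c_t, c_s = 2.0, 0.7
z_scaled = ddim_step(c_t * z_t, x_hat,
                     c_t * (1.0 - t), c_t * t,
                     c_s * (1.0 - s), c_s * s)

print(np.allclose(z_ddim, z_euler), np.allclose(z_scaled, c_s * z_ddim))
```

The sketch holds x_hat fixed across the two schedules, which assumes the network's prediction is unchanged once its input is rescaled accordingly.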
03.12.2024 13:26
This is a really nice blogpost by @RuiqiGao and team that I enjoyed being a part of. My favorite key learnings are:
- DDIM sampler == flow matching sampling
- (Not) straight?
- SD3 weighting (Esser, Rombach, et al.) is very similar to the EDM weighting (Karras et al.).
03.12.2024 13:26