Distribution Matching Distillation
Distribution Matching Distillation
Auto-generated stub. Edit this file to add more details.
A distillation technique used to transfer the generative capabilities from a larger, bidirectional teacher model to a smaller, causal student model by matching output distributions.
Related Papers
Metadata & Links
- created_at
- 2026-03-27T06:06:58Z