
Time for some (extreme) distillation with Thomas van Dongen - founder of the Minish Lab
Word embeddings might feel like they are a little bit out of fashion. After all, we have attention mechanisms and transformer models now, right? Well, it turns out that if you apply distillation the right way you can act





















