Example: biology
A Transformer
Found 1 free book(s)Training data-efficient image transformers & distillation ...
arxiv.orgtransformer through attention. This transformer-specific strategy outper-forms vanilla distillation by a significant margin. •Interestingly, with our distillation, image transformers learn more from a convnet than from another transformer with comparable performance. •Our models pre-learned on Imagenet are competitive when transferred to