Example: biology

A Transformer

Found 1 free book(s)
Training data-efficient image transformers & distillation ...

Training data-efficient image transformers & distillation ...

arxiv.org

transformer through attention. This transformer-specific strategy outper-forms vanilla distillation by a significant margin. •Interestingly, with our distillation, image transformers learn more from a convnet than from another transformer with comparable performance. •Our models pre-learned on Imagenet are competitive when transferred to

  Transformers

Similar queries