An Image Is Worth 16x16 Words:Transformers For Image Recognition At Scale

Paper Summary

Paper

Introduction

Method

Observations from the paper

Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • A Simple Framework for Contrastive Learning of Visual Representations
  • Big Self-Supervised Models are Strong Semi-Supervised Learners
  • Big Self-Supervised Models Advance Medical Image Classification
  • Masked Autoencoders Are Scalable Vision Learners
  • DEIT - Training data-efficient image transformers & distillation through attention