Masked Autoencoders Are Scalable Vision Learners

Paper Summary

Paper

Introduction

Method

Evaluation
