An Empirical Study of Training Self-Supervised Vision Transformers

Mentioned by I-JEPA paper.