RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTA

This is just training BERT on more images.