RoBERTa: A Robustly Optimized BERT Pretraining Approach RoBERTA This is just training BERT on more images.