AdaLN
Used in the pi0 paper.
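AdaLN, short for Adaptive Layer Normalization, is a variant of layer normalization in which the scale and shift applied after normalization are computed from a conditioning input instead of being fixed learned parameters. The name is also used for a Vision Transformer (ViT) architecture designed for multidomain learning, particularly for extracting building information from satellite and street-view images.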
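
To illustrate what "adaptive" means here, the following is a minimal pure-Python sketch, not the implementation from any particular paper. It assumes the common formulation in which a conditioning vector is passed through two linear maps to produce a per-feature scale and shift, and it assumes the `1 + scale` convention so that zero-initialized maps leave the normalized input unchanged. All names (`ada_layer_norm`, `cond`, `w_scale`, `w_shift`, and so on) are hypothetical.

```python
import math


def layer_norm(x, eps=1e-5):
    """Normalize a feature vector to zero mean and unit variance (no affine)."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def linear(cond, weight, bias):
    """Matrix-vector product plus bias; `weight` has one row per output feature."""
    return [sum(w * c for w, c in zip(row, cond)) + b
            for row, b in zip(weight, bias)]


def ada_layer_norm(x, cond, w_scale, b_scale, w_shift, b_shift):
    """Layer norm whose scale and shift are predicted from `cond`.

    Assumption: the modulation is normed * (1 + scale) + shift.
    """
    scale = linear(cond, w_scale, b_scale)
    shift = linear(cond, w_shift, b_shift)
    normed = layer_norm(x)
    return [n * (1.0 + s) + t for n, s, t in zip(normed, scale, shift)]


# Example: 4 features conditioned on a 2-dimensional vector.
x = [1.0, 2.0, 3.0, 4.0]
cond = [0.5, -1.0]
zero_w = [[0.0, 0.0] for _ in range(4)]
zero_b = [0.0] * 4

# With all-zero maps the layer behaves like plain layer normalization.
plain = ada_layer_norm(x, cond, zero_w, zero_b, zero_w, zero_b)

# A nonzero shift bias moves every feature by the same offset.
shifted = ada_layer_norm(x, cond, zero_w, zero_b, zero_w, [2.0] * 4)
```

In this sketch the same input `x` gives different outputs for different `cond` values once the maps are nonzero, which is the property that distinguishes the adaptive form from ordinary layer normalization with fixed parameters.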