AdaLN

Used in the pi0 paper.

AdaLN, short for Adaptive Layer Normalization, is a Vision Transformer (ViT) architecture designed for multidomain learning, particularly in the context of extracting building information from satellite and street view images