Skip Connection
Used in ResNet. Also saw this in the World Models paper.
How are skip connections implemented in code? See annotated transformer
Used in ResNet. Also saw this in the World Models paper.
How are skip connections implemented in code? See annotated transformer