10.1 Vision Transformer (ViT)
10.2 Pretrained Transformer Models
10.3 Scaling of Decoder Transformer Models