Vision Transformer (ViT) from Scratch
PyTorch
Patch embeddings, multi-head self-attention, and training on CIFAR-10 with strong augmentation.
transformers
cifar10
timm-like