Anton Konushin
Transformers in computer vision: the ViT and Swin models. Applying ideas from transformers to convolutional architectures: the ConvNeXt model. Author's Telegram channel: https://t.me/ktoshiks