How to Fine-Tune a Pretrained Vision Transformer on Satellite Data

A step-by-step tutorial in PyTorch Lightning

Caroline Arnold
Towards Data Science
6 min readMar 21, 2024

--

Image created by the author using Midjourney.

The Vision Transformer is a powerful AI model for image classification. Released in 2020, it brought the efficient transformer architecture to computer vision.

In pretraining, an AI model ingests large amounts of data and learns common patterns. The…

--

--

AI Consultant, PhD in Physics. I write about artificial intelligence, data analysis, science, and diversity. https://medium.com/visual-data-science