Speeding up vision transformer prediction by 9 times with PyTorch, ONNX and TensorRT
How to use 16-bit floats, TensorRT, network rewriting and multi-threading to dramatically speed up deep learning model prediction
11 min read · Jun 4, 2023
Vision transformers such as UNET and SwinUNETR are state-of-the-art in computer vision tasks such as semantic…