How Fast Is MLX? A Comprehensive Benchmark on 10 Apple Silicon Chips and 3 CUDA GPUs

A benchmark of the main operations and layers on MLX, PyTorch MPS and CUDA GPUs.

Tristan Bilot
Towards Data Science
6 min readFeb 2, 2024

--

Image by author: Example of benchmark on the softmax operation

In less than two months since its first release, Apple’s ML research team’s latest creation, MLX, has already made significant strides in the ML community. It is remarkable to see how…

--

--

PhD student in GNNs for cybersecurity. Basically writing about deep learning, programming and performance 📝.