ML Latency No More
Common Ways to Reduce ML Prediction Latency to Sub X ms
21 min read · Apr 25, 2022
Machine Learning (ML) systems don’t exist until they are deployed.
Unfortunately, prediction latency is one of those rough edges that hurts badly.
And it hurts too late in the product cycle.