Easy Distribution-Free Conformal Intervals for Time Series

Using Python and your test set to derive distribution-agnostic intervals

Published in Towards Data Science

7 min read · Feb 15, 2023

Just as important as producing a point estimate in a forecasting application is determining how far the actual value is likely to fall from the prediction. Most forecasts are not 100% accurate, so a good sense of the range of possible outcomes becomes crucial when implementing a model. For models with an underlying functional form, such as ARIMA, confidence intervals can be derived from the assumed distribution of the residuals and the standard errors of the estimation. These intervals are logical in that they widen the further a forecast extends beyond the last known value: as uncertainty accumulates, it is represented mathematically in a way that matches our intuitions. And if the model assumptions hold, a 95% confidence interval can be expected to contain about 95% of the actual values over repeated forecasts.
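To make the widening concrete, here is a minimal sketch for the simplest case, a random walk (ARIMA(0,1,0)), whose h-step forecast error has standard deviation sigma * sqrt(h). The function name `random_walk_interval` and the example values are illustrative assumptions, and the residuals are assumed to be Gaussian with a known standard deviation.

```python
import math
from statistics import NormalDist


def random_walk_interval(last_value, sigma, horizon, level=0.95):
    """Return (lower, upper) bounds for steps 1..horizon of a random-walk forecast.

    Assumes Gaussian one-step errors with known standard deviation `sigma`.
    The point forecast at every step is simply the last observed value.
    """
    # Two-sided critical value, about 1.96 for a 95% interval.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    bounds = []
    for h in range(1, horizon + 1):
        # Forecast error variance grows linearly with the horizon h.
        half_width = z * sigma * math.sqrt(h)
        bounds.append((last_value - half_width, last_value + half_width))
    return bounds


# Hypothetical inputs: last observation 100, residual standard deviation 2.
intervals = random_walk_interval(last_value=100.0, sigma=2.0, horizon=4)
widths = [upper - lower for lower, upper in intervals]
for step, (lower, upper) in enumerate(intervals, start=1):
    print(f"step {step}: [{lower:.2f}, {upper:.2f}]")
```

Each successive interval is wider than the last, and the interval four steps out is twice as wide as the one-step interval because the width scales with the square root of the horizon.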

Conformal Prediction

However, when dealing with a machine learning model that cannot be represented with a simple equation and assumes no distribution in the underlying data, creating a sound confidence interval becomes more of a challenge. A popular solution to this problem is conformal prediction. The GitHub repository, Awesome Conformal Prediction…

Great to see another open source forecasting library adding conformal prediction https://github.com/valeman/awesome-conformal-prediction

Scalecast is now listed on Awesome Conformal Prediction in Python section
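As a rough illustration of the idea, the sketch below builds a naive split-conformal interval from test-set residuals: it takes a finite-sample-adjusted quantile of the absolute errors and applies it as a constant band around new forecasts. This is not scalecast's implementation; the function names and the toy numbers are assumptions made for the example, and the coverage guarantee relies on the residuals being exchangeable, which time series data can violate.

```python
import math


def conformal_half_width(actuals, predictions, alpha=0.2):
    """Half-width of a (1 - alpha) interval from held-out absolute residuals."""
    scores = sorted(abs(a - p) for a, p in zip(actuals, predictions))
    n = len(scores)
    # Finite-sample correction: use the ceil((n + 1) * (1 - alpha))-th smallest score.
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        raise ValueError("too few held-out observations for this alpha")
    return scores[rank - 1]


def conformal_intervals(forecasts, half_width):
    """Apply the same band around every point forecast."""
    return [(f - half_width, f + half_width) for f in forecasts]


# Toy test-set actuals and the model's predictions for the same periods.
test_actuals = [102.0, 98.0, 105.0, 110.0, 95.0, 101.0, 99.0, 107.0, 103.0, 100.0]
test_predictions = [100.0, 101.0, 104.0, 105.0, 99.0, 107.0, 92.0, 99.0, 112.0, 110.0]

half_width = conformal_half_width(test_actuals, test_predictions, alpha=0.2)
intervals = conformal_intervals([104.0, 106.0], half_width)
print(half_width)   # 9.0
print(intervals)    # [(95.0, 113.0), (97.0, 115.0)]
```

Unlike the ARIMA-style intervals described earlier, the band here has the same width at every horizon, because a single quantile of the test-set errors is reused for all future steps.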


Written by Michael Keith
