SHAP for Feature Selection and Hyperparameter Tuning
Use SHAP for optimal feature selection while tuning parameters
Feature selection and hyperparameter tuning are two important steps in every machine learning task. Most of the time they improve performance, but at the cost of being time-consuming: the more parameter combinations we try, or the more thorough the selection process, the longer the procedure takes. This is a hard limit we can't avoid. What we can do is get the most out of our pipeline. We face different possibilities, two of the most convenient being:
- combining the tuning process with the selection of features;
- adopting SHAP (SHapley Additive exPlanations) to make the whole procedure more generalizable and accurate (see the sketch after this list).
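As a minimal illustration of the second point, here is a sketch of how SHAP importances for a tree-based model can be computed with the shap library and turned into a global feature ranking. The dataset and model below are placeholders, and the parameters are purely illustrative.

```python
import numpy as np
import shap
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

# Toy data standing in for a real dataset.
X, y = make_classification(n_samples=500, n_features=10, random_state=42)

model = GradientBoostingClassifier(random_state=42).fit(X, y)

# TreeExplainer computes SHAP values efficiently for tree ensembles.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)

# The mean absolute SHAP value per feature gives a global importance
# ranking that a selection algorithm can use instead of the model's
# built-in importances.
importances = np.abs(shap_values).mean(axis=0)
ranking = np.argsort(importances)[::-1]
print(ranking)
```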
Combining the tuning process with the optimal choice of features may be a need of every ranking-based selection algorithm. A ranking selection consists of iteratively dropping the least important features and retraining the model until convergence is reached. The model used for feature selection may differ, in parameter configuration or in type, from the one used for the final fit and prediction, which can result in suboptimal performance. This is the case, for example, with RFE (Recursive Feature Elimination) or Boruta, where the features selected…
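To make the mismatch described above concrete, the following sketch uses scikit-learn's RFE with one estimator to rank and eliminate features, then fits a different model on the surviving columns. All names, models, and parameter values here are illustrative, not a prescribed recipe.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# An estimator used only to rank and iteratively drop features...
selector = RFE(RandomForestClassifier(random_state=0), n_features_to_select=8)
selector.fit(X, y)

# ...while a different (or differently configured) model performs the
# final fit: this is exactly the source of suboptimal performance
# discussed above.
final_model = LogisticRegression(max_iter=1000)
final_model.fit(X[:, selector.support_], y)
```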