Explainable Artificial Intelligence applied on models trained for prediction of whether a customer will subscribe to a bank term deposit or not.
Bank Marketing dataset from Kaggle:
-
The data is related with direct marketing campaigns (phone calls) of a Portuguese banking institution. The classification goal is to predict if the client will subscribe a term deposit (variable y). The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was required, in order to access if the product (bank term deposit) would be ('yes') or not ('no') subscribed.
-
41188 data points, 20 user features (detailed description can be found on Kaggle)
-
very imbalanced data:
-
an extensive data exploration was conducted to get overall insight into the data
-
various different preprocessing as well as feature extraction techniques were tested (ones with the best performance are in the notebook)
Accuracy is not a good evaluation method for imbalanced datasets, which is why precision is used instead (under assumption that false positives would make worse impact than false negatives).
- Logistic Regression
- Decision Trees
- Random Forest
- Suport Vector Machine
- XGBoost
GridSearch was used to determine best parameters!
For analysis and interpretability of different models two xAI libaries were used: