1. Introduction

Let’s imagine that we run a business, say a toy store (why not? 😁), with an online shop, and that we want to learn about the users of that shop to improve our upcoming marketing actions and the new season's products. How do we get this information? Google Analytics (from now on, GA) is widely used to track user activity on web pages and mobile apps.
SHAP is one of the most well-known and commonly used model explainability libraries. In this series of posts, we’ll explain its theoretical basis and some fundamentals of its implementation, and we’ll dig deep into some advanced usage points.

Introduction

In our daily life we are used to creating models for every challenge we face. In many cases, these models are complex and we cannot easily analyze their behavior. This is the case, for example, with neural networks or, more generally, black box models.
%matplotlib notebook
import os
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib as mlp
import matplotlib.pyplot as plt
from sklearn.preprocessing import MinMaxScaler
from sklearn.cluster import KMeans, DBSCAN, SpectralClustering
from sklearn.decomposition import PCA
from sklearn.neighbors import NearestNeighbors
pd.set_option('display.max_columns', None)

In this post we will analyze some clustering models and see why understanding and interpreting a model matters when aiming for the best possible performance.
In this post we will explain what time series are, and why their analysis and prediction are a particular case of Machine Learning problem. Afterwards, we will use FB Prophet to model a specific example. For that, we will use data from a shipping company with several delivery stations, where clients leave packages (letters, boxes, etc.) for shipment. The series represents the total number of packages received each day by the whole network of stations.
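As a rough illustration of how such a series could be built, the sketch below sums per-station counts into one total per day. The record layout, station names and numbers are hypothetical and are not taken from the shipping company's data; only the standard library is used.

```python
from collections import defaultdict
from datetime import date

# Hypothetical per-station records: (day, station id, packages received).
records = [
    (date(2021, 1, 1), "station_a", 12),
    (date(2021, 1, 1), "station_b", 7),
    (date(2021, 1, 2), "station_a", 9),
    (date(2021, 1, 2), "station_b", 11),
]


def daily_totals(rows):
    """Sum packages over all stations, returning (day, total) pairs sorted by day."""
    totals = defaultdict(int)
    for day, _station, packages in rows:
        totals[day] += packages
    return sorted(totals.items())


series = daily_totals(records)
for day, total in series:
    print(day.isoformat(), total)
```

Each entry of `series` is one observation of the network-wide daily total, which is the kind of single-variable series a forecasting model would take as input.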
In this article, we will see why data analysis is such an impactful part of any data project. Starting from a public Telco Churn dataset, we will go through the main steps to perform an insightful data analysis for every step of the project:

- Assessing project feasibility
- Assessing model performance
- Monitoring production models

Introduction to our Customer Churn project

Customer churn (also known as attrition) is defined as the number of customers who stopped using a service in a given timeframe.
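To make the definition concrete, here is a minimal sketch that counts churned customers between two snapshots of active customer IDs. The customer IDs are made up, and the churn rate shown (churned customers divided by customers active at the start of the period) is one common convention assumed here, not necessarily the one used with the Telco dataset.

```python
def churned_customers(active_at_start, active_at_end):
    """Return the customers active at the start of the period but not at the end."""
    return set(active_at_start) - set(active_at_end)


def churn_rate(active_at_start, active_at_end):
    """Churned customers divided by customers active at the start (assumed convention)."""
    start = set(active_at_start)
    if not start:
        raise ValueError("no active customers at the start of the period")
    return len(churned_customers(start, active_at_end)) / len(start)


# Hypothetical snapshots of active customer IDs for one timeframe.
start_of_month = {"c1", "c2", "c3", "c4"}
end_of_month = {"c1", "c3", "c5"}  # c5 is a new customer and does not count as churn

churned = churned_customers(start_of_month, end_of_month)
rate = churn_rate(start_of_month, end_of_month)
print(sorted(churned), rate)
```

Customers who join during the period are ignored on purpose: they were not in the starting base, so they can neither churn nor dilute the rate under this convention.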