Far too often, even experienced data scientists get confused about what feature engineering really means. They may mistake […]
In this previous article we described how to construct basic tools such as the “confusion matrix” and Lift/Gain charts to evaluate […]
The process of developing classification models for business use cases follows a sequence that is represented in this diagram […]
In a previous article we discussed the application of principal component analysis (PCA) using RapidMiner to reduce the dimension of […]
We have written about the importance of calculating customer lifetime value (CLV) as a means to quantify the benefit from […]
Principal component analysis (PCA) is a technique according to Wikipedia that “uses an orthogonal transformation to convert a set of […]
In today’s data overloaded world, there are 5 types of data which require the use of big data […]
Inspired by the really cool video series on text mining by Vancouver Data Blog, we are going to kick […]
Understanding the needs of your customers is a critical aspect of business. This requires proper customer segmentation. There are many […]
In this post on data science for all, let us explore the interesting idea of predicting machine failure […]
simafore.ai