Introduction to Data Quality

How many times have you heard managers and colleagues complain about the quality of the data in a particular report, system, or database? People often describe poor-quality data as unreliable or untrustworthy. Defining exactly what high- or low-quality data is, why it sits at a given quality level, and how to manage and improve it is often a trickier task.

Datification 2016

Big Data, to be effective, must recognize the following voices (in order):

  1. VOC=Voice of the Customer
  2. VOB=Voice of the Business
  3. VOP=Voice of the Process

Datification is the link between the three voices. It also captures and displays relevant metrics from all your improvement projects.

Datification: What's the big deal, and why does your business need it?

What is Datification?

A Checklist for Variable and Feature Selection

  1. Do you have domain knowledge? If yes, construct a better set of “ad hoc” features.
  2. Are your features commensurate? If no, consider normalizing them.
  3. Do you suspect interdependence of features? If yes, expand your feature set by constructing conjunctive features or products of features, as far as your computing resources allow (see example of use in Section 4.4).
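
Items 2 and 3 of the checklist can be illustrated with a minimal sketch. It assumes z-score standardization as the normalization and pairwise products as the constructed features; both are just one reasonable choice, not something the checklist prescribes. The function names (`standardize`, `add_pairwise_products`) and the toy data are hypothetical, introduced only for illustration.

```python
from itertools import combinations
from statistics import mean, pstdev


def standardize(rows):
    """Rescale each column to zero mean and unit variance (z-scores)."""
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    # Guard against constant columns, whose standard deviation is 0.
    stds = [pstdev(col) or 1.0 for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


def add_pairwise_products(rows):
    """Append the product of every pair of distinct features to each row."""
    return [
        list(row) + [row[i] * row[j] for i, j in combinations(range(len(row)), 2)]
        for row in rows
    ]


# Hypothetical toy data: two features on very different scales
# (for example, a height in cm and an income in dollars).
raw = [[150.0, 20000.0], [160.0, 40000.0], [170.0, 60000.0]]

scaled = standardize(raw)                 # item 2: make features commensurate
expanded = add_pairwise_products(scaled)  # item 3: add product features
```

Note that the number of pairwise products grows quadratically with the number of features, which is why the checklist ties this step to the computing resources available.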