Data analysis process

The data analysis process is composed of the following steps:

  1. The statement of problem
  2. Obtain your data
  3. Clean the data
  4. Normalize the data
  5. Transform the data
  6. Exploratory statistics
  7. Exploratory visualization
  8. Predictive modelling
  9. Validate your model
  10. Visualize and interpret your results
  11. Deploy your solution

All of above activities can be grouped as follows:
The Problem → Data Preparation → Data Exploration → Predictive modeling → Visualization of Results

The problem
The problem is defined as asking a high-level question, such as what’s going to be the gold price in the next month.

Data preparation
Data preparation is about how to obtain, clean, normalize, and transform the data which is suitable for modelling.

Data exploration
Data exploration is used to find patterns, connections, and relations in the data, by looking at the data in graphical and statistical form.

Predictive modelling
Predictive modelling is a process used in data analysis to create or choose a statistical model trying to best predict the probability of an outcome.

Visualization of results
How is it going to present the result.

Quantitative versus qualitative data analysis
Quantitative data: It is numerical measurements expressed in terms of numbers (Structured Data, Statistical analysis, Objective conclusions)
Qualitative data: It is categorical measurements expressed in terms of natural language descriptions (Unstructured data, Summary, Subjective conclusions)


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s