October 1, 2012

Data analysis - Introduction

Data analysis

Introduction to the subject

This data analysis covers several aspects of the data set and could be extended with further ideas.
We start with a study of the correlations in the data, between individuals and between variables. After this step, we apply a Principal Component Analysis to reduce the number of variables in our data set.
The last step of the analysis is a classification using three methods.
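
The first two steps can be sketched as follows. This is a minimal, dependency-free illustration on synthetic data, not the procedure actually used in this study: the function names (standardize, correlation_matrix, first_component) and the generated values are assumptions made for the example. It computes the correlation matrix between variables, then extracts the first principal component by power iteration on that matrix.

```python
import math
import random


def standardize(rows):
    """Center each column and scale it to unit sample standard deviation."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    stds = [
        math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1))
        for j in range(p)
    ]
    return [[(r[j] - means[j]) / stds[j] for j in range(p)] for r in rows]


def correlation_matrix(z):
    """Pearson correlations between the columns of standardized data z."""
    n, p = len(z), len(z[0])
    return [
        [sum(r[i] * r[j] for r in z) / (n - 1) for j in range(p)]
        for i in range(p)
    ]


def first_component(corr, iterations=500):
    """Power iteration: dominant eigenvector and eigenvalue of corr."""
    p = len(corr)
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        w = [sum(corr[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient v^T C v gives the eigenvalue estimate.
    eigenvalue = sum(
        v[i] * sum(corr[i][j] * v[j] for j in range(p)) for i in range(p)
    )
    return v, eigenvalue


# Synthetic data (hypothetical): variables 0 and 1 are strongly correlated,
# variable 2 is independent of both.
random.seed(0)
data = []
for _ in range(200):
    a = random.gauss(0, 1)
    data.append([a, a + 0.1 * random.gauss(0, 1), random.gauss(0, 1)])

z = standardize(data)
corr = correlation_matrix(z)
axis, eigenvalue = first_component(corr)

# Coordinates of each individual on the first principal axis.
scores = [sum(r[j] * axis[j] for j in range(len(axis))) for r in z]
# Share of the total variance carried by the first component.
explained = eigenvalue / len(corr)
```

Working on the correlation matrix rather than the covariance matrix gives every variable the same weight, which is a common choice when variables have different scales. Whether that choice suits this data set is a decision left to the analysis itself.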

Subject of the study

The data come from the Machine Learning Repository of the University of California Irvine.
URL: http://archive.ics.uci.edu/ml/datasets/Communities+and+Crime

The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from the 1995 FBI UCR.

There is no temporal aspect in this data set. The timing and the evolution of the crimes are not taken into account.

There are 1994 instances and 128 attributes.
Each instance corresponds to a community within a state.
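
A quick way to check these dimensions after download is sketched below. It assumes the data arrive as comma-separated text without a header row and with `?` marking missing values; that format, the helper names (parse_rows, shape) and the two sample rows are assumptions for illustration, and the rows are made up rather than taken from the data set.

```python
import csv
import io


def parse_rows(text, missing="?"):
    """Parse comma-separated text into rows, mapping the missing marker to None."""
    reader = csv.reader(io.StringIO(text))
    return [
        [None if cell.strip() == missing else cell.strip() for cell in row]
        for row in reader
        if row
    ]


def shape(rows):
    """Return (instances, attributes); raise if rows differ in length."""
    widths = {len(r) for r in rows}
    if len(widths) > 1:
        raise ValueError(f"inconsistent row lengths: {sorted(widths)}")
    return len(rows), (widths.pop() if widths else 0)


# Made-up two-row sample, NOT real records from the data set.
sample = "1,?,CommunityA,0.25,0.10\n2,?,CommunityB,?,0.40\n"
rows = parse_rows(sample)
n_instances, n_attributes = shape(rows)
```

Applied to the full file, the result of shape can be compared with the 1994 instances and 128 attributes stated above.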

