October 2, 2012

Correlation Matrix - On variables

Correlation Matrix


On variables

The correlation matrix permit a clear corresponding view between variables. We can see lots of variables extremely correlated or not or neutral .The last column and last line correspond to the variable to determine (the violent crimes). For this value,we discover a large data set with different connections. the correlation force is light and give a complex model to research.

On line/column 60 we see a value uncorrelated with the value between 50 and 70. The value 60 corresponding to the PctSpeakEnglOnly. and values extremely correlated between 50 and 59 correspond to the immigration information. It appear clearly that immigrants don't talk only English.
Between 61 and 70 , the variables describe the housing status. I don't understand why housing status can be an important information to speak only English. I suppose an important correlation with the immigrants statistics that create this link.

Variables between 79 and 87 correspond to the housing business that is extremely correlated.

A grouping area between 12 and 58 whose corresponding to information relative to the social environment.This variable list appear similar in back analysis.The list is:


  • medIncome: median household income (numeric - decimal) 
  • pctWWage: percentage of households with wage or salary income in 1989 (numeric - decimal) 
  • pctWFarmSelf: percentage of households with farm or self employment income in 1989 (numeric - decimal) 
  • pctWInvInc: percentage of households with investment / rent income in 1989 (numeric - decimal) 
  • pctWSocSec: percentage of households with social security income in 1989 (numeric - decimal) 
  • pctWPubAsst: percentage of households with public assistance income in 1989 (numeric - decimal) 
  • pctWRetire: percentage of households with retirement income in 1989 (numeric - decimal) 
  • medFamInc: median family income (differs from household income for non-family households) (numeric - decimal) 
  • perCapInc: per capita income (numeric - decimal) 
  • whitePerCap: per capita income for caucasians (numeric - decimal) 
  • blackPerCap: per capita income for african americans (numeric - decimal) 
  • indianPerCap: per capita income for native americans (numeric - decimal) 
  • AsianPerCap: per capita income for people with asian heritage (numeric - decimal) 
  • HispPerCap: per capita income for people with hispanic heritage (numeric - decimal) 
  • NumUnderPov: number of people under the poverty level (numeric - decimal) 
  • PctPopUnderPov: percentage of people under the poverty level (numeric - decimal) 
  • PctLess9thGrade: percentage of people 25 and over with less than a 9th grade education (numeric - decimal) 
  • PctNotHSGrad: percentage of people 25 and over that are not high school graduates (numeric - decimal) 
  • PctBSorMore: percentage of people 25 and over with a bachelors degree or higher education (numeric - decimal) 
  • PctUnemployed: percentage of people 16 and over, in the labor force, and unemployed (numeric - decimal) 
  • PctEmploy: percentage of people 16 and over who are employed (numeric - decimal) 
  • PctEmplManu: percentage of people 16 and over who are employed in manufacturing (numeric - decimal) 
  • PctEmplProfServ: percentage of people 16 and over who are employed in professional services (numeric - decimal) 
  • PctOccupManu: percentage of people 16 and over who are employed in manufacturing (numeric - decimal) ######## 
  • PctOccupMgmtProf: percentage of people 16 and over who are employed in management or professional occupations (numeric - decimal) 
  • MalePctDivorce: percentage of males who are divorced (numeric - decimal) 
  • MalePctNevMarr: percentage of males who have never married (numeric - decimal) 
  • FemalePctDiv: percentage of females who are divorced (numeric - decimal) 
  • TotalPctDiv: percentage of population who are divorced (numeric - decimal) 
  • PersPerFam: mean number of people per family (numeric - decimal) 
  • PctFam2Par: percentage of families (with kids) that are headed by two parents (numeric - decimal) 
  • PctKids2Par: percentage of kids in family housing with two parents (numeric - decimal) 
  • PctYoungKids2Par: percent of kids 4 and under in two parent households (numeric - decimal) 
  • PctTeen2Par: percent of kids age 12-17 in two parent households (numeric - decimal) 
  • PctWorkMomYoungKids: percentage of moms of kids 6 and under in labor force (numeric - decimal) 
  • PctWorkMom: percentage of moms of kids under 18 in labor force (numeric - decimal) 

No comments:

Post a Comment