Due in class, Tuesday Sep 30.
In class this week, we have continued analysing data from the General Social Survey (GSS). The data is available from General Social Survey 1972-2012, with detailed descriptions of each variable in the codebook.
For this week's homework, make use of GSS's download website to download variables by name.
Pick five variables from the survey: YEAR, SEX, REGION, CONINC, and one variable of your choice that is not included in the file we used in class. Download them as a CSV data file together with a codebook.
- Read the CSV file into R. You will notice that all variables are coded as integers. Write an R script that adds the textual labels to the dataset, based on the codebook. Clean the data at the same time, i.e. assign missing values (NA) wherever appropriate.
- Find a good graphical representation of all five variables, and write a paragraph to summarize the chart.
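As a starting point for the recoding step, here is a minimal sketch of one possible approach using only base R. Everything specific in it is an assumption for illustration: the toy data frame stands in for your downloaded file, the label lookups are deliberately incomplete, and the missing-value codes for CONINC are placeholders. Take the real codes and labels from your own codebook.

```r
# Toy stand-in for the downloaded extract (hypothetical values).
# In practice you would use read.csv() on your own file instead.
gss <- data.frame(
  YEAR   = c(1972, 1990, 2012, 2012),
  SEX    = c(1, 2, 2, 1),
  REGION = c(1, 9, 0, 5),
  CONINC = c(25000, 0, 41250, 999999)
)

# ASSUMED codebook entries: verify every code and label against your codebook.
# The region lookup is intentionally truncated to keep the example short.
sex_labels     <- c("1" = "Male", "2" = "Female")
region_labels  <- c("1" = "New England", "5" = "South Atlantic", "9" = "Pacific")
coninc_missing <- c(0, 999998, 999999)  # placeholder "not applicable / no answer" codes

# factor() turns integer codes into labels; any code that is not
# listed in `levels` automatically becomes NA.
gss$SEX <- factor(gss$SEX,
                  levels = names(sex_labels),
                  labels = unname(sex_labels))
gss$REGION <- factor(gss$REGION,
                     levels = names(region_labels),
                     labels = unname(region_labels))

# For a numeric variable, overwrite the special codes with NA.
gss$CONINC[gss$CONINC %in% coninc_missing] <- NA

# Quick checks: structure, and number of missing values per column.
str(gss)
colSums(is.na(gss))
```

Keeping the lookups as named vectors makes the script easy to check against the codebook line by line, and the final `colSums(is.na(gss))` call shows how many values were set to missing in each variable.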
Deliverables: An R markdown file (.Rmd) submitted to Blackboard with all of the R code, all the charts and the additional write-up & interpretation.
Some great answers (coming up)