Introduction
|
|
Formatting Data in Spreadsheets
|
Never modify your raw data. Always make a copy before making any changes.
Keep track of all of the steps you take to clean your data.
Organize your data according to tidy data principles.
Data stored in common spreadsheet formats will often not be read correctly into data analysis software, introducing errors into your data.
Exporting data from spreadsheets to formats like CSV or TSV puts it in a format that can be used consistently by most programs.
|
Introduction to R and RStudio
|
R is a programming language. RStudio is an Integrated Development Environment for working with R.
Tabular data is stored in R as a dataframe object.
You can get a sense of your data with summary() , str() , names() , dim() and other functions discussed here.
|
Subsetting Data in R with dplyr
|
|
Visualizing Data in R with ggplot
|
|