Introduction
  • Introduction
  • Managing Expectations and Course Orientation
  • Data Pre-Processing as Integral Part of Data Science
  • Let's See an R Example of Data Pre-Processing
  • Lures Example Script
Data Import and Data Structuring
  • Script: Data import
  • Importing Data and Snippets
  • Using fread to handle big data fast
  • Choosing the right class for your data
  • Further R Exercises
Cleaning Your Data
  • Script: Data cleaning
  • tidyr - What tidy data looks like
  • Wide to long data format
  • Splitting columns
  • Long to wide data format
Querying and Filtering Data with data.table
  • Script: Querying with data.table
  • What is data.table?
  • Basic queries
  • Queries at column level
  • The by parameter for queries
  • Update on recycle queries
  • Keys
  • Data.table exercises
  • Data.table solutions
Queries and Filtering Exercises
  • Query exercises INTRO
  • 10 Exercises on 'data.frame'
  • Data.frame Exercise Script
  • Data.frame Solutions 1-4
  • Data.frame Solutions 5-10
  • 10 Exercises on 'data.table'
  • Data.table Exercise Script
  • Data.table Solutions 1-4
  • Data.table Solutions 5-10
Using dplyr on One and Multiple Datasets
  • Script: dplyr
  • Single Table Verbs in 'dplyr'
  • Two Table Verbs - Mutating Joins
  • Two Table Verbs - Filtering Joins and handling of ID mismatches
  • Two Table Verbs - Set Operations
Integrate SQL into R
  • Script: Integrate SQL
  • Get package dbplyr
  • R to SQL Translator
  • Using SQL within R
  • Set Up a SQLite Database in R
Detecting Outliers
  • Outlier Script
  • Introduction to Outlier Detection
  • Detecting Outliers in Univariate Datasets
  • Detecting Outliers in Multivariate Datasets
Working with Strings - Regular Expressions
  • Script: Working with Strings
  • Regular Expressions and Gsub
  • What You Should Know about Strings in R
  • The Gsub Family of Functions and Regular Expressions
  • Regular Expressions Syntax
  • A Great Add-On Package
  • Working with Strings in R: Exercise with Solution
Working with Dates and Time
  • Data management and time series INTRO
  • Importing a Time Series From Excel
  • Section Script
  • Classes POSIXt, Date and Chron
  • Lubridate: Input and Time Zones
  • Lubridate: Weekdays and Intervals
  • Lubridate: Exercise Data Frame
  • Lubridate: Calculations and Leap Years
  • Lubridate: Data Handling Exercise
  • Further R Exercises