Get in Touch

Course Outline

Day One: Language Basics

  • Course Introduction
  • Understanding Data Science
    • Defining Data Science.
    • The Data Science Process.
  • Introduction to the R Language
  • Variables and Data Types
  • Control Structures (Loops and Conditionals)
  • R Scalars, Vectors, and Matrices
    • Defining R Vectors.
    • Matrices.
  • String and Text Manipulation
    • Character data type.
    • File Input/Output.
  • Lists
  • Functions
    • Introduction to Functions.
    • Closures.
    • Using lapply/sapply functions.
  • DataFrames
  • Labs for all sections

Day Two: Intermediate R Programming

  • DataFrames and File I/O
  • Reading data from files.
  • Data Preparation.
  • Built-in Datasets.
  • Visualization
    • Base Graphics Package.
    • Using plot(), barplot(), hist(), boxplot(), and scatter plots.
    • Heat Maps.
    • ggplot2 package (qplot(), ggplot()).
  • Data Exploration with dplyr
  • Labs for all sections

Day Three: Advanced Programming with R

  • Statistical Modeling with R
    • Statistical Functions.
    • Handling Missing Values (NA).
    • Distributions (Binomial, Poisson, Normal).
  • Regression
    • Introduction to Linear Regression.
  • Recommendation Systems.
  • Text Processing (tm package / Word Clouds).
  • Clustering
    • Introduction to Clustering.
    • K-Means Clustering.
  • Classification
    • Introduction to Classification.
    • Naive Bayes.
    • Decision Trees.
    • Training using the caret package.
    • Evaluating Algorithms.
  • R and Big Data
    • Connecting R to databases.
    • The Big Data Ecosystem.
  • Labs for all sections

Requirements

  • A basic background in programming is recommended.

Setup

  • A modern laptop.
  • The latest version of RStudio and the R environment installed.
 21 Hours

Number of participants


Price per participant

Testimonials (7)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories