Section 1
##### What is R?

Section 2
##### Basic Operations in R

1. Expressions: Basic Idea
2. Constant Values: Numeric and Non-Numeric
3. Arithmetic: Operations and BODMAS
4. Conditions: Equality, Greater Than, Less Than, etc.
5. Function Calls: Introduction to R Functions
6. Symbols and Assignment
7. Keywords: NA, Inf, NaN, NULL, TRUE, FALSE
8. Naming a Variable: Generally accepted conventions

Section 3
##### Data Types and Data Structures

Section 4
##### Subsetting in R

1. Vector Subsetting
2. c() function: Creation of Vectors
3. Using rep() and seq() functions
4. Using factor() to convert vectors to factors
5. Using data.frame() to create data frames
6. Metadata access: dimnames(), rownames(), colnames()
7. Using matrix() to create matrices
8. Using array() to create arrays
9. Subsetting data frames: row subset, column subset, using subset() function
10. Assigning to a subset
11. Using is.na() to detect NA
12. Subsetting factors

Section 5
##### Additional Topics on Data Structures

1. The recycling rule: Arithmetic on vectors of unequal length
2. Type coercion: Character to Numeric
3. Automatic Type coercion
4. Coercing factors: Using as.factor() function
5. Changing factor levels
6. Attributes: attributes(), attr(), names() functions
7. Classes: Idea of OOP in R
8. Dates: As a special class
9. Formulas: As a special class
10. Exploring Objects: summary(), str(), dim() functions
11. Generic functions

Section 6
##### Data Import and Export

1. Text formats: Reading Delimited Files
2. read.table() function
3. Using read.fwf() function for fixed width files
4. Using readLines() for reading lines
5. Using write.csv() function to store data as CSV files
6. Reading Excel file: Package XLConnect
7. Reading SPSS file: Package foreign
8. Reading SAS data file: Package sas7bdat
9. Database connection: The idea of ODBC connections in Windows
10. RODBC package: Create and Query database from R
11. Basic SQL

Section 7
##### Control Structures and User Defined Functions

1. Conditional Statements
2. If statement: The Structure
3. If Else statement: The Structure
4. ifelse() function
5. Iteration
6. The for loop
7. The while loop
8. The repeat statement
9. lapply() function
10. sapply() function
11. apply() function
12. User defined function
13. Variable scoping: Global and Local Variables
14. Using user defined functions inside function definition

Section 8
##### Data Visualisation: Charting with R

1. The plot function
2. plot.new() function: Generating new plot object
3. plot.window() function: Creating window
4. points() function: Plotting points
5. axis() function: Generating Axis
6. box() function: Creating enclosure
7. title() function: Assigning title
8. par() function: Fixing plotting parameters
9. lines() function: Adding connector lines
10. Multi figure layout: Creating multiple charts in the same window
11. hist() function: Plotting histograms
12. Kernel Density Plot: A non-parametric estimate of the probability distribution
13. Comparing Groups via Kernel Density: Comparing two different probability distributions
14. Simple Bar Plot: Visualizing categorical data
15. Stacked Bar Plot: Understanding category composition
16. Grouped Bar Plot
17. Line Charts
18. Pie Charts
19. Boxplots: Understanding data distributions and outliers
20. Using Google Chart Tools with R (Package googleVis)
21. Geo Charts
22. Motion Charts

Section 9
##### Visualisation in R using googleVis

Section 10
##### Visualisation in R using ggplot2

SAS vs R

A Short Description of Both Software Suites

SAS and R are both important data analytics tools, used extensively by data scientists and data analysts. Choosing between them has been a long-standing debate in the data science community.

The Statistical Analysis System (SAS) language is a programming language used to read data from spreadsheets and databases and to output the results of statistical analysis as tables and graphs, and as RTF, HTML, and PDF documents. SAS is commonly used for financial analytics. It is easy to learn and offers strong technical support, but it is an expensive alternative to R.

R is mostly used by the research community, including professors and researchers. Since R is an open-source tool, you can get the latest version as soon as it is released. R is mainly used for statistical analysis, graphical representation, and reporting. Here, we take a brief, simplified look at the features, strengths, and weaknesses of each tool.

Features of SAS and R

Parameters of Comparison

Ease of Learning

SAS is a good choice for someone learning a new tool without any prior programming experience, which makes it well suited to beginners.

R is a bit harder to learn than SAS. It requires more hands-on programming, so even a small coding mistake can turn into a significant problem.

Managing Data

SAS is in a better position for handling and managing data, since data volumes are growing rapidly and SAS copes better with large datasets. R, by default, holds the data it works on in RAM, and adding more RAM every time the data grows is not a feasible option.
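The memory point can be checked directly in R. The following is a minimal sketch, assuming a plain numeric vector as a stand-in for a real dataset; the variable names `x`, `y`, `size_bytes`, and `ratio` are illustrative only. It measures how much memory an in-memory object occupies and shows that the requirement grows in step with the data.

```r
# Allocate one million doubles; R keeps the whole vector in memory.
x <- numeric(1e6)

# Each double takes 8 bytes, so the vector needs about 8 million bytes
# plus a small fixed header.
size_bytes <- as.numeric(object.size(x))
print(object.size(x), units = "Mb")

# Doubling the number of elements roughly doubles the memory required.
y <- numeric(2e6)
ratio <- as.numeric(object.size(y)) / size_bytes
print(ratio)
```

For datasets that approach the available RAM, the same measurement gives a quick estimate of whether the data will fit before it is loaded in full.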

Graphics

Graphics are a very important part of any data science or data analytics work, since the ability to visualize data is crucial to analyzing it. R is the winner in this area, thanks to packages such as ggplot2, lattice, and RGIS.

SAS is weaker in graphics, though Base SAS does have some graphical capabilities.

Working with Big Data

For Big Data work, R has some very good features that the Big Data, data science, and data analytics communities can use. If you want to deploy analytics at scale for machine learning, R is the language to choose. SAS has lately been making fast progress in this area as well, but it still lags behind R when it comes to integrating with Big Data tools such as Hadoop.

Industry Deployment

Since R is an open-source programming language, anybody can use it, so it is widely used among small and medium enterprises. SAS, on the other hand, is extremely costly and is mainly used by large organizations, mostly for data warehousing, data quality, data analytics, and reporting.

Cost

R is open source, and its numerous packages provide advanced graphical capabilities. It incorporates the latest features quickly, as programmers across the world keep adding packages. R is currently in high demand. Although SAS has been the market leader in corporate jobs, it is very expensive for start-ups.

Service Support

R has the biggest online community but no dedicated customer service, which can make it difficult to resolve technical issues. SAS, by contrast, has dedicated customer service alongside its community, so installation and other technical problems are sorted out easily.

The choice between SAS and R always depends on organizational requirements. Large organizations usually opt for SAS over R, while start-ups prefer R.