Section 1
##### Introduction to Analytics

Introduction to Excel

Conditional Formatting

Data Summarization techniques

Graphical summary using SAS/GRAPH: Introduction to Bar graph

Graphical summary using SAS/GRAPH: Introduction to Pie graph

Graphical summary using SAS/GRAPH: Introduction to Histogram, Box plots, Scatter diagram

Descriptive Statistics-Introduction to various measures of Central Tendency

Introduction to the measures of Dispersion: Range, Mean Deviation, Standard Deviation

Section 2
##### Understanding Probability and Probability Distribution

Introduction to Probability theory

Types of probability distribution – Discrete Distribution and Continuous distribution

Understanding Probability Mass Function and Probability Density Function

Normal Distribution and Standard Normal Distribution

Normal plot using Proc GPLOT procedure in SAS

Application of Normal distribution in Analytics with real life examples

Binomial Distribution and Binomial plot using PROC GPLOT procedure in SAS

Poisson distribution and Poisson plot using Proc GPLOT procedure in SAS

Application of Binomial and Poisson distribution in Analytics with real life examples

Section 3
##### Introduction to Sampling Theory and Estimation

Concept of Population and Sample

Use of PROC SURVEYSELECT procedure in SAS

Introduction to Some important terminologies

Parameter and Statistic

Properties of a good estimator

Standard Deviation and Standard Error

Point and Interval Estimation

Confidence level and level of Significance

Constructing Confidence Intervals

Formulation of Null and Alternative hypothesis

Performing simple test of Hypothesis

Section 5
##### Statistical Significance of T-Tests Chi Square Tests and Analysis of Variance

Performing test of one sample mean using Proc ttest

Difference between two group means (independent sample) using Proc ttest

Difference between two group means (Paired sample) using Proc ttest

Performing Chi-square tests: Test of Independence

Performing one-way ANOVA with PROC ANOVA and PROC GLM procedure

Performing post-hoc multiple comparisons tests in PROC GLM using Tukey’s mean test

Section 6
##### Introduction to Segmentation Techniques: Factor Analysis

Introduction to Factor Analysis and various techniques

Principal Component Analysis (PCA) and Exploratory Factor Analysis (EFA)

Application of Factor Analysis using Proc Factor procedure

KMO MSA test, Bartlett’s Test of Sphericity

The Mineigen Criterion, Scree plot

Introduction to Factor Loading Matrix

Various rotation techniques like Varimax

Section 7
##### Introduction to Segmentation Techniques: Cluster Analysis

Introduction to Cluster Analysis and various techniques

Hierarchical and Non-Hierarchical Clustering techniques

Using Hierarchical Clustering by Proc Tree procedure in SAS

Performing K-means Clustering in SAS

Divisive Clustering, Agglomerative Clustering

Application of Cluster Analysis in Analytics with profiling of the clusters and interpretation of the clusters

Section 8
##### Correlation and Linear Regression

Introduction to Pearson’s Correlation coefficient using PROC CORR procedure

Correlation and Causation – Fitting a simple linear regression model with the Proc REG procedure

Understanding the concepts of Multiple Regression

Using automated model selection techniques in PROC REG to choose the best model

Interpretation of the model: overall fit of the model and finding out the influential variables

Linear Regression diagnostics

Examining Residual

Assessing Collinearity, Heteroskedasticity and Autocorrelation

Section 9
##### Introduction to Categorical Data Analysis and Logistic Regression

Comparison between Linear Regression and Logistic Regression

Performing Logistic regression using Proc Logistic Procedure in SAS

Performing Goodness of fit of the model

Introduction to Percent Concordant, AIC, SC, and Hosmer-Lemeshow

Receiver Operating Characteristics (ROC) Curve and Area under Curve (AUC)

Interpretation of the model: overall fit of the model and finding out the influential variables using Odds ratio criteria

Using automated model selection techniques in PROC Logistic to choose the best model using AIC criteria

Section 10
##### Introduction to Time Series Analysis

What is Time series Analysis, Objectives and Assumptions of Time Series

Identifying pattern in Time series data: Decomposition of the time series data and general aspect of the analysis

Introduction to Various Smoothing techniques: Simple Moving Average, Weighted Moving Average, Exponential Smoothing, Holt’s Linear Exponential Smoothing

Examples of Seasonality and detecting Seasonality in Time series data

Introduction to Proc Forecast to generate forecast for time series data

Autoregressive models and Stepwise Autoregression (STEPAR) procedure

Autoregressive and Moving Average models and Introduction to Box Jenkins Methodology

Introduction to Autoregressive Moving Average (ARMA) model

Autoregressive Integrated Moving Average (ARIMA) model

Building an ARIMA Model

Detection of Stationarity, Seasonality in ARIMA Model

Detecting the order of AR and MA of ARIMA model by Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF)

Detecting the order by using AIC and BIC criterion

Estimation and forecast using Proc ARIMA in SAS

What is a Pie Chart?
A pie chart is a circular chart divided into wedge-like sectors, illustrating proportion. Each wedge represents a proportionate part of the whole, and the total value of the pie is always 100 percent. Pie charts can make the size of portions easy to understand at a glance. They're widely used in business presentations and education to show the proportions among a large variety of categories including expenses, segments of a population, or answers to a survey.

Pie Chart vs Bar Chart
Critics of pie charts point out that portions are hard to compare from one pie chart to another. If a pie chart has too many wedges, even the wedges within a single chart are harder to contrast visually than, for example, the heights of bars in a bar graph. Bar charts are easier to read when you're comparing categories or looking at change over time. What bar charts lack is the whole-part relationship that makes pie charts unique. A pie chart implies that if one wedge gets bigger, the others have to get smaller. This is not true of the bars on a bar chart.

Let’s now see how we can create different types of pie charts in SAS.

PROC GCHART DATA= mylib.CANDY_SALES_SUMMARY;

PIE3D SUBCATEGORY;

RUN;
This code generates a 3-dimensional pie chart using the PIE3D statement. PROC GCHART is the procedure used to produce the chart. The pie chart represents each subcategory as a slice whose angle is its percentage share of the full 360 degrees. We are creating a pie chart of the different candy subcategories present in the data set called “Candy_Sales_Summary”. “mylib” is the name of the library in which the SAS data set is stored.
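To try the PROC GCHART examples without the original data, a small stand-in data set can be created first. The sketch below is only an assumption about the layout: the source names the variables SUBCATEGORY and SALE_AMOUNT, but the rows, the values, and the mapping of the `mylib` libref to the WORK library are made up for illustration.

```sas
/* Hypothetical setup: make the MYLIB libref point at the WORK library. */
/* In a real project MYLIB would point at a permanent folder instead.   */
libname mylib (work);

/* Made-up sample rows; the real data set may hold different values     */
/* and additional columns.                                              */
data mylib.candy_sales_summary;
    length subcategory $ 12;
    input subcategory $ sale_amount;
    datalines;
Chocolate 120
Gummies 80
Hard_Candy 50
Lollipops 50
;
run;

/* Print the rows to confirm the data set was created as expected. */
proc print data=mylib.candy_sales_summary;
run;
```

In this stand-in data each subcategory appears exactly once. Because a pie chart in PROC GCHART counts observations by default, a chart drawn without further options would give every subcategory the same slice size; the FREQ=SALE_AMOUNT option used in the last example weights each row by its sale amount instead.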

If we modify the PROC GCHART code above as follows:

PROC GCHART DATA= mylib.CANDY_SALES_SUMMARY;

PIE SUBCATEGORY/ VALUE= INSIDE;

RUN;

We will get a variation of the previous pie chart. VALUE=INSIDE prints the frequency value inside each slice, and each slice is still labeled with its subcategory name. Each subcategory is shown as a slice of a different color. Note: We have used the PIE statement rather than PIE3D here, and hence we get a 2-dimensional pie chart.

The code below is for a pie chart which shows the frequency of sale for each subcategory, with FREQ=SALE_AMOUNT weighting each observation by its sale amount. The value and the percentage for each subcategory are shown inside the slice (VALUE=INSIDE, PERCENT=INSIDE), and the name of the subcategory is shown outside the slice (SLICE=OUTSIDE).

PROC GCHART DATA= mylib.CANDY_SALES_SUMMARY;

PIE3D SUBCATEGORY/ VALUE=INSIDE

PERCENT=INSIDE

SLICE=OUTSIDE

FREQ=SALE_AMOUNT;

RUN;

On running the above code, we will get a graph like the one shown below