Contents

About the Authors

Acknowledgments

Preface

image Part 1: Basics of SAS Programming for Analytics

image Chapter 1: Introduction to Business Analytics and Data Analysis Tools

Business Analytics, the Science of Data-Driven Decision Making

Business Analytics Defined

Is Advanced Analytics the Solution for You?

Simulation, Modeling, and Optimization

Data Warehousing and Data Mining

What Can Be Discovered Using Data Mining?

Business Intelligence, Reporting, and Business Analytics

Analytics Techniques Used in the Industry

Regression Modeling and Analysis

Time Series Forecasting

Conjoint Analysis

Cluster Analysis

Segmentation

Principal Components and Factor Analysis

Correspondence Analysis

Survival Analytics

Some Practical Applications of Business Analytics

Customer Analytics

Operational Analytics

Social Media Analytics

Data Used in Analytics

Big Data vs. Conventional Business Analytics

Introduction to Big Data

Introduction to Data Analysis Tools

Main Parts of SAS, SPSS, and R

Selection of Analytics Tools

The Background Required for a Successful Career in Business Analytics

Skills Required for a Business Analytics Professional

Conclusion

image Chapter 2: SAS Introduction

Starting SAS in Windows

The SAS Opening Screen

The Five Main Windows

Editor Window

Log Window

Output Window

Explorer Window

Results Window

Important Menu Options and Icons

View Options

Run Menu

Solutions Menu

Shortcut Icons

Writing and Executing a SAS Program

Comments in the Code

Your First SAS Program

Debugging SAS Code Using a Log File

Example for Warnings in Log File

Tips for Writing, Reading the Log File, and Debugging

Saving SAS Files

Exercise

Conclusion

image Chapter 3: Data Handling Using SAS

SAS Data Sets

Descriptive Portion of SAS Data Sets

Data Portion of Data Set

SAS Libraries

Creating the Library Using the GUI

Rules of Assigning a Library

Creating a New Library Using SAS Code

Permanent and Temporary Libraries

Two Main Types of SAS Statements

Importing Data into SAS

Data Set Creation Using the SAS Program

Using the Import Wizard

Import Using the Code

Data Manipulations

Making a Copy of a SAS Data Set

Creating New Variables

Updating the Same Data Set

Drop and Keep Variables

Subsetting the Data

Conclusion

image Chapter 4: Important SAS Functions and Procs

SAS Functions

Numeric Functions

Character Functions

Date Functions

Important SAS PROCs

The Proc Step

PROC CONTENTS

PROC SORT

Graphs Using SAS

PROC gplot and Gchart

PROC SQL

Data Merging

Appending the Data

From SET to MERGE

Blending with Condition

Matched Merging

Conclusion

image Part 2: Using SAS for Business Analytics

image Chapter 5: Introduction to Statistical Analysis

What Is Statistics?

Basic Statistical Concepts in Business Analytics

Population

Sample

Variable

Variable Types in Predictive Modeling Context

Parameter

Statistic

Example Exercise

Statistical Analysis Methods

Descriptive Statistics

Inferential Statistics

Predictive Statistics

Solving a Problem Using Statistical Analysis

Setting Up Business Objective and Planning

The Data Preparation

Descriptive Analysis and Visualization

Predictive Modeling

Model Validation

Model Implementation

An Example from the Real World: Credit Risk Life Cycle

Business Objective and Planning

Data Preparation

Descriptive Analysis and Visualization

Predictive Modeling

Model Validation

Model Implementation

Conclusion

image Chapter 6: Basic Descriptive Statistics and Reporting in SAS

Rudimentary Forms of Data Analysis

Simply Print the Data

Print and Various Options of Print in SAS

Summary Statistics

Central Tendencies

Calculating Central Tendencies in SAS

What Is Dispersion?

Calculating Dispersion Using SAS

Quantiles

Calculating Quantiles Using SAS

Box Plots

Creating Boxplots Using SAS

Bivariate Analysis

Conclusion

image Chapter 7: Data Exploration, Validation, and Data Sanitization

Data Exploration Steps in a Statistical Data Analysis Life Cycle

Example: Contact Center Call Volumes

Need for Data Exploration and Validation

Issues with the Real-World Data and How to Solve Them

Missing Values

The Outliers

Manual Inspection of the Dataset Is Not a Practical Solution

Removing Records Is Not Always the Right Way

Understanding and Preparing the Data

Data Exploration

Data Validation

Data Cleaning

Data Exploration, Validation, and Sanitization Case Study: Credit Risk Data

Importing the Data

Step 1: Data Exploration and Validation Using the PROC CONTENTS

Step 2: Data Exploration and Validation Using Data Snapshot

Step 3: Data Exploration and Validation Using Univariate Analysis

Step 4: Data Exploration and Validation Using Frequencies

Step 5: The Missing Value and Outlier Treatment

Conclusion

image Chapter 8: Testing of Hypothesis

Testing: An Analogy from Everyday Life

What Is the Process of Testing a Hypothesis?

State the Null Hypothesis on the Population: Null Hypothesis (H0)

Alternate Hypothesis (H1)

Sampling Distribution

Central Limit Theorem

Test Statistic

Inference

Critical Values and Critical Region

Confidence Interval

Tests

T-test for Mean

Case Study: Testing for the Mean in SAS

Other Test Examples

Two-Tailed and Single-Tailed Tests

Conclusion

image Chapter 9: Correlation and Linear Regression

What Is Correlation?

Pearson’s Correlation Coefficient (r)

Variance and Covariance

Correlation Matrix

Calculating Correlation Coefficient Using SAS

Correlation Limits and Strength of Association

Properties and Limitations of Correlation Coefficient (r)

Some Examples on Limitations of Correlation

Correlation vs. Causation

Correlation Example

Correlation Summary

Linear Regression

Correlation to Regression

Estimation Example

Simple Linear Regression

Regression Line Fitting Using Least Squares

The Beta Coefficients: Example 1

How Good Is My Model?

Regression Assumptions

When Linear Regression Can’t Be Applied

Simple Regression: Example

Conclusion

image Chapter 10: Multiple Regression Analysis

Multiple Linear Regression

Multiple Regression Line

Multiple Regression Line Fitting Using Least Squares

Multiple Linear Regression in SAS

Example: Smartphone Sales Estimation

Goodness of Fit

Three Main Measures from Regression Output

Multicollinearity Defined

How to Analyze the Output: Linear Regression Final Check List

Double-Check for the Assumptions of Linear Regression

F-test

R-squared

Adjusted R-Squared

VIF

T-test for Each Variable

Analyzing the Regression Output: Final Check List Example

Conclusion

image Chapter 11: Logistic Regression

Predicting Ice-Cream Sales: Example

Nonlinear Regression

Logistic Regression

Logistic Regression Using SAS

SAS Logistic Regression Output Explanation

Output Part 1: Response Variable Summary

Output Part 2: Model Fit Summary

Output Part 3: Test for Regression Coefficients

Output Part 4: The Beta Coefficients and Odds Ratio

Output Part 5: Validation Statistics

Individual Impact of Independent Variables

Goodness of Fit for Logistic Regression

Chi-square Test

Concordance

Prediction Using Logistic Regression

Multicollinearity in Logistic Regression

No VIF Option in PROC LOGISTIC

Logistic Regression Final Check List

Loan Default Prediction Case Study

Background and Problem Statement

Objective

Data Set

Model Building

Final Model Equation and Prediction Using the Model

Conclusion

image Chapter 12: Time-Series Analysis and Forecasting

What Is a Time-Series Process?

Main Phases of Time-Series Analysis

Modeling Methodologies

Box–Jenkins Approach

What Is ARIMA?

The AR Process

The MA Process

ARMA Process

Understanding ARIMA Using an Eyesight Measurement Analogy

Steps in the Box–Jenkins Approach

Step 1: Testing Whether the Time Series Is Stationary

Step 2: Identifying the Model

Step 3: Estimating the Parameters

Step 4: Forecasting Using the Model

Case Study: Time-Series Forecasting Using the SAS Example

Checking the Model Accuracy

Conclusion

image Chapter 13: Introducing Big Data Analytics

Traditional Data-Handling Tools

Walmart Customer Data

Facebook Data

Examples of the Growing Size of Data

What Is Big Data?

The Three Main Components of Big Data

Applications of Big Data Analytics

The Solution for Big Data Problems

Distributed Computing

What Is MapReduce?

Map Function

Reduce Function

What Is Apache Hadoop?

Hadoop Distributed File System

MapReduce

Apache Hive

Apache Pig

Other Tools in the Hadoop Ecosystem

Companies That Use Hadoop

Big Data Analytics Example

Examining the Business Problem

Getting the Data Set

Starting Hadoop

Looking at the Hadoop Components

Moving Data from the Local System to Hadoop

Viewing the Data on HDFS

Starting Hive

Creating a Table Using Hive

Executing a Program Using Hive

Viewing the MapReduce Status

The Final Result

Conclusion

Index

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset