PG DIPLOMA
IN DATA ANALYTICS
Program Curriculum
Note: This curriculum is subject to change based on inputs from IIITB and industry.
TOOLS & LANGUAGES
  R
    INTRO TO R
      INSTALLING R
      BASIC OPERATIONS IN R
      VECTORS, FACTORS, MATRICES, LISTS
      LOOPS & CONDITIONAL STATEMENTS
      FUNCTIONS
      DATA FRAMES
  PYTHON*
    INSTALLATION
    BASICS
    DATA STRUCTURES IN PYTHON
      LISTS
      TUPLES
      DICTIONARIES
      SETS
    IMPORTING PACKAGES
    CONTROL STRUCTURES & FUNCTIONS
      IF-ELIF-ELSE
      LOOPS & CONDITIONAL STATEMENTS
      COMPREHENSIONS
      FUNCTIONS
      MAP, FILTER & REDUCE
    DATA ANALYSIS USING PANDAS
      INTRODUCING NUMPY
      INTRODUCING PANDAS
      MERGING, QUERYING & AGGREGATION
  SQL
    INTRO TO SQL
      BASICS OF SQL
      MYSQL FUNCTIONS
      SQL WITH R
  TABLEAU
    VISUALISATION WITH TABLEAU
      TABLEAU INTERFACE
      CONNECTING TO DATA & BASIC VISUALISATIONS
      INSIGHTS FROM VISUALISATIONS
*Optional
BUSINESS & DATA UNDERSTANDING
  DATA DICTIONARY & GRANULARITY
    DATA DICTIONARY
    DATA GRANULARITY
  DATA QUALITY ISSUES & CLEANING
    INCONSISTENT DATA
    MISSING DATA
    HOMONYMS
    SYNONYMS
    INACCURATE DATA
    GENERAL PURPOSE ATTRIBUTES
    UNSTRUCTURED DATA
  DATA PREPARATION & MERGES
    TYPES OF MERGES
    MERGING IN R
    DATA DEDUPLICATION
    BUSINESS RULES
STATISTICS & EDA
  DESCRIPTIVE STATISTICS
    MEASURES OF CENTRAL TENDENCY
      MEAN
      MEDIAN
      MODE
      TRUNCATED MEAN
      GEOMETRIC MEAN
    SPREAD OF THE DATA
      RANGE
      VARIANCE
      STANDARD DEVIATION
      SKEWNESS
      KURTOSIS
      COEFFICIENT OF VARIATION
    ASSOCIATION BETWEEN VARIABLES
      COVARIANCE
      CORRELATION
      CORRELATION IS NOT CAUSATION
  INFERENTIAL STATISTICS
    BASICS OF PROBABILITY
      UNDERSTANDING PROBABILITY
      MARGINAL PROBABILITY
      JOINT PROBABILITY
      CONDITIONAL PROBABILITY
      BAYES THEOREM FOR CONDITIONAL PROBABILITY
    PROBABILITY DISTRIBUTION
      BASICS OF DISTRIBUTION - PDF & CDF
      DISCRETE PROBABILITY DISTRIBUTIONS
      NORMAL DISTRIBUTION
    SAMPLING & SAMPLING DISTRIBUTION
      SAMPLING BIASES & SAMPLING TECHNIQUES
      SAMPLING DISTRIBUTION
      CENTRAL LIMIT THEOREM
      CONFIDENCE INTERVAL
      MARGIN OF ERROR
  HYPOTHESIS TESTING
    CONCEPTS IN HYPOTHESIS TESTING
      BASICS
      NULL & ALTERNATE HYPOTHESIS
      STANDARDISED SCORE APPROACH
      UNSTANDARDISED TEST SCORE
      P-VALUE APPROACH
      TYPES OF TESTS
      TYPES OF ERRORS
    SETTING UP HYPOTHESIS TEST
      1-POPULATION MEAN TEST
      2-POPULATION MEAN TEST
      1-POPULATION PROPORTION TEST
      2-POPULATION PROPORTION TEST
    WHEN NOT TO USE Z-TEST
      UNDERSTANDING T-DISTRIBUTION
      SETTING UP T-TEST
      NON-PARAMETRIC TEST
      SETTING UP CHI-SQUARE TEST
PREDICTIVE ANALYTICS I
  SUPERVISED CLASSIFICATION I*
    K-NN
      CONTINUOUS DATA
      CATEGORICAL DATA
      K-NN IN R
    NAIVE BAYES
      NAIVE BAYES WITH 1 FEATURE
      CONDITIONAL INDEPENDENCE
      DECIPHERING NAIVE BAYES
      NAIVE BAYES WITH CONTINUOUS DATA
      NAIVE BAYES IN R
    SIGMOID FUNCTION
    INTRO TO LOGISTIC REGRESSION
    ESTIMATING & INTERPRETING THE COEFFICIENTS
    FEATURE SELECTION THROUGH STEPWISE
  UNSUPERVISED LEARNING: CLUSTERING
    INTRO TO CLUSTERING
      UNSUPERVISED LEARNING
      CUSTOMER SEGMENTATION
    K MEANS ALGORITHM
      STEPS OF THE ALGORITHM
      VISUALISING THE K MEANS ALGORITHM
      PRACTICAL CONSIDERATIONS IN K MEANS
    K MEANS IN R
      DATA PREPARATION
      MAKING THE CLUSTERS
      DECIDING THE OPTIMAL K
      INTERPRETING THE RESULTS
    HIERARCHICAL CLUSTERING
      STEPS OF THE ALGORITHM
      INTERPRETING THE DENDROGRAM
      TYPES OF LINKAGES
    HIERARCHICAL CLUSTERING IN R
      CONSTRUCTING THE DENDROGRAM
      CUTTING THE DENDROGRAM
      INTERPRETING THE DENDROGRAM
*Optional
AR & MA MODELLING
WORKING WITH STATIONARY TIME SERIES
ARMA MODELLING
ENSEMBLE IN R
INDUSTRY APPLICATIONS & UTILITY
HOW TO PROCESS IT? OR THE PROCESSING PLATFORMS
WHAT ARE THE JOB ROLES FOR BIG DATA ANALYSTS?
INTRODUCTION TO SQOOP
DATA INGESTION WITH SQOOP
SQOOP IMPORT AND EXPORT
SQOOP COMMANDS DEMO
MANAGING DATA
INTRODUCTION TO HIVE
UNDERSTANDING HIVE COMPONENTS
HADOOP DATABASE - HIVE
USING HIVE COMMANDS ON SAMPLE DATASETS TO DEMONSTRATE MANAGED AND EXTERNAL TABLES
INTRODUCTION TO SPARKSQL
DATA ANALYSIS USING SQL
USING SPARKSQL
ACQUISITION STRATEGIES
ACQUISITION ANALYTICS
LAB - BANK MARKETING
BFS
A/B TESTING
EXECUTING A/B TEST IN OPTIMISELY
US HEALTHCARE
OVERVIEW OF HEALTHCARE INDUSTRY
SCOPE OF ANALYTICS
JOB PERSPECTIVES
COST MANAGEMENT
NETWORK DESIGN
PATIENT ADHERENCE
CLINICAL TRIALS
BUSINESS PROBLEM
INSIGHT GENERATION