You are on page 1of 5

Dice Analytics Presents:

R & Data Science


Professional Course

About the Course


In this course you will learn about data mining algorithms and it's applications. Further you will also be guided
how to use the data mining algorithms in KNIME and R. This course will cover datasets from multiple domains
and how to apply data Mining algorithms on the available data, how to get value out of data Mining
algorithms, and how to present the output of those algorithms

By the end of the course, you will have enough knowledge and hands-on expertise in R and Knime to use and
apply them inreal world around you

Who should Attend?

Graduate or master students who want to start their career in the data science domain
People who are working in the BI domain and want to advance their career in the field of data science
Executive who want to build a data science department in their startup/organizations

About the Instructor

Ali Raza Anjum


Data Scientist
Ali Raza Anjum holds a computer engineering and Master in Business Administration Degree from
NUST. He is also a Gold Medalist from NUST in best Final year project. He has been working in the
domain of Datawarehousing, Data Science, Big Data and Customer Value Management since last 7
years.

0092-51-8356066 info@diceanalytics.pk www.diceanalytics.pk NISTE Building, Gate#1, Faiz Ahmed Faiz Road, H/8-1, Islamabad
Curriculum
Week # 0 :

Basics Of Data Science & Probability & Machine Learning


Domains In Machine Learning (Supervised Learning, Unsupervised Learning, ReinforcementLearning,
Deep Learning)
What Is Data & Its Different Types (Contineous &Nominal)
Introduction & Installation Of Rstudio & Rnotebook
Rstudio Overview
R Language & Syntax
Observational & Experimentatl Studies
Population Sampling
Data Visuliazing (Contneous & Categorical Data: Scatterplots, Boxplots, )
Data Types In R (Vectors,Lists, Data Frames, Names Attributes
Subsetting (Lists, Matrices, Matching)
Logic Controls (If-Else, Loops,Breaks
Data Centricity (Mean, Modes, Median, Std, Variance, Interquantile Range)
Data Transformation (Log, Natural Log, Min Max )
What Is Probability
Functions In R
Debugging In R Code
Random Sampling In R
Conditional Probability (Disjoint Events + General Addition Rule)
Disjoint Vs. Independent Events
Probability Trees & Bayesian Inference With Their Examples
Intoduction To Kaggle
Probability Distributions (Normal, Binomial, Poison)
Outliers Basics

Week # 1 :

Statistical Inference
Variability & Central Limiting Theorem
Confidence Interval & Confidence Level
Data Cleasing In R

0092-51-8356066 info@diceanalytics.pk www.diceanalytics.pk NISTE Building, Gate#1, Faiz Ahmed Faiz Road, H/8-1, Islamabad
Reading Data In R (CSV & Excell )
Read R & Tidy R
Accuracy & Percision
Samplingpling & Sampling Size For Mean Estimation
Hypothesis Testing & Null Hypothesis
Dplyr Package In R
Data Wrangling/ETL In R
Data Visualization In R
Expected Values
T Statistics
P Statistics
GGPlot2 In R
Kaggle Dataset In R (EDA/HR Analytics)
Hypothesis Testing & Null Hypothesis

Week # 2 :

Power Analysis
Bootstraping
ANOVA
Coreplot In R
Interactive Discussion Last Week Assignments
Correlation Matrix In R
Chisquare Test
Correlation Matrix
Multicollinearity
Missing Values In R
Outliers Detection In R
Kaggle Dataset In R (EDA/IMDB Movies)
Dimension Reductionality
Unsupervised Learning Basics

Week # 3 :
Unsupervised Learning
Clustering

0092-51-8356066 info@diceanalytics.pk www.diceanalytics.pk NISTE Building, Gate#1, Faiz Ahmed Faiz Road, H/8-1, Islamabad
Kmeans,Kmodes,Kmedians
Kaggle DataSet (Zillow Dataset )
Silhoute Indexes & Clustering Quality
Hierarchical Clustering
Association Rules
Apriori Algorithm
Supervised Learning
Linear Regression
R Square, Suquare SUM Of Regression, Least Square, Extrapolation

Week # 4 :

Multivariate Regression
Logistic Regression
Decsion Trees
Information Gain, Gini Index, Gain Ratio
ID3, CART , C4.5
Random Forest
Confusion Matrix
True Positive, True Negative, False Positive , False Negage
Percision, Accuracy, Recall, F Measure

Week # 5 :

Network Theory Basics


Neural Networks Basics
Text Mining Basics
Model OverFiting, UnderFitting

Week # 6 :

Project & Presenation

Tools
R Programming Language

0092-51-8356066 info@diceanalytics.pk www.diceanalytics.pk NISTE Building, Gate#1, Faiz Ahmed Faiz Road, H/8-1, Islamabad
Price and Timing

Price: Rs.25,000.00 per person


Duration: 7 Weeks
Timing: 07:00 pm to 09:00 pm
Days: 4 Days a week

How to Register?

Register on the course page: www.diceanalytics.pk/ds


Or, Email us at: info@diceanalytics.pk
Or, Call us on: 051-8356066

0092-51-8356066 info@diceanalytics.pk www.diceanalytics.pk NISTE Building, Gate#1, Faiz Ahmed Faiz Road, H/8-1, Islamabad

You might also like