Professional Documents
Culture Documents
08
Unit – II
Statistical & Probabilistic analysis of Data:
Multiple hypothesis testing 01
1
Parameter Estimation methods 01
2 01
3 Confidence intervals
01
4 Bayesian statistics
02
Data Distributions 02
5
08
Unit – III
1 Introduction to machine learning: 01
Supervised & unsupervised learning classification 01
2
Algorithms 02
3 01
4 clustering Algorithms,
01
5 Dimensionality reduction: PCA & SVD, Correlation
01
& Regression analysis, 01
6
Training & testing data: Over fitting & Under fitting
7
08
Unit – IV
Introduction to Information Retrieval:
Boolean Model, 01
1
Vector model, 02
2 01
3 Probabilistic Model,
01
4 Text based search: Tokenization,
01
TF-IDF, stop words and n-grams, 01
5
Synonyms and parts of speech tagging. 01
6
08
Unit – V
Introduction to Web Search& Big data: Crawling
and Indexes, 01
1
Search Engine architectures, 01
2 02
3 Link Analysis and ranking algorithms such as
4 HITS and PageRank,
01
Hadoop File system & 01
5
MapReduce Paradigm 08
6
Total Class Required 40
References :
Text Book: