| S.# | Date | Day | Topics | Downloads |
|---|---|---|---|---|
| 1-2 | 5/9/14 | Friday | Course Overview, What is Data Mining and its Origin, Typical Data Mining Tasks, Data Mining Applications/Examples, Data Mining vs. OLAP and Statistics, Introduction to Classification/Decision Trees | |
| 3-4 | 12/9/14 | Friday | Model Interpretation of Classification Trees, Measures of Node Impurity, Computation of Gini Index, Computation of Entropy and Misclassification Error, Induction of Classification Trees, Handling of Continuous and Multi-State Variables, Data Preparation, Normalization, Outlier Detection | |
| 5-6 | 19/9/14 | Friday | Discretization using Value Reduction, Overview of Chi-Square Test, ChiMerge Discretization, KNIME Demo (using German Credit Card Data), Model Evaluation, Accuracy, Weighted Accuracy, Recall and Precision, Receiver Operating Characteristic (ROC) Curve | |
| 7-8 | 26/9/14 | Friday | Lift and Gain Charts, Bayes' Theorem, Naive Bayes Classifier, KNIME Discussion | |
| | 3/10/14 | | Midterm 1 Week | |
| 9-10 | 10/10/14 | Friday | Assignment 1 Presentation (Case Studies from Books) | |
| 11-12 | 17/10/14 | Friday | PAKDD 2010 Case Study, Performance Evaluation of SVM, NB, NN, DT, etc. in Non-linear Classification, Motivation, History, Multi-layer Feedforward Network, Backpropagation Algorithm | |
| 13-14 | 24/10/14 | Friday | Model Evaluation (Holdout, k-Fold Cross-Validation), Sampling with Replacement (Bootstrapping), Ensemble Methods (Bagging and Boosting), Random Forest, Stacking, Lazy Learner vs. Eager Learner, k-Nearest Neighbor: Pros and Cons | |
| 15-16 | 31/10/14 | Friday | Clustering: Basic Concepts and Popular Types, Applications, K-Means: Concepts, Working, Limitations, Schemes to Handle Initial Centroid Problems in K-Means, Hierarchical Clustering: Single/Complete/Average Linkages, Validity of Clusters: External and Internal Metrics | |
| 17-18 | 7/11/14 | Friday | KNIME Demo (K-Means and Fuzzy c-Means Clustering with Relative Index and External Index, Hierarchical Clustering), Distance Computation for Mixed Type Variables: Interval-Scaled, Symmetric and Asymmetric Binary, Categorical and Ordinal, Fuzzy c-Means | |
| 19-20 | 14/11/14 | Friday | Assignment 2 Presentation | |
| | 21/11/14 | | Midterm 2 Week | |
| 21-22 | 28/11/14 | Friday | Kohonen Self-Organizing Map, Text Analytics, Part-of-Speech Tagging, Bag of Words, Term Frequency, Inverse Document Frequency, TF-IDF | |
| 23-24 | 14/12/14 | Friday | Association Rule Mining, Apriori Algorithm, Frequent Itemsets and Rules Generation, Support, Confidence, Interest and Lift, Handling of Continuous and Categorical Data, min-Apriori, Multi-level Association Rules, KNIME Demo, Principal Component Analysis | |
| 25-26 | 19/12/14 | Friday | Project Presentations, Big Data Overview | |