|
Name of Subject : DATA MININIG AND WAREHOUSING (7 CS 2) |
|
Unit |
|
Contents |
|
Overview, Motivation(for Data Mining),Data Mining-Definition & Functionalities, Data Processing, Form of Data |
|
Preprocessing, Data Cleaning: Missing Values, Noisy Data, (Binning, Clustering, Regression, Computer and |
|
I |
|
Human inspection), Inconsistent Data, Data Integration and Transformation. Data Reduction:-Data Cube |
|
Aggregation, Dimensionality reduction, Data Compression, Numerosity Reduction, Clustering, Discretization and |
|
Concept hierarchy generation. |
|
Concept Description:- Definition, Data Generalization, Analytical Characterization, Analysis of attribute relevance, |
|
Mining Class comparisons, Statistical measures in large Databases. Measuring Central Tendency, Measuring |
|
Dispersion of Data, Graph Displays of Basic Statistical class Description, Mining Association Rules in Large |
|
II |
|
Databases, Association rule mining, mining Single-Dimensional Boolean Association rules from Transactional |
|
Databases– Apriori Algorithm, Mining Multilevel Association rules from Transaction Databases and Mining Multi- |
|
Dimensional Association rules from Relational Databases. |
|
What is Classification & Prediction, Issues regarding Classification and prediction, Decision tree, Bayesian |
|
Classification, Classification by Back propagation, Multilayer feed-forward Neural Network, Back propagation |
|
Algorithm, Classification methods K-nearest neighbor classifiers, Genetic Algorithm. Cluster Analysis: Data types in |
|
III |
|
cluster analysis, Categories of clustering methods, Partitioning methods. Hierarchical Clustering- CURE and |
|
Chameleon. Density Based Methods-DBSCAN, OPTICS. Grid Based Methods- STING, CLIQUE. Model Based |
|
Method –Statistical Approach, Neural Network approach, Outlier Analysis |
|
Data Warehousing: Overview, Definition, Delivery Process, Difference between Database System and Data |
|
IV |
|
Warehouse, Multi Dimensional Data Model, Data Cubes, Stars, Snow Flakes, Fact Constellations, Concept |
|
hierarchy, Process Architecture, 3 Tier Architecture, Data Marting. |
|
Aggregation, Historical information, Query Facility, OLAP function and Tools. OLAP Servers, ROLAP, MOLAP, |
|
V |
HOLAP, Data Mining interface, Security, Backup and Recovery, Tuning Data Warehouse, Testing Dat