
Build the DW, ETL (09.12.2010)


Formal Metadata

Title
Build the DW, ETL (09.12.2010)
Title of Series
Part Number
7
Number of Parts
13
Author
Contributors
License
CC Attribution - NonCommercial 3.0 Germany:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language
Producer
Production Year
2011
Production Place
Braunschweig

Content Metadata

Subject Area
Genre
Abstract
In this course, we examine the aspects of building, maintaining and operating data warehouses, and give an insight into the main knowledge discovery techniques. The course deals with basic issues like storage of the data, execution of analytical queries and data mining procedures. The course will be taught completely in English. The general structure of the course is:
- Typical DW use case scenarios
- Basic architecture of a DW
- Data modelling on a conceptual, logical and physical level
- Multidimensional E/R modelling
- Cubes, dimensions, measures
- Query processing, OLAP queries (OLAP vs OLTP), roll-up, drill down, slice, dice, pivot
- MOLAP, ROLAP, HOLAP
- SQL99 OLAP operators, MDX
- Snowflake, star and starflake schemas for relational storage
- Multimedia physical storage (linearization)
- DW indexing as a means of search optimization: R-Trees, UB-Trees, Bitmap indexes
- Other optimization procedures: data partitioning, star join optimization, materialized views
- ETL
- Association rule mining, sequence patterns, time series
- Classification: decision trees, naive Bayes classification, SVM
- Cluster analysis: k-means, hierarchical clustering, agglomerative clustering, outlier analysis
Transcript: English (auto-generated)
Okay, so we should probably begin. Unfortunately, Professor Balke can't come today, or at least not for the first part of the lecture; maybe he will join us at a later point. Until then, I will try to replace him.
So let's start with the lecture. Let's first briefly recall what we discussed last week. We introduced OLAP operations and started with the classical operations
one has to perform when doing analytical processing, like roll-up, which comes in two flavors, hierarchical and dimensional; dimensional can again be seen as one type of hierarchical roll-up where you go over the top of your granularity. Then you have drill-downs, and the main point
here is that in order to perform drill down you actually need the data, you can't drill at a lower granularity than the data that you have, you can simulate such things, so if you don't have the data in your pre-aggregated multidimensional model, then you can simulate
it with just taking it from hopefully your relational model in real time, but this takes time. Then you have the classical slice, dice and pivot operations, in SQL you can simulate
them with WHERE clauses, as we've discussed last week. Between this graphical user interface, which allows us to perform these OLAP operations, and the data, we again have query languages. In the case of data warehousing we have spoken about SQL 99, the evolved part of SQL, which is already able to perform
the roll-up, drill-down and cube operators, and we've also spoken about MDX, a query language which was first introduced by Microsoft and which, due to their dominant position on the
market, has also become the de facto standard among multidimensional query languages. Of course this is used for the case where the data is physically stored multidimensionally,
through arrays, the MOLAP case, but it has also been ported to relational online analytical processing systems through a mapping onto SQL 99. Good, now that we've spoken about the query languages and about how the data
is stored, either on a logical level or on a physical level, we can go on and speak about how the data warehouse is actually built. We will begin today with what the project of building a data warehouse looks like, we
will discuss in more detail the central process of building the data warehouse which we've spoken about throughout the course, the extract, transform, load operations, and we will conclude
the lecture by speaking about the metadata repository, which improves the quality of the data in our data warehouse. Okay, so what's the deal with building a data warehouse, how does such a project differ from a normal software development project?
Well, it's a pretty complex project, it involves a large number of activities and it has a pretty different structure than the classical development projects.
It has a project manager, who does project management stuff as in any project, so he plans the resources, he manages the budget, the timelines and so on. It has an architect; in the case of the data warehouses, he has to be aware of the methods
which should be allowed on the data warehouse, of the concepts which the data warehouse needs to store, and of course he must be aware of how the data warehouse should be modeled.
So conceptual modeling is a task of the data warehouse architect. Here we are speaking about multidimensional entity relationships and multidimensional UML, we've spoken about in I think lecture five or so. Then we will speak about the data warehouse miner, this is the person who is aware of
mining algorithms, who is aware of strategies of how to extract data from the data warehouse. This falls into the category of non-standard analysis, how one can perform such analysis
but not through the GUI, rather through data mining algorithms, and of course about the concepts which need to be mined. Both the architect and the miner need a domain expert. We've said that the data warehouse has to store what the company needs, so it has to be
subject focused; this is what we need the domain expert for. He will tell us, okay, you need the time granularity to also consider the fiscal year. Such information is provided by the domain expert, so he is our domain knowledge source.
Then you have the classical developer, he will actually perform the maintenance and management of the metadata, so will implement the storage of the metadata in the metadata repository, will perform the tasks for the extract, transform and load process.
And then we have the user, who sits in front of our analytical GUIs and performs the operations we've spoken about last week.
So we already recognize some roles which are not classical from normal project management and project development. Of course we have to speak also about choosing the right software.
There are many possibilities, many vendors, technology providers but the first choice we have to agree upon is how should our physical data be stored. Should it be multidimensionally stored, should it relationally be stored?
This is a decision which has to be taken across all the roles we have discussed, but primarily with the person who has the domain knowledge and who will tell us, okay, our company needs a data warehouse which goes up to 50 gigabytes, because this is the amount of data we actually store on a regular basis, or we need something bigger and therefore
we should go relational. Another software choice is what tools should we use. ETL is performed through different tools from different vendors. They are usually independent of the vendors for data warehousing solutions as well as
from the vendors which offer database management systems for the operational environment. So they are a different animal. This has to be established at the beginning. Another choice, the analysis tools with what am I going to perform the data mining
over the data warehouse. What is going to ensure my reporting functionality which I also need for the end user. Another piece of software is the repository. It's not regularly used in all the data warehouses.
However, it is an important component when I'm dealing with a corporation which spreads around many countries and I immediately need a very powerful metadata management to follow the data through my company and see where did it start from, where did it go to.
For something like this, I need the repositories. You can imagine them as simple as volumes. When it comes to the hardware choice, of course, the solution will be distributed.
You are going to speak about storage area networks, something which is more current, more performance-capable for today's needs. Or probably you would need something cheaper, for example network-attached storage, where the network layer imposes some lag, you have the computers as nodes,
but you still have a high capacity of storage. Processing power. We've said that there is always a trade-off between storage and processing. Much of the information is pre-aggregated. However, during the ETL process,
this pre-aggregation step costs us a lot of processing power. This is why multi-CPU systems are the way to go. Failure tolerance. The data is very important. If I'm going to lose the data, I can't do any analysis anymore.
So data replication, RAID, mirroring, backup strategies. Most of the data warehouse vendors have a contract partner, something like Limelight Networks, which replicates the data around the globe to avoid catastrophes or something like this. The best example is the World Trade Center, what happened there.
Just imagine that the bank would have had all their clients just in the World Trade Center servers. The bank would have been ruined after the disaster. This doesn't happen due to the fact that failure tolerance must be considered from the beginning.
Of course, other factors like the cost, the memory bandwidth, the transfer rates, the latency have to be taken into consideration. How does the project timeline look like? It's a prototype-like project, so we start with a proof of concept.
We establish the technical infrastructure, the hardware, the software. We perform the first extract, transform, load process as a prototype. And we perform our first analysis and
reporting as told by our domain expert and prototype user. So we just select two, three users, which will be typical for the users of the data warehouse in our company, and say, what do you need? What do you expect from this data warehouse? What are the analyses you would like to perform?
This happens in the proof of concept. We build a prototype. Then there's a controlled release, much like in software development. You have a beta release and you choose some beta testers. This is exactly what happens here. You build small data marts and say, okay, let's see how it would go for
the marketing department. How would it function if the marketing department would try to perform analytics on the data warehouse? And of course, when the project reaches maturity, the third phase is the general availability where the whole company can have access to the data
warehouse and the data warehouse grows so that it can respect the requirements of the whole company. But the central process of the data warehouse project is the extract, transform, load process, and this is exactly what we will discuss today.
What is the extract, transform, load process? It's actually three combined functions which are used to pull data out of the operational data system, to get it to the staging area,
to transform it, make it conform throughout our company. It's like I'm getting data from America, Europe, Asia. I'm putting it together. I try to make it consistent, to increase its quality,
to merge and find duplicates, and then to load it into the data warehouse. So it also covers migrating the data and converting it to the proper format. As with any process in the data warehouse,
we can also ask ourselves when should we perform ETL? Well, we can perform it periodically and this is actually the most used technique. It depends then from the company strategy, one can perform it every night, every week, every few weeks.
There can be refresh policies set by administrators or based on user needs or traffic. For example, holidays are the best example for data warehousing refresh. The time for refreshing the data warehouse during the winter holidays,
for example, for sales or marketing, is decreased. One can have a regular interval of a week, but when winter comes, in the November and December sales months, it is shortened to a daily basis. Of course, there are also event-based solutions where the boss says,
we have signed a new contract with another company, which is going to supply us these products. I want, for every thousand sales of products coming from this company, a refresh of my data warehouse so I can perform new analyses.
Of course, there is also the case where I have a data warehouse for analyzing the stock market or something like this, which is pretty rare for data warehouse technologies. We speak about real-time extract, transform, load,
where every update has to be reflected in the data warehouse. But this actually is a very big problem, and it's not that easy to solve for a large amount of data. Okay, so ETL, I've said that ETL is used to integrate heterogeneous systems.
The company's subsidiary from United States may use DB2, Europe may use Oracle. So I need to integrate this from different types of sources which use different types of technologies. So ETL then has to be able to cope with this heterogeneity.
Other challenges are that it has to get the data from the source to the target as fast as possible. Let's imagine the case where I want to perform this update, this ETL process on a nightly basis.
Then it has to be performed from three o'clock in the night until six o'clock in the morning, or seven o'clock. If it takes longer than this, then it's useless. It has to cope with recovery from failure. If I have this limited window of time and the ETL process fails,
I'm not very happy if it starts all over again. So it has to be able to continue from where it left. The balance which is vital here is between keeping the data in the memory
and writing it on physical disks or files. When failing, if I have the data in memory that is lost, I have to recover everything, it takes more time. If I've written it in a file, then I can continue from where I've lost track.
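To make this restartability concrete, here is a minimal sketch in Python, under the assumption that the batch is just an iterable of records and that progress is remembered in a hypothetical checkpoint file on disk rather than only in memory:

```python
import json
import os

CHECKPOINT_FILE = "etl_checkpoint.json"  # hypothetical bookkeeping file

def load_checkpoint():
    # Resume position: how many records were already processed safely.
    if os.path.exists(CHECKPOINT_FILE):
        with open(CHECKPOINT_FILE) as f:
            return json.load(f)["rows_done"]
    return 0

def save_checkpoint(rows_done):
    with open(CHECKPOINT_FILE, "w") as f:
        json.dump({"rows_done": rows_done}, f)

def run_batch(records, process, checkpoint_every=10_000):
    start = load_checkpoint()
    for i, record in enumerate(records):
        if i < start:
            continue                     # already handled before the failure
        process(record)
        if (i + 1) % checkpoint_every == 0:
            save_checkpoint(i + 1)       # persist progress to disk, not memory
    save_checkpoint(0)                   # batch finished, reset for the next run
```

After a crash, re-running the same batch skips everything up to the last checkpoint instead of starting all over again.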
We've spoken at the beginning about the staging area. The staging area is the main area where the data is stored after being extracted from the operational data system. Of course, the staging area is not the place for the end users.
The data suffers here a process of transformation. So no reporting, modification, or access from other than people inside the project is allowed in the staging area. Maybe you can remember the analogy Tilo made with the restaurant where the guests are not allowed in the kitchen.
This is exactly what happens with the staging area. Okay, so how does it look like? We have our data in the operational environment.
Let's say Oracle is in Europe. Let's say the financial system is in the US. We can consider this kind of data, for example, for Volkswagen: we can imagine that the human resources system, Oracle, is in Wolfsburg, and that the financial system with its sales
is in North America, and so on. We, however, need to perform the extraction from these heterogeneous systems, and therefore we build wrappers for each of these technologies. Wrappers extract the data through queries, for
example, store it in our staging area. This can be anything from flat files to normal databases. It can also be, of course, a proprietary database management system.
Then this data has to be processed. It undergoes a process of transformation through data cleaning and integration. And then it will be loaded into the data warehouse.
So the data starts here, and then it's here. This is the big picture of the ETL, got it? Not that difficult.
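A minimal sketch of such a wrapper in Python, assuming the source is reachable through the built-in sqlite3 module and the staging target is a flat CSV file (the table and column names are made up for illustration):

```python
import csv
import sqlite3

def extract_to_staging(source_db_path, staging_csv_path):
    """Pull rows out of one operational source and dump them into the staging area."""
    connection = sqlite3.connect(source_db_path)
    cursor = connection.execute(
        "SELECT sale_id, product_id, quantity, sale_date FROM sales"  # hypothetical table
    )
    with open(staging_csv_path, "w", newline="") as staging_file:
        writer = csv.writer(staging_file)
        writer.writerow([column[0] for column in cursor.description])  # header row
        writer.writerows(cursor)                                       # data rows
    connection.close()

# One wrapper per source technology; all of them write into the same staging area.
# extract_to_staging("europe_hr_mirror.db", "staging/sales_europe.csv")
```

In a real project there would be one such wrapper per source technology (DB2, Oracle, flat exports), all producing a common staging format.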
Let's speak a bit about the structures in the staging area. What kind of information do we store in the staging area and how is it actually stored? We have normal files, text files, for example, XML files, and relational tables.
You may ask, why flat files? Why should we store data in flat files? Well, actually, the advantage of using flat files is that the operating system is faster when accessing flat files than compared to the database management system when accessing data.
Due to the additional layer, the logical layer in the database management system, the access time increases. Another big advantage is that operations like sorting, merging, deleting, and replacing benefit from this
faster access and are therefore faster outside the database management system. So processing flat files through something like Perl, Python, or other scripts results in faster performance.
Of course, the disadvantages are clear. There is no updating concept when working with flat files. So one should, again, ask himself, what am I going to store in the flat file? Do I need updating? Do I need lookups?
There's no indexing in flat files. If I'm going to perform a lot of searches, then forget the flat files. But nobody said that I can't use the flat files for something else where I have an advantage on it. So again, someone needs to look over the data and say, what should I use for this kind of data?
Should it be a flat file? Should it be something else? Again, a very important advantage when using flat files is that you all know when you are out of electricity, what does the operating system do?
It loses everything which it had in the memory. Thankfully, it usually stores data, temporary data, in a dump file. Dump files are flat files. If you work with flat files, you are better protected against such failures.
As previously stated, if sorting is your goal, then flat files are, again, a very good choice, since flat files can be sorted faster than relational tables.
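A minimal sketch of this kind of script-level processing, sorting and filtering a staging flat file without going through a database (the file name and column layout are assumptions):

```python
import csv

def sort_and_filter(staging_csv, output_csv, min_quantity=1):
    """Sort a flat staging file by sale_date and drop obviously empty sales."""
    with open(staging_csv, newline="") as f:
        reader = csv.DictReader(f)
        rows = [r for r in reader if int(r["quantity"]) >= min_quantity]
        fieldnames = reader.fieldnames
    rows.sort(key=lambda r: r["sale_date"])       # ISO dates sort correctly as strings
    with open(output_csv, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)

# On very large files one would stream through the file, or fall back to the
# operating system's sort and grep utilities, instead of loading everything into memory.
```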
Filtering using grep functionality is another classical example for flat files, or replacing text strings, again due to the faster access through the operating system. The other possibility was using XML data.
It's a common format for both the input and the output of the ETL process. We will later on see that it is also a very important way of solving the transformation problem when integrating schemas.
Different schemas have to be integrated and therefore we need a common language of these different schemas which pass through XML. It is generally not used for persistent staging because it does not benefit from
either of the two advantages of the databases and of the flat files. So it's a temporary solution. The classical mechanisms we're using XML data are the XML schema, the XPAT and the XQuery search languages and
the transformation, the XSLT language. And of course, we also have in the staging area the relational databases. This is the best solution when I don't have any proprietary ETL tools.
So I'm going to do everything by hand. And trust me, it can be done also by hand. It just means a lot of know-how. You have to know what you are doing and of course, a lot of work. But when doing this by hand, then relational databases are the best way to go,
although it's not the most efficient one. The most notable advantages when working with relational databases are the fact that the metadata are clear. You have the column names, you have the schema, you have the types, you have the lengths, you have the relations between the tables.
So all this is given. Of course, normalized staging is also an advantage because you have the relations between different tables, so we can impose different constraints when working on improving the data quality.
Then there's the advantage of the structured query language, which you don't have in the case of flat files. You have it in the case of XML, but as I've said, the XML is just a temporary solution. However, the main disadvantage, which I've already mentioned,
is that if I have a time constraint, as in the case of the data warehouses, which need to be updated in a short time window, then I have to use flat files. I have to go on sorting the information in text files just through the operating system.
I can't use the advantages which relational databases provide us with. About the design of the staging area: how does it all begin? Someone has to analyze the data, get a feeling for how the tables look,
and build a report based on the volumetric worksheet of each table. This actually means that each table has to be estimated based on its future usage in the staging area. For example, in the initial load process,
when I'm going to build the first prototype of the data warehouse, I estimate this amount of rows. Actually, this can be already calculated for the initial load. It can't be calculated for continuous load, but an average can be built. So this type of volumetric worksheet can therefore give me
a feeling about what I'm going to store in my staging area, and what kind of file that should be. Should that be a relational database? Should that be a flat file or an XML file? For example, an important piece of information is the row length.
Here, I can think about the block size, what kind of hard drives I want to buy. So it's related already to the hardware infrastructure. How many rows I am expecting in this table? How many bytes?
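A minimal sketch of such a volumetric estimate, with made-up row counts and row lengths:

```python
# Hypothetical volumetric worksheet: rows expected in the initial load,
# average daily growth, and row length in bytes for each staging table.
volumetrics = {
    "sales":     {"initial_rows": 5_000_000, "rows_per_day": 20_000, "row_bytes": 120},
    "customers": {"initial_rows":   300_000, "rows_per_day":    200, "row_bytes": 350},
}

def estimate_size(table, days=365):
    v = volumetrics[table]
    total_rows = v["initial_rows"] + v["rows_per_day"] * days
    total_bytes = total_rows * v["row_bytes"]
    return total_rows, total_bytes / 1024 ** 3   # rows and gigabytes

for table in volumetrics:
    rows, gib = estimate_size(table)
    print(f"{table}: ~{rows:,} rows after one year, ~{gib:.1f} GiB")
```

The initial load can be calculated exactly; the ongoing growth is only an average, but it is enough to decide between flat files and relational staging tables.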
All this information put together tells me what kind of files I'm going to choose for my staging area. Okay, let's go to the next process, the way ETL functions, and we'll start with the data extraction.
As I've said, the data extraction actually ports the data from the operational environment into the staging area. This can be performed either through a set of scripts which provide exporting functionality, or through external programs. It is not commonly expected that the operational environment
offers something like export. It's actually only expected when you buy the data warehouse as well as the operational system from the same technology vendor. Then you can say, okay, they are working together anyway, so I expect that this one exports something in a format
which the further process can read. The other solution, through external programs, is to perform the extraction through additional tools on the operational source. If I am going to perform the extraction through export,
then the most sensible way is to avoid a proprietary format and go through a text file. If I am going to perform the extraction through external tools, then the most notable solution is to use an intermediary database.
I then can use, of course, the advantages of the relational databases. The steps of performing extraction are quite trivial. Of course, when we are speaking about building a data warehouse, we have the initial extraction, the big extraction at the beginning.
For this, one has to perform a logical map. One has to know what should I extract from where. Again, domain knowledge comes into place. The person who knows about the business processes of the company will tell me, okay, for us as a, I don't know, service vendor,
the clients are important, whatever subject-oriented processes the company has, and through such information, one can decide what are the tables and databases important for performing extraction.
The second type of extraction is the ongoing extraction: the updates, as we call them in regular databases. Here we can just input new data into the data warehouse, we can notify the data warehouse about changed data, or, if specifically noted, we can notify the data warehouse
also about deleted data. We've previously, in the previous lectures, spoken about the fact that data warehouses should not lose data. We are not allowed to delete information in the data warehouse. This is why deleted data has a special state when performing extraction.
Actually, for example, when I've typed in something wrong, the most used process of deleting information from the data warehouse is just to negate that information.
For example, I've sold, I don't know, ten monitors of some kind. In fact, I've sold seven, so I add a correcting record of minus three, and I come from ten back to seven, just to negate the operation.
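A tiny illustration of this cancellation idea on a toy fact table, using the monitor numbers from above:

```python
# The wrong fact stays in the warehouse; a negating record corrects it.
sales_fact = [
    {"product": "monitor", "quantity": 10},   # originally loaded, but wrong
    {"product": "monitor", "quantity": -3},   # cancellation record
]

total_sold = sum(row["quantity"] for row in sales_fact)
print(total_sold)   # 7, the correct figure, without deleting any history
```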
Okay, so the logical map connects the operational system to the final data. But building the logical map implies discovering the data sources, so discovering the source of my data.
For this, an analysis has to be performed, a discovery process has to take place where I'm collecting the documents from the operational system, and I identify the databases, the tables, the relations, the cardinality, the keys, and the data types which I'm interested in.
In this step of building the logical map, I also have to pay attention to anomalies, because something like inputting null values in the data warehouse can destroy the entire process.
We've spoken about aggregated values. When building aggregated values with an operator, like for example the CUBE operator in SQL 99, it will generate null values. Those are actually aggregated null values, and they have to be differentiated from the null values which come from the operational system.
A classical example is when North America doesn't have a field and Europe has a corresponding attribute; when merging the data, the data coming from North America will have null in that attribute. What should we understand through that null value?
These kinds of anomalies have to be treated early in the data extraction process, and the most dangerous case is when the null value comes in a foreign key, so the join would be a problem. A classical solution for this is to perform an outer join
on the tables which I need to join, and if the null value comes in just a normal attribute, then I have to consider again asking the domain specialist how the company would like to treat those cases. A classical solution is to just give a special meaning to one flag value.
If the data is numeric, just say, okay, we don't have minus one as a value in that attribute field, so we'll just input minus one, and during our entire process we'll flag it as if it were a null. Something like this.
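A minimal sketch of this null handling, assuming -1 has been agreed upon as the sentinel for a hypothetical numeric attribute and that the records arrive as dictionaries:

```python
NULL_SENTINEL = -1   # business decision: -1 never occurs as a real value here

def replace_nulls(record, numeric_attribute="discount"):
    """Replace a missing numeric attribute by the agreed sentinel and flag it."""
    if record.get(numeric_attribute) is None:
        record[numeric_attribute] = NULL_SENTINEL
        record["discount_was_null"] = True    # keep track of the substitution
    else:
        record["discount_was_null"] = False
    return record

print(replace_nulls({"sale_id": 1, "discount": None}))
print(replace_nulls({"sale_id": 2, "discount": 5}))
```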
We've spoken about the initial extraction, the second part of it is the ongoing extraction, so data needs to be maintained in the data warehouse. Of course, this happens on a regular basis, but the main question here is, how do we detect what we actually need to update in the data warehouse?
It would be stupid just to throw away the whole information and perform the initial extraction again. The amount of information needed to repeat this would be very big, and as I've said, I also wouldn't gain anything, because I would lose everything which reflected the state of the data throughout time.
So the only possibility then to perform this is to perform ongoing extraction by detecting the changes. How can I detect the changes in data warehousing? There are three main possibilities. One is by using a special column called audit columns.
The other one is by using the log functionality of classical databases. Two strategies are the scraping of the logs and the sniffing of the logs. And the third possibility is a process of elimination. We'll discuss each of them right now.
So the idea of using the audit columns is just by adding timestamps to the records. After the initial extraction, I have the timestamp of the extraction. After modifying the data in the operational environment,
the timestamp changes with the last operation performed on the data. When I then want to perform this update, I just check timestamps and I see when was this record in the data warehouse loaded, then what's the last timestamp when an operation has been performed on this data?
Well, today. It's clear that the data has been changed, due to the different timestamps, so it has to be updated also in the data warehouse by introducing the new information. I've said the other possibility would be the one using logs.
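Before turning to the log-based approaches, here is a minimal sketch of the audit-column check just described, with sqlite3 standing in for the operational source and made-up table and column names:

```python
import sqlite3

def extract_changed_rows(source_db_path, last_extraction_time):
    """Pick up only the rows touched since the previous ETL run."""
    connection = sqlite3.connect(source_db_path)
    cursor = connection.execute(
        # last_modified is the audit column maintained by the operational system
        "SELECT * FROM customers WHERE last_modified > ?",
        (last_extraction_time,),
    )
    changed = cursor.fetchall()
    connection.close()
    return changed

# changed = extract_changed_rows("crm_mirror.db", "2010-12-08 03:00:00")
```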
A classical solution is log scraping. The idea is to use the redo log of the operational database and, at a certain time point, for example during the night, to find the transactions which affect the tables which the ETL is interested in.
ETL has a set of tables, again related to the business process of the company, where he's extracting continuously the data from. When these tables appear in the redo log with certain transactions,
I just take those transactions and input them into my data warehouse as modifications of the data. The problem when using such an approach is when the log gets full. What happens then is that the log is emptied automatically
by the database management system, and in such cases our log scraper is not aware of what happened in between, between the last scraping and the moment it was emptied. It can detect that the log was emptied, but it can't know what happened in between.
So a small amount of data can be lost. The other possibility would be log sniffing. This is a polling solution where all the databases and tables which are interesting for our ETL process are polled and watched
at a smaller time granularity, again through a sniffer, which then watches the transformations performed on these tables and reflects the modifications in the data warehouse. Actually, in order to solve the problem of the log being emptied,
a mixed solution between these two has to be used. For example, when the log has a low degree of occupancy, then I can live with larger intervals between sniffing or scraping.
When it gets full and reaches its capacity, I have to fear that it will be emptied. So the time interval between scrapes of the log will be lowered, will be smaller, so that I can decrease the amount of data which I'm going to lose, if I'm going to lose any data at all.
And the bulletproof solution for detecting changes without losing anything is the process of elimination. Of course, this costs a lot of time. Because the idea here is to compare versions of the extraction. This process preserves the previous or the last extraction copy
and compares it with the state of the interesting tables in the operational environment. This comparison, as you can imagine, is an all-against-all. So I compare everything from the operational environment with the last state of the extraction.
So this will take a lot of time. The great advantage, impossible to miss any data. Another interesting advantage, it can detect the deleted rows because I can see what was present in the last extraction and what is not there anymore in the operational environment.
So I can detect also deleted rows. So as I've said, if the records get deleted or overwritten in the operational environment through such a method, I can detect it.
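A minimal sketch of this comparison, assuming both the previous extraction and the current operational state are available as dictionaries keyed by the primary key:

```python
def diff_snapshots(previous, current):
    """Compare the last extraction with the current source state record by record."""
    inserted = {key: row for key, row in current.items() if key not in previous}
    deleted  = {key: row for key, row in previous.items() if key not in current}
    updated  = {key: row for key, row in current.items()
                if key in previous and previous[key] != row}
    return inserted, updated, deleted

previous = {1: ("Alice", "Braunschweig"), 2: ("Bob", "Paris")}
current  = {1: ("Alice", "Hannover"),     3: ("Carol", "Berlin")}
print(diff_snapshots(previous, current))
# ({3: ('Carol', 'Berlin')}, {1: ('Alice', 'Hannover')}, {2: ('Bob', 'Paris')})
```

Nothing is missed, and deletions fall out for free, but the full comparison is what makes this the slowest of the three strategies.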
If, however, the errors have been already loaded into the data warehouse, the corrections that are made are not to delete the record itself, but to add new records which correct the error through cancellation. So I add, for example, negative records which cancel the bad record
and then I add the correct value because I don't want to lose any information on the step, on the process. Okay, we've spoken about the data extraction, but what about the data transformation?
The data transformation basically uses rules, or lookup tables, or combinations with other data like data dictionaries, or semantically related data like ontologies.
The major steps in data transformation are data cleaning and data integration. Both of them involve manual work, so neither of these two sub-processes is perfectly automatic. There are some solutions which come from artificial intelligence;
things like pattern recognition, clustering, and supervised and unsupervised learning methods fit here quite well to support data cleaning and schema integration. However, as I've said, this is, even today, not a completely automatic process.
Actually, there are a lot of consulting companies which live just from these two sub-processes.
They say, we'll create you filters, patterns and whatever you need so you can get clean data in the data warehouse. We can perform the data integration for you by writing new screens that raise flags when there's a problem. Of course, at the end of this, there is the human being
with two displays, with two records and two buttons, one red and one green, and it says, okay, are these two records similar? Are they the same? Are they different? If the flag raised by the consultant's program has identified these two rows as duplicates,
then a human has to confirm or to reject this hypothesis and press on the green or on the red button. And don't laugh, this is how it's done. So data cleaning is quite a lot of manual work. We'll speak, of course, just about the semi-automatic
and automatic approaches that can be implemented. Okay, so we found out that the data can be dirty. How should the clean data look like? Well, the characteristics clean data should have are correctness.
One example of correct data concerns a store. We know the store is located in Braunschweig. However, in the address, after the street, it is reported that the store is in Paris. Clearly, of the two sources for this information,
one of them is wrong or there are different types of rows which have been joined together. So correctness is a vital characteristic when speaking about data quality. Another one is unambiguity.
So I have to make sure and be sure that the values of the data have just one meaning and cannot be misinterpreted. Consistency. A big problem is, again, when we are speaking about fields
which can be reduced to flags like male, female. In the database from North America, you will probably find M for male, F for female. In Europe, you will maybe find the complete word and in Asia, you will probably find zero and one. You have to get this information together
because actually what this information tells me is the same thing. So the value which I'm going to store in order to load into my data warehouse has to be consistent. Another characteristic is the completeness. We've just spoken about the null values.
We should not allow null values when loading into the data warehouse. We should reduce them, either through a business decision where we say, okay, this value should be replaced with something, or by any other means. The data cleaning engine actually produces three deliverables
which will later be stored in the metadata repository. The first deliverable is the profiling result. We've spoken about it at the beginning. What this is is just a description of the schema definitions, the business objects, the domains I'm interested in
when building the data warehouse, and the data sources. So it becomes, again, technical: table definitions, rules for the data, value rules. Again, for example, how many characters the name should have, the fact that it's a character field
and so on. Everything is important to be stored in the metadata and everything, every piece of information like this is discovered during this profiling process. The second deliverable is the error event table.
This is itself already organized as a classical data warehousing schema, namely a star schema, where each error is logged. We have a screen dimension, a date dimension and a table dimension around the error events. The screen dimension reflects the filters which have been used and against which the data has failed the test, so which kinds of tests have failed. Everything is then logged in the error event fact; this is our fact table, and it helps in identifying where I should raise flags for someone to look over my data. The last deliverable of the data cleaning process
is the audit dimension, which describes what the quality of my data actually is. The context for the audit dimension is usually the fact table. So when I'm going to load my facts, I need to know how each fact complies with the screens I have defined to monitor the quality of my data: for example, completeness when it comes to text or integer columns, or the number of screens it has failed. A screen can be something like checking whether a name corresponds to a dictionary: I define a dictionary with typical names from the United States, I check my client database against that dictionary, and if a row fails this test,
this is going to be registered in the audit dimension. The core of the cleaning engine actually performs the following tasks. It breaks the data into atomic units. This is especially useful when speaking about the address.
A classical example is a field where a client has to input his address in an online form. Sometimes he is asked to break that information into separate fields, sometimes not. Either way, the data cleaning engine has to recognize something like the name of the street, the house number, the city, the zip code and the country. Everything is then split into different attributes and stored as such. Standardization I've already spoken about when I gave an example of data cleaning problems.
So we have to reach an agreement about how we reflect these kinds of notations in our data warehouse. Are we going to store male/female, M/F, or 0/1? This has to be decided during data cleaning.
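To make this concrete, here is a minimal Python sketch of these two steps, mapping the different gender encodings onto one canonical value and splitting a free-text address into atomic attributes. The canonical codes, the 0/1 assignment and the address format are assumptions for illustration, not rules from the lecture or from any particular tool.

```python
import re

# Assumed mapping: which canonical code to use, and whether 0 means male or female,
# is a design decision that has to be confirmed against the source documentation.
GENDER_CODES = {"m": "M", "male": "M", "0": "M",
                "f": "F", "female": "F", "1": "F"}

def standardize_gender(raw):
    """Map the different source encodings (M/F, male/female, 0/1) to one canonical value."""
    value = str(raw).strip().lower()
    return GENDER_CODES.get(value)      # None signals a value that must be flagged for review

# Assumed address format: "<street name> <number>, <zip> <city>"
ADDRESS_PATTERN = re.compile(
    r"^(?P<street>.+?)\s+(?P<number>\d+\w*),\s*(?P<zip>\d{5})\s+(?P<city>.+)$")

def split_address(raw):
    """Break a free-text address field into atomic attributes."""
    match = ADDRESS_PATTERN.match(raw.strip())
    return match.groupdict() if match else None   # None means: needs manual inspection

print(standardize_gender("female"))
print(split_address("Muehlenpfordtstrasse 23, 38106 Braunschweig"))
```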
Verification is a semantic step. A nice example: one has a zip code, and the question is, does it correspond to the city which was given by the client? Is it correct? Or maybe the rows have been misinterpreted: maybe I have put together two rows which actually don't belong together. I can find this out through verification. Verification is also a sub-step of matching. When bringing rows together, we can find duplicate data through matching: we can identify to what extent two records are actually the same record, differing in just a few letters. The classical types of enforcement performed on the columns in the database
are things like checking whether the incoming data contains the values I'm expecting, or whether it fits the expected type. Another such enforcement is about the null values I've spoken about: I'm not allowed to load null values, because this will lead to further problems when aggregating. Then there are numeric values: if I know that the life expectancy of humans certainly does not go above 150 years, then I should probably take care of customers who have supposedly been born before 1800-something. This is a clear case of column enforcement over the date of birth. The same goes for name or character columns where the address is just one letter: a clear case of an error, of a problem in my data. And the list goes on. For example, I can define patterns myself; this is well known from the case of emails or telephone numbers. If the range of valid values I would have to define is too large, I can instead define negative value lists, which are hopefully shorter, and say: when the data comes from this list, raise a flag and record that the data quality check failed, so it goes into the audit with a problem. Spellcheckers can also be used; the classical spellcheckers you use in Word also work on textual data.
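As a rough illustration of such column screens, here is a small Python sketch: each screen is just a named predicate over a row, and a row is checked against all of them. The column names, the 150-year bound and the email pattern are assumptions for illustration, not the exact rules of any real cleaning engine.

```python
import re
from datetime import date

EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")   # simplistic pattern, for illustration only

# Each screen is a (name, predicate) pair over a row dictionary.
SCREENS = [
    ("name_not_null",   lambda r: r.get("name") not in (None, "")),
    ("name_length_ok",  lambda r: r.get("name") is not None and len(r["name"]) > 1),
    ("birth_date_sane", lambda r: r.get("birth_date") is None
                                  or r["birth_date"].year > date.today().year - 150),
    ("email_pattern",   lambda r: r.get("email") is None or bool(EMAIL_RE.match(r["email"]))),
]

def run_screens(row):
    """Return the names of all screens this row fails."""
    return [name for name, check in SCREENS if not check(row)]

row = {"name": "X", "birth_date": date(1799, 1, 1), "email": "not-an-email"}
print(run_screens(row))   # ['name_length_ok', 'birth_date_sane', 'email_pattern']
```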
Other checks concern structure enforcement, for example how the columns relate to each other. The important task here is to check the primary and foreign keys to detect how the tables relate to one another, and how I can then split aggregated groups of fields into different groups of fields, for example through foreign keys or something else
if I'm going to build mini-dimensions. We've spoken last week about the demographics mini-dimension. Something like this is going to split the address, because I'm not going to need the street; I'm going to need only the zip code, or maybe the city, if I'm doing a marketing analysis on where my clients come from. So I'm going to split this field into more attributes, then split the client dimension into a mini-dimension and the main dimension, and connect them to the fact table through foreign key relations.
Other quality checks are related to data and value rule enforcement. A classical situation is where I have a customer database with only companies in it, and then the same customer is simultaneously registered as a limited company (I think in German this is the GmbH, right?) and as a corporation. This status cannot be: I define the table as holding information about corporations, and then I see that the value is actually 'limited'. This is a contradiction in terms, and this kind of problem can be found through data and value rule enforcement. Such filters can also include statistical checks. Classical cases are when names have to be checked. One can find in the database a male customer with the name Sue or Mary; it might be correct, but it would be better that a human looks over it, because Sue is not a classical name for male humans. It's an anomaly, if you want. Something like this can again be checked through data and value rules. We'll also speak about anomalies in a short while.
Let's see an overall sketch of how data cleaning is actually performed. We have the staged data and we have the deliverables of the data cleaning process. After profiling the data quality, we start by defining the filters which we need to check our data, the so-called screens. These screens are then executed on the data which we have staged in flat files, a database, XML or whatever. Quality measures result from executing the screens over the staged data, and error events for the fact tables are generated. These error events produce metadata which has to be stored; it goes into the audit dimension, and the ETL process will continue provided that no errors from the error event fact appear on the list of fatal errors. If there are errors which are considered fatal by our business process analyst, then everything stops; otherwise, the process continues. Of course, to make the process run faster, there are also ways of running not just screen by screen but of defining waves which work on the same database or table, an aggregated way to use the same source with all the screens at once. So I have the data and I have 10 screens working on that data; I'm going to run those screens as a wave. Is it clear how the process works?
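A minimal sketch of this orchestration, assuming in-memory rows and hand-written screens: all screens run over the staged data in one wave, every failure is logged as an error event with its screen, date and table, and the process only continues if no fatal screen has fired. Function and field names are invented for illustration.

```python
from datetime import date

def run_wave(table_name, rows, screens, fatal_screens):
    """Run all screens over the staged rows in one pass (a 'wave'), collect error events,
    and return a go/no-go decision for the rest of the ETL process."""
    error_events = []                            # rows of the error event fact table
    for row_id, row in enumerate(rows):
        for screen_name, check in screens:
            if not check(row):
                error_events.append({            # screen / date / table play the dimension roles
                    "screen": screen_name,
                    "date": date.today(),
                    "table": table_name,
                    "row_id": row_id,
                })
    fatal = [e for e in error_events if e["screen"] in fatal_screens]
    return error_events, len(fatal) == 0

screens = [("amount_not_null", lambda r: r.get("amount") is not None)]
events, may_continue = run_wave("sales_stage",
                                [{"amount": 10}, {"amount": None}],
                                screens, fatal_screens={"amount_not_null"})
print(len(events), may_continue)   # 1 False
```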
Everybody sleeping? Yeah, good. A fair question which could come right now is: is the data really that dirty? Do we really need these mechanisms? Is it necessary to speak so much about data cleaning? Well, someone from a Canadian company came to visit our institute last week, and they said they now develop only products for cleaning data. They used to have data warehousing solutions, but they don't care about them anymore, because that market is already occupied by big companies. A very good way of sustaining their company is developing just data cleaning tools. It's a way of living, and it also shows that the data in big companies is actually dirty.
And so we come to our first detour, but we should probably make a short break first. And yeah, we'll continue at ten past four, yeah? Okay, so let's continue.
For the detour, I've prepared a small example of garbage in, garbage out. I wanted to download something from MySQL, so I tested how good their interface is for inputting new clients. Of course, I just inputted what any user would give in order to get the software. I wanted to write something fast, so I did. I knew that they would probably use some kind of pattern-based email address filter, so I just wrote something looking like an email. The password, names that don't look like names, whatever came to mind: of course, all of this was accepted as valid. The first hurdle was that they require a confirmation of the email address, but as you all know, there are these ten-minute emails one can use. Just Google for ten-minute emails, build yourself an email address which is valid for only ten minutes, receive the email from MySQL, click on it, and then you have an account; you use it, you download the software, and then you're gone. What happens on the other side, however? MySQL keeps this information as a client record with supposedly very important data to mine: for example, someone with this name, from this postal code, from Cambodia, who's a hacker and has a company with 5,000 employees and an interest in their software. Is it important for data mining? Is it of high quality? Actually, it's not. It's crap, and following the principle crap in, crap out, they can't really get great results by allowing something like this to go into their analytics.
We'll look at some approaches of identifying such data through the data cleaning process. There are, of course, open source solutions and non-open source solutions.
Most of them are of a semi-automatic nature. From the open source side, I would like to mention DataCleaner, which is quite easy to use and does very good work, and the Talend Open Profiler. As for the non-open-source tools, all the big vendors have bought themselves small companies which started out performing data quality checks. I've taken some screenshots of DataCleaner, just to show you how it can work for simple tasks. For example, for email addresses you can define your own regular expressions; you can test the regular expressions you have developed with some inputs and see what passes and what doesn't. You can also use already-defined regular expressions from their library, and so on. One good example is the range of categories they cover, so many types of data for which they have defined validity checks through regular expressions; an example here for date and time. A very important task when performing cleaning, as I've already mentioned, is anomaly detection.
That means data which doesn't really fit the picture. One example I've already used was a record which says that a male is named Mary. It's kind of strange, so a human should probably look over it and see through the metadata where the data came from. Maybe it resulted from a match of two records where a mistake was made in interpreting a male/female flag encoded as zero and one, and this resulted in a male with this strange name.
So anomaly detection is quite an interesting and important field when speaking about data quality. This one with the name is not actually the best example; the most interesting cases for anomaly detection are large sales numbers, distributions of data, and so on. For example, if I'm going to investigate anomalies in my sales numbers for cars in different cities, I can do a select, report my counts per city, and I can already see that something might be wrong with Bremen or Berlin, because compared with all the other cities the deviation is quite high. It can't be that I've sold only two here when in Hanover I've sold so much. Maybe it's true, maybe I've just opened a store there and didn't sell anything, but I should look for an explanation; I should see whether something is really wrong with those numbers. Of course, you can't spend all your life performing anomaly checks, and as I've said, data warehouses are usually pretty big.
You have fact tables with hundreds of millions of records. What can you do? The easiest solution is to perform data sampling: just take a part of the data, you don't need to take everything. A classical solution is to divide the data into 1,000 buckets and choose one record from each of them. Another solution is to add a random number as a column: add a new column to all of the rows and take the 1,000 rows with the smallest random values. Since they are random, I can be more confident that I've picked fairly across the records. Why am I so interested in picking fairly and uniformly? Why shouldn't I just choose a range of dates, say, give me just the last month and I'll search for anomalies there? The biggest problem is that anomalies usually happen temporarily, on a time basis. This is why I have to take random picks in order to hopefully detect something through data sampling. Of course, the sampling ratio also has to be chosen wisely.
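Here is a small Python sketch of the two sampling variants just mentioned, under the simplifying assumption that the rows fit in memory; in a real warehouse the same idea would be expressed in SQL or through the sampling facilities of the DBMS.

```python
import random

def random_column_sample(rows, n=1000, seed=42):
    """Attach a random number to every row (the 'random column') and keep the n smallest."""
    rng = random.Random(seed)
    keyed = sorted((rng.random(), i) for i in range(len(rows)))   # sort by the random column
    return [rows[i] for _, i in keyed[:n]]

def bucket_sample(rows, buckets=1000):
    """Divide the data into buckets and take one row from each bucket."""
    step = max(1, len(rows) // buckets)
    return rows[::step][:buckets]

data = list(range(100_000))                       # stand-in for staged rows
print(len(random_column_sample(data, n=5)), len(bucket_sample(data, buckets=5)))
```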
I'm not going to detect any anomaly if out of 100 million records I choose only two, right? Thank you very much, what can I possibly see in that? Another approach is data profiling. Profiling of data is very commonly used when dealing with strings.
Names are a typical case, and I've brought here an example where I check the names from my database against a dictionary.
The dictionary was generated by someone, so again manual work. I just check the names and can see, for example, that 47 records from my database have names which do not match anything in the dictionary. Then I can look through these 47 and see what's wrong. Why aren't those names there? Are they maybe Spanish names, while I've checked against US names? Are they valid names or just junk? Either way, this reduces by a sizable amount the records I need to check manually, which is exactly the purpose.
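A minimal sketch of such a dictionary check; the dictionary contents and the column are of course made up here, and in practice the dictionary is the manually maintained part.

```python
def profile_names(names, dictionary):
    """Return the names that do not appear in the reference dictionary."""
    known = {n.lower() for n in dictionary}
    return [name for name in names if name.lower() not in known]

us_first_names = {"John", "Mary", "Susan", "Robert"}          # illustrative dictionary
customers = ["John", "Maria", "Xx7", "Robert", "Giannis"]
print(profile_names(customers, us_first_names))
# ['Maria', 'Xx7', 'Giannis'] -> only these need a manual look
```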
Data profiling goes, however, even further and tries to work with distributions. The simplest approach is to take a closer look at values which don't fit a certain distribution.
I'm probably going to look at data which is bigger or smaller than the average plus or minus three times the standard deviation, you know, the three-sigma rule. Everything falling outside such bounds is quite exclusive: it is either an anomaly, or it is correct data which nevertheless doesn't quite fit the picture and should be investigated more closely.
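Here is a small sketch of the three-sigma check applied to the car-sales-per-city example from before; the counts are invented so that one city clearly falls outside the band.

```python
from statistics import mean, stdev

def three_sigma_outliers(counts):
    """Flag entries whose value lies outside mean +/- 3 * standard deviation."""
    values = list(counts.values())
    mu, sigma = mean(values), stdev(values)
    low, high = mu - 3 * sigma, mu + 3 * sigma
    return {city: c for city, c in counts.items() if c < low or c > high}

sales_per_city = {"Hannover": 480, "Braunschweig": 455, "Wolfsburg": 510, "Kassel": 470,
                  "Goettingen": 495, "Kiel": 500, "Luebeck": 465, "Mainz": 485,
                  "Trier": 475, "Ulm": 490, "Jena": 505, "Bremen": 2}
print(three_sigma_outliers(sales_per_city))   # with these made-up numbers, Bremen is flagged
```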
When talking about keys, they should have a flat distribution, and I should make sure of that. Otherwise, I may have had problems when creating the foreign keys, for example, or the primary key constraints if the data comes from flat files. Another classical distribution is the Zipfian distribution when it comes to sold products.
Usually, companies sell far more cheap products than expensive ones. If you compare the sales of cars with the sales of pencils, the numbers are pretty clear; you get a Zipf-like distribution when you order products this way.
And when you are a car dealer and you sell both Volkswagens and Porsches, and you suddenly see that you basically sell three Volkswagens a month and ten Porsches, then either you have rich clients and you live in Beverly Hills,
or something is wrong with your data warehouse, right? So the distribution is a nice mirror to look into to see what happens with your data. Of course, there are many more distributions you can investigate and fit over your data to see which one actually fits: Pareto, Poisson, the S-shaped distribution, and so on. There are also statistical software packages for this; the most used is SPSS, which is not an open source solution. Throughout our detours I usually mention what is open source and what is not, in case you want to try things at home, and there are free packages that can be used for something like this as well. Okay, we've now spoken about the data cleaning part of data transformation.
We will continue with the second part of data transformation, namely integration. The need for integration comes from the fact that in the operational environment we have different tables and different sources, but the target is a data warehouse which has a different schema: either a star or a snowflake schema, or a schema derived from these two. In such cases I need to integrate the data I have from the operational environment, and during this process I need to resolve all the differences between the perspectives on the data in the operational environment. I also have to be careful not to keep any redundancy.
Let's take a closer look at the integration of schemas. This process actually involves four steps. First comes the pre-integration analysis, where an analysis is performed to see which schemas are to be integrated. The schemas to be integrated are then compared and made to conform with each other, so that I can use data from two sources which is actually the same data. For example, the sales of products from America and the sales of products from Europe are simply sales for me in the data warehouse; however, the schemas might differ, for instance the price might be called cost in one of the tables as the attribute name. Such things need to be recognized as conflicts, understood, resolved, made conform, merged and restructured into a unifying schema, and only then can the data be loaded into the data warehouse. Of course, integration is also a continuous process which needs refinement, because when a modification to the schema of one of my sources is performed, the merging process has to adapt so that I can get the data from the new schema.
And for this reason, this has to be performed on a continuous basis. Okay, so we've said that we start with the pre-integration analysis. Through this analysis, we identify schemas with conflicts.
So two schemas which hold the same type of data, which should be brought together. We identify conflicts, like for example, cost versus price, through comparisons of schemas. We build a list of conflicts, and then we try to resolve them.
We'll see that this is also largely manual work. We then try to conform the schemas into a unifying one: we modify the schemas, integrate them, and then merge and restructure the data into the final integrated schema.
So this is what the whole process looks like. The pre-integration analysis has to choose a certain strategy. It is often the case that the number of tables in my sources is so large that I first need to perform a pre-modularization step: I have to perform an analysis and see what belongs together and where I should look more closely. So I perform a modularization when the choice is too large for me, and then I take a closer look. Of course, you can't integrate everything. The operational data system usually comprises a lot of tables, and you are not going to integrate everything into the data warehouse; it is not even desirable to do so, and as I've said, the ETL process is also limited in time. But there are different strategies according to how you can modularize and
how many tables you have to integrate. One can differentiate between two families of strategies: the binary approach and the n-ary approach. With the binary approach, one starts with two tables, integrates them, adds another one, performs the next integration step, and so on, a second step, a third step, until I have all the tables from all my sources. Another solution is balanced integration, where I choose to split the process into two or more integrating sub-processes: I perform an integration here, another integration there, and so on. Then there is big bang integration; it can be difficult, but if you have only a few tables and a few sources to integrate, it is also a valid approach. And finally there are heterogeneous solutions where you can have it all: the binary approach combined with some smaller big bangs of three or more tables, and so on. This all depends on what the operational system looks like. The conflicts we were speaking about can occur at different levels. One of them is the table-to-table conflict.
Let's consider a table called sales, and a German table called Verkäufer. How does the data warehouse know that the information stored in these two tables should actually be brought together? Well, probably by using a translator; that's valid, and it can be used. Identifying such conflicts can be done using dictionaries, translators, and synonym or homonym lists; WordNet, for example, is a source for mapping two terms to the same meaning. Another type of table-to-table conflict is the structure conflict: one structure is more comprehensive, one has more attributes than the other, but the data is actually almost the same, apart from, say, an attribute which can be derived from somewhere else. I can imagine a client table where I have the name and the address, but the address is just one string field, and another client table where the address is split into separate attributes; this is such a conflict. Integrity condition conflicts, for example different data types or data lengths, can also lead to table-to-table conflicts. Attribute conflicts caused by naming can again be solved through dictionaries and so on. There are also default value conflicts: in one table the default value is zero, in the other it is a space, just a space; a conflict which needs to be resolved. Integrity conditions play a role here as well, and there are mixed conflicts that combine table structure conflicts and attribute conflicts. All of this needs to be resolved by the integration process. The goal of this process is to achieve a compatible schema where the data can be stored together, unified, and
then loaded into the data warehouse. However, in order to achieve such a schema from different types of sources, manual integration is usually required. What do you do with semantic conflicts, as I've said, for example, cost and price? What if both of them appear in sales tables and are meant to be the same thing? Syntactically, they are very different; semantically, they are related, if not the same thing. A computer can't connect these two concepts that easily. Maybe by using technologies like artificial intelligence or latent semantic analysis, one could reach this conclusion through additional information and say, okay, they are semantically related, so I can bridge this data together.
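A hedged sketch of such a semi-automatic proposal step: a hand-made synonym table stands in for the ontology or latent-semantics component, and plain string similarity catches small syntactic differences. The synonym pairs and the threshold are assumptions for illustration; the human decision afterwards is exactly the part that cannot be removed.

```python
from difflib import SequenceMatcher

# Hand-made synonym table playing the role of the ontology / latent-semantics component.
SYNONYMS = {frozenset({"cost", "price"}), frozenset({"client", "customer"})}

def attributes_match(a, b, threshold=0.8):
    a, b = a.lower(), b.lower()
    if a == b or frozenset({a, b}) in SYNONYMS:
        return True                                              # exact or known semantic match
    return SequenceMatcher(None, a, b).ratio() >= threshold      # small syntactic difference

def propose_matches(source_attrs, target_attrs):
    """Semi-automatic step: propose attribute pairs; a human still confirms or rejects each one."""
    return [(s, t) for s in source_attrs for t in target_attrs if attributes_match(s, t)]

print(propose_matches(["prod_id", "cost", "customer"],
                      ["prod_id", "price", "client_name"]))
# [('prod_id', 'prod_id'), ('cost', 'price')]
```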
Other things which need to be done manually are the renaming of entities and attributes, in cases where the difference is not a small syntactic one like a missing letter or two. Cardinalities and functionalities, as well as different data types, also fall into the same category. For a date, maybe the application which uses the table in the operational system for North America uses timestamps, while in Europe the DBMS uses the date type. This has to be brought together. What I want to stress here is that the core problem when performing schema integration is the semantic component of integrating the data. This is why it results in manual work: matching parts of the schemas can't be done completely automatically, because bridging this information can't be achieved with 100% certainty
only through algorithms. Of course, there are semi-automatic approaches. As I've mentioned, there are solutions which lead you to decisions of the form: I, as an algorithm, propose that these two entities are semantically similar, because through latent semantics, or as a distance in a domain-knowledge ontology, I've seen that the distance between them is quite small. However, a human has to be there and decide: yes, it's the same thing, or no, it's not. Doing this fully automatically is error prone, and even doing it with humans can result in errors through misunderstanding. The matching process can be semi-automated through labels:
I look at the name of the table and the names of the attributes and try, for example, to use an ontology, and I see that in my ontology the distance between price and cost is quite small. They are used in the same field and, moreover, they are quite close to each other in meaning; I could then solve something like this by matching them semantically. I could go even further and perform instance-based matching: look at the data and see how it is distributed, and say, okay, it seems that the data from these two sources has a very similar distribution, if not the same, which might lead to the conclusion that they are the same thing. Another solution is structure-based matching, where I come from the structural side and ask: what kind of attributes does this structure define? What do my attributes look like in the other one? Which elements am I going to store? What are the depths of my hierarchies, the relations, the foreign key relations, and so on? If they look the same, then I can reason from this and say, okay, I should probably perform a schema matching here, because they are the same thing. An example of query-driven schema mapping is the following:
we have a source and a target which match at a high level, and we need to perform the low-level mapping. The idea is that this low-level data, let's consider it as sales, sales for North America and sales for Europe, should be brought together; only the table on one side is different from the table stored on the other. I have an example here of how the schema matching is actually performed: here I have a product ID, there just an ID; the product is the same; here a group ID, so a foreign key; there I have the description of the group and the category. They are the same thing, in two different regions, stored in similar schemas. Through a query, I want to transform the data from the first schema so that I can query it like the second one, yeah? This is what I actually need to do here: I want to abstract from the schema itself, because I can detect that the two match, and map the data from the first source onto the schema I've proposed in the second part, yeah? Well, the task is not that easy.
So as I've said, I have a high-level mapping, and I want to generate a query or a set of queries that transforms my data from the left side and integrates it and makes it conform to the target schema on the right side. The problem which arises with something like this is generating the correct query: how do I generate the query or queries, and how can I be sure that these queries perform the correct mappings? And to make it more complicated, I have to guarantee that the transformed data corresponds to the target schema, yeah? These are the problems we need to answer. Of course, solutions can come from industry, and they say: we'll do the schema integration for you. One such example is AquaLogic from BEA Systems.
They have been owned, I think, for two years now by Oracle. If you look at their previous website through the web archive, you can see they were promising that mappings and transformations can be designed in an easy-to-use graphical user interface, with 200 functions in a library. For complex mappings and transformations, and this is the key part, architects and developers can bypass the GUI. So if things get more complicated than just syntactic matching, you need consultants: BEA consultants, they are the architects and developers that bypass the GUI, yeah? And they use XQuery, which is an open standard for querying XML, to define or edit the services. So if you take a closer look at what BEA actually does: what can be solved syntactically, they do semi-automatically; for what can't, you need to buy their consultants, who will do it for you without the GUI, the graphical user interface.
This is how they make money with it. What do they actually support, what does their tool do? They have a data transformation tool that transforms binary data into XML and XML back into binary data, so the transformation is actually performed on XML. They get the data from your source into XML, perform some modifications and transformations there, and then convert it back into binary data. So essentially they build converters around an XML-to-XML transformation: transform the data from an application-specific format to XML, transform that XML into another general schema which they can read, and then, after the transformation has been done manually, transform it back to binary; so, just converters. Of course, I didn't have the software to make an example with it, but I've used something similar, which will come up in the detour on the next slides. The alternative to using a consultant is to do it yourself.
You can use open source projects: data mappers, XML engines, XML editors, XPath and XQuery. There are also specialized tools for it; CloverETL and Altova MapForce are just two examples. I've brought an example from Altova MapForce, which, as they say, works on the same idea as the BEA integrator. The idea here is to use XSLT between the temporary XML results. They offer a nice editor which loads the source XML, and I then specify what my XML should look like in the target format. By connecting the fields which are the same thing and can be mapped, it's just a matter of clicking here, and here, and here. Through my clicks, Altova generates a transformation, an XSLT file, which transforms the loaded XML source file into a destination XML after I've done my manual work in the editor. Then a converter to binary is used so that the result can be loaded into a database management system. So the secret is that most of the work still has to be done manually in the editor: they just port you from one XML to another, but the links are established by you.
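For a do-it-yourself flavour of the same idea, here is a small Python sketch that plays the role of such a generated transformation: it renames source XML elements to a target schema according to a hand-written field map. The element names and the mapping are invented; in the tools above, this mapping is exactly what you produce by clicking in the editor, and the generated artifact would be XSLT rather than Python.

```python
import xml.etree.ElementTree as ET

# Invented mapping: it stands in for the links you would draw by hand in the editor.
FIELD_MAP = {"ID": "product_id", "cost": "price", "group": "group_id"}

SOURCE = """<sales_eu>
  <sale><ID>17</ID><cost>9.99</cost><group>3</group></sale>
  <sale><ID>18</ID><cost>4.50</cost><group>3</group></sale>
</sales_eu>"""

def map_to_target(source_xml):
    """Rename source elements to the target schema, the way a generated XSLT would."""
    target = ET.Element("sales")                      # assumed target root element
    for sale in ET.fromstring(source_xml):
        row = ET.SubElement(target, "sale")
        for child in sale:
            mapped = ET.SubElement(row, FIELD_MAP.get(child.tag, child.tag))
            mapped.text = child.text
    return ET.tostring(target, encoding="unicode")

print(map_to_target(SOURCE))
```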
Okay, let's go to the last part of our ETL process, namely the loading. Loading can also be broken down into parts: the initial load and the continuous load over time, because I need to keep updating my data warehouse. The biggest issue of loading, especially of the initial load, is the huge volume of data: we're speaking of large amounts of data which I need to load in a small time window. A big problem here are the indexes and the aggregated tables. Loading has to consider that we may have materialized views, or that we may be using MOLAP; everything has to be handled at this stage. Aggregations may have to be recalculated, and we have spoken about strategies for updating materialized views two lectures ago;
the indexes, likewise, are a problem which loading has to deal with in this phase; we'll speak about that. Another issue loading has to deal with is recovery. If I start the loading process and then something fails, I don't want to restart it from scratch, because restarting raises the question of what I have already loaded into the data warehouse; hopefully I won't introduce duplicates, so I would have to compare, and so on. Recovering gracefully is therefore a very important task of the loading tools.
For the initial load, one has to establish what the dimension tables are, and create and assign the keys between the dimension tables and the fact tables. One must ensure that when a new dimension is loaded, it is conformed for all the records which are also loaded into the fact table. And of course, I have to be careful to write the dimensions in the proper format: if I have physical multidimensional storage or relational multidimensional storage, I have to take care of the snowflake or star schema accordingly. As for the fact tables, I've said that they're usually pretty big.
The solution is to use some bulk load utility. The secret behind bulk loading is that it doesn't perform the classical inserts row by row. Most databases and all data warehouse systems support bulk loading; when it is used, they disregard the constraints and checks that database management systems normally perform, because these take time. So when an administrator says, I'm using the bulk loading functionality on this big chunk of data, the management system knows: okay, I won't do any checks, any constraints, anything; I'll assume you know what you're doing and that the data is correct, so that you can finish faster. That's the difference between this and classical inserts. Of course, the best solution for
performing something like this is to do it distributed, in parallel. As for classical loading tools: if you search on Google for data transformation services, you will come up with a list of tools suitable for something like this. There is the bulk copy utility bcp, a classical, well-known and pretty old tool for this, and there is SQL*Loader; these are the classical contenders for initial loading. When it comes to loading over time, of course,
this has to be scheduled: when should loading be done? Here we always have a tradeoff between how complete our data is and how fresh it is. You are not going to see a data warehouse which performs yearly loading, because then the decisions you take on a monthly basis are affected. But you are also not going to see many data warehouses which reach hourly loading granularity. Of course, loading is a most sensitive step, because this is the transition between the staging area and our data warehouse. So I have to make sure that I'm not creating duplicates, that I'm not loading things twice, for example if the loading process breaks, and I have to be sure that I'm not amplifying inconsistencies through multiple loads.
These are things that the data warehouse developer has to take care of when proceeding with the loading step. There are also some best practices for this ongoing loading of facts. For example, I'm going to separate the updates from the inserts, so new data from data which is already there and just gets updated. I've said something about the indexes, which need special treatment. The solution is to drop the indexes which are not required for the updates, load the updates, and then drop all the indexes for the inserts. Otherwise, when I perform the inserts, I also need to update the index structures, and since there are usually a lot of them, this costs a lot of time. I don't want to do this on a fact table which has millions of records, so I drop them and then construct the indexes one more time afterwards. I have already said in the indexing lecture that reconstructing indexes is also a good opportunity for building the optimal geometry for R-trees, if you remember the minimum bounding rectangles and the heuristics for avoiding dead space. When rebuilding the indexes, you of course build them with the optimal geometry and the least dead space; this is the moment to do it.
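As a sketch of this drop, bulk-load, rebuild pattern, assuming a generic DB-API connection and hypothetical table, index and helper names; the actual bulk step would be the DBMS's own utility (for example COPY, SQL*Loader or bcp) and is only hinted at here.

```python
def reload_fact_table(conn, data_file):
    """Drop indexes, bulk-load the fact table, then rebuild the indexes once."""
    cur = conn.cursor()
    indexes = {   # hypothetical index names and DDL; syntax details vary by DBMS
        "idx_sales_date":    "CREATE INDEX idx_sales_date ON fact_sales (date_key)",
        "idx_sales_product": "CREATE INDEX idx_sales_product ON fact_sales (product_key)",
    }
    for name in indexes:                    # 1. drop indexes so the inserts stay cheap
        cur.execute(f"DROP INDEX {name}")
    bulk_load(cur, "fact_sales", data_file) # 2. hypothetical wrapper around the DBMS bulk utility
    for ddl in indexes.values():            # 3. rebuild each index once, with optimal geometry
        cur.execute(ddl)
    conn.commit()
```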
After you've performed the ETL, you have the data in the data warehouse. But in the last few years, a lot of data warehousing solution companies have been speaking more and more about the metadata. What should one do with metadata when speaking about the data warehouse? Well, metadata is actually an important part when considering the quality of the data, because it tells us what we have actually stored in our data warehouse and where it comes from. More specifically, the repository which holds the metadata stores the intermediate products we've spoken about when performing extraction, transformation and loading. We've always said we perform an analysis to see what the operational system databases look like, from table names to attributes, data types, lengths, and so on.
All of this is stored in the metadata, together with the audit errors which have been registered through the screens and so on; everything is here. As distinct types, we have the metadata which comes from the source system, the ODS, the metadata which comes from the data staging area, and the metadata of the database management system itself, the actual warehouse metadata. For the source specification, we are talking about the source logical schemas, and also about ownership descriptions: who owned the data which we have now ported into our data warehouse. This is data about the data stored in the data warehouse: how often was it updated? What kind of access methods were used? Was it, for example, stored directly in the operating system, or accessed through a database management system? Was it a relational table, and so on? Other metadata coming from the source system describe how the data was processed, how the job was scheduled, and how the data was extracted, so what kind of extraction code was needed to get it out of the operational data system. From the data staging area we have metadata like the scheduling and the types of files we have used, and the intermediate results we have obtained. We have the dimension tables, of course, with the surrogate keys we have assigned between the dimension tables and the fact tables. We have the transformations which the data has undergone from the beginning of its lifetime in the ODS until it reached the data warehouse, the aggregations it went through, the mappings we have performed, the loading information, the aggregation definitions, the audit, the errors. All of this is stored in the metadata.
And I've brought here an example of what the metadata for a simple cube looks like. It is stored in XML; the cube itself is just a star schema, but the corresponding metadata, which resulted from prototyping the data warehouse, is also stored in the metadata repository. Of course, it has a name, it describes the cube, it has dimensions with hierarchies and levels of granularity, the table where each of these is stored, and it has the fact reference, yeah, so it's pretty intuitive.
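For illustration, here is a made-up fragment in that spirit, parsed with Python's standard XML library; the element names mirror what the slide describes (cube, dimensions, hierarchies, levels, fact reference) but do not follow any real metadata standard.

```python
import xml.etree.ElementTree as ET

# Invented metadata fragment: names and attributes are illustrative only.
CUBE_METADATA = """<cube name="sales_cube" factTable="fact_sales">
  <dimension name="time" table="dim_time">
    <hierarchy name="calendar" levels="year,quarter,month,day"/>
  </dimension>
  <dimension name="product" table="dim_product">
    <hierarchy name="product" levels="category,group,article"/>
  </dimension>
</cube>"""

cube = ET.fromstring(CUBE_METADATA)
print(cube.get("name"), "->", cube.get("factTable"))
for dim in cube.findall("dimension"):
    for hierarchy in dim.findall("hierarchy"):
        print(dim.get("name"), dim.get("table"), hierarchy.get("levels"))
```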
This is what you have in the metadata. You might ask yourself, why should I store all this? Think of when you add a new data mart for a department, and that department says, okay, I need this and this and this. Then you would have to rely on the table names alone to know what is actually in the data warehouse. Well, that is not enough: you have to see where the data that is now in those tables or structures came from, and you can get this from the metadata, yeah?
As a short summary, what I would like you to remember from today's lecture is that the data warehouse project is usually a complex task.
It can have from 500 up to 1,000 sub-tasks. It involves choosing the hardware and the software, and establishing a timeline. The timeline usually comprises three phases: building a very specific prototype with just one data mart in mind, then trying it on a set of beta testers, and then making it available to the whole company. The most important process when building the data warehouse is the extract-transform-load process, which involves establishing the storage structure in the staging area and the strategies to use for extraction. How should I decide, for ongoing extraction, what has changed? For example, sniffing or scraping of logs, yeah? And what are the problems of scraping and sniffing? For example, the issue that the log will probably be full and emptied by the database management system at some point. We have spoken about the transformation sub-process, which involves assuring better data quality; of course, the major problem here is the semantics. The semantics is also problematic when performing integration, which likewise results in manual work assisted by semi-automatic approaches. We've spoken about the last sub-process of the ETL, the loading: the main issues, such as how it deals with indexes, and the solution of loading everything into the fact table as a bulk. And we've spoken about the metadata as a means to describe the information which we have in the data warehouse; worth mentioning is that it comprises all the intermediate results of the ETL process and helps in understanding how to use the data warehouse, eventually for building and supporting new data marts. Next lecture, we will start with business intelligence.
We will deepen our discussion of data warehousing and OLAP by going into data mining, and how we can derive information by analyzing the data.
We'll speak about the basic principles of data mining, and we'll start with a concrete example, association rule mining: a classical kind of data warehouse analysis which is popularly called market basket analysis. So, what have people bought together? Like wine and cheese, or stuff like that. Actually, Real, for example, is doing something like this on their data warehouse just to see how to stock their shelves: where to put the wine, where to put the cheese. They should be together, because when the client comes in, he buys cheese and says, hm, that would go well with wine, and he buys more. Stuff like this: market basket analysis, association rule mining. That's it for today, we'll see you next time.