
Data Mining Overview, Association Rule Mining (16.12.10)


Formal Metadata

Title
Data Mining Overview, Association Rule Mining (16.12.10)
Series title
Part
8
Number of parts
13
Author
Contributors
License
CC Attribution - NonCommercial 3.0 Germany:
You may use, modify and reproduce the work or its contents for any legal and non-commercial purpose, and distribute and make it publicly accessible in unmodified or modified form, provided that you credit the author/rights holder in the manner specified by them.
Identifiers
Publisher
Year of publication
Language
Producer
Production year
2011
Production location
Braunschweig

Content Metadata

Subject area
Genre
Abstract
In this course, we examine the aspects of building, maintaining and operating data warehouses, and give an insight into the main knowledge discovery techniques. The course deals with basic issues like storage of the data, execution of analytical queries and data mining procedures. The course will be taught completely in English. The general structure of the course is:
Typical DW use case scenarios
Basic architecture of a DW
Data modelling on a conceptual, logical and physical level
Multidimensional E/R modelling
Cubes, dimensions, measures
Query processing, OLAP queries (OLAP vs OLTP), roll-up, drill-down, slice, dice, pivot
MOLAP, ROLAP, HOLAP
SQL99 OLAP operators, MDX
Snowflake, star and starflake schemas for relational storage
Multidimensional physical storage (linearization)
DW indexing as a search optimization means: R-trees, UB-trees, bitmap indexes
Other optimization procedures: data partitioning, star join optimization, materialized views
ETL
Association rule mining, sequence patterns, time series
Classification: decision trees, naive Bayes classification, SVM
Cluster analysis: k-means, hierarchical clustering, agglomerative clustering, outlier analysis
Transcript: English (automatically generated)
So, hello everyone, welcome to the lecture data warehousing and data mining and we kind of are through the data warehousing part so we can kind of like get rid of that and for the rest of the term we will deal with data mining issues and some very exciting algorithms and interesting ways to get more out of your data.
And so today we will start with a short introduction into what business intelligence is and then present the first algorithm which will be frequent item set mining and association rule mining today.
So last time we were talking about how to build data warehouses that concluded our idea of what a data warehouse actually is in terms of the software that is done we were talking a little bit about the ETL process which is the most important process in the life cycle of a data warehouse
because the old crap in, crap out paradigm is still valid so if the data in your data warehouse is bad then all the decisions based on that data and all what you can mine from this data is of low quality too.
And that's kind of very annoying so what you have to do is you have to consider how do you get the data how is the data transformed into some global schema that you can really use for a lot of applications which is extensible also for new and upcoming applications
and then you have to look in the transformation phase a little bit into the topic of data cleaning how do you make sure that the data is really correct and that the data is complete because that determines the major quality issues of your data warehouse
and interestingly enough if you look at some of the big companies today that have huge data warehouses terabytes and terabytes of data and you look at the quality of the data that is actually in there you will more often than not find that the data quality is actually pretty poor
and of course that is a problem in today's data warehouses in bigger companies that cost a lot of money to clean Finally you have to load it that is usually a process that is done in bulk mode
so you load all the data that has been cleaned that has been transformed that has been extracted from the underlying productive sources and put it overnight or whenever into the data warehouse and the data warehouse is ready for all the OLAP queries that you could think of the next day
Another thing that we were talking about briefly last time was metadata so how do you describe what data actually is in the data warehouse how do you keep up with the metadata and how do you get an understanding about the nature of the data that comprises your data warehouse
Another thing that is very often done these days in terms of metadata is the so called data lineage or data provenance so information about where does the data come from That can be very interesting to see if there are some oddities or some things that you just don't understand
Looking at the trail of data may very often help to get a feeling how this data was aggregated and maybe there is something wrong So this can point you really to interesting insights about your data warehouse But as I said today we want to go into data mining rather
and the major term that is needed if we talk about data mining and data warehouses in one lecture is business intelligence because that is what you want You want to find information that you don't know yet
about your business, about your company, about your customers, about whatever you have in your data warehouse Since you have all the data in the data warehouse that's a good place to look for it On the other hand it's quite a difficult thing to look for that because you just dump the data in the data warehouse
and there is no way that you find out some hidden relationships or something You just have your schema and loads and tons of data So we will briefly provide an overview of what business intelligence challenges are today and then start with data mining
and today will be one of the most important algorithms for data mining that is very often employed especially for marketing purposes Association rule mining Good! So what is business intelligence?
Business intelligence is kind of insights about your company Insights about what you do in your company and it's very often said that what in your data warehouse actually resides is data
and what you need to make decisions is information and those are two totally different things You can generate information from data Well actually you have to generate information from data otherwise it would be guesswork and the data gives you the support to have some statements about what's going on
and the information makes the data digestible in a way So you can easily grasp the concepts underlying things And there is another step, many people talk about this pyramid kind of shape
You have a lot of data in your company You extract information out of the data and what is the peak of our little pyramid here?
No, decisions are the actions that you take based on what is in the pyramid. Knowledge, exactly. So this is the most refined analysis data that you will have
It's built up on information and gives you some time perspective into it and some decision support rather The decisions that you then take is basically the actions that is supported by your knowledge
And so what you basically do is you take the data warehouse This is here where all the data resides The data warehouse And to get from here to there you use analytic tools So this is where OLAP resides
And this is where data mining algorithms reside Working on the base data and generating information from that And this is basically a manual step A manual step putting together information into knowledge
which is basically the work of analysts That just goes through all the information and then build strategies from it and find out what you should do to improve the processes of your company or to increase revenue of your company
or to increase the happiness of your customers or increase the creativity of your workforce or whatever, it can be all kinds of stuff But it's securely founded by all the data that we have stored in our data warehouse
That's the basic idea Okay With that we're going to store again, I see Same old problems every year There we go
Come on Business intelligence, hello So what are typical applications for business intelligence? One very important thing is segmentation Mostly customer segmentation or product segmentation where you find different groups of customers
that you could address by certain advertising campaigns or whatever You find out how the market is actually segmented and whether your products cater for specific segments or for the market as a whole
And then you have decisions like Well, if I don't cover the whole market yet, why don't I address some other segments or something like that Then there's the propensity to buy So who are the customers that are willing to buy, that have the money, that are willing to spend the money
if you just give them the right idea what they want Also has something to do with advertising Profitability of course Is it worthwhile catering for certain customers or are these customers demanding extremely high quality products for very cheap prices
So your revenues are actually marginal and eaten away by the taxes anyway So you might not be willing to cater for these customers anymore Very central thing, fraud detection
Think about a credit card company or something like that that needs to find out whether the transactions taken on a credit card are possible Figure for example where the card was used and you find out that the card was used in three hours in three different countries
Maybe that is possible It's very improbable So with these things and typical data mining algorithm you can detect fraud or at least you can generate candidates for fraud detection
that you then can look over and find out whether this really was fraud Same thing, customer attrition So are the customers leaving Are things slowing down And the channel optimization
So how do you get the product to the customer Are you addressing the segments through the right channels So probably emails is not a very good way of advertising anymore because everybody has spam filters these days Maybe making some postal things would be much nicer
but you can't afford it so much and so on So find out what your customers really want Customer segmentation as I just said This is basically about the market segments So are they young people, are they older people
Does your product cater for them all? You might need some flashy products for the younger people and some products that are very reliable for the older people or something like that. Also the personalization of customer relationships, which is usually referred to as customer relationship management,
or you will all have heard of CRM tools that are de rigueur in industry today, which just means customer relationship management. So follow up on purchases, find out whether the customer is satisfied with the product
Maybe you can do something to make him even more satisfied You can handle warranty claims and stuff like that over customer relationship management Very important part of these days And of course via customer relationship management and data mining you can find out whether somebody always uses the warranty
or hardly ever uses the warranty and then in fulfilling the claims if somebody hardly ever uses the warranty well why would you care, just fulfill the claim but if somebody always uses the warranty
and has returned some product five times already there might be something wrong So this is what you find out via customer relationship management Second thing, propensity to buy Which customers are most likely to respond to a promotion How can I target certain customer groups
especially campaign probabilities of your planning, advertising campaigns Where should I put television commercials If I have certain customer groups it might be more interesting to put my advertisement in the evening program If I want to cater for kids and they should go to their parents nagging about what they want for Christmas
I want this new product from Tito's wonderful company Then put it during Spongebob or whatever So the children's television program Profitability What is the lifetime profitability of my customer
Does it pay off to send him or her a Christmas card Are they going to buy some car in due course or did they just buy a car and I couldn't be interested less the next five years they will definitely buy no new car or something like that That's kind of interesting to follow up on customers
and find out about the overall profitability So you can really focus on the profitable customers It's very often done in banks these days So you have these key account management where really if some customers are especially profitable
you open up a key account for them and have some people that exactly cater for this customer. Very important in today's business and in today's industry. Fraud detection, as I said: what transactions are likely to be fraudulent? So credit card companies use a lot of that
and have very clever ways these days to find out what is actually fraud and what is a normal use of a card Has also something to do with a little bit of What was I going to say
Outlier analysis and profiling So find out what somebody normally does and then look for the outliers that are not typical for his or her profile These outliers could be fraud You hear the messages on the news every five minutes
Be it the Nigeria scam or things like here 400,000 pound insurance scam that can be found out and the good thing about data mining algorithms or business intelligence algorithms and that is that they work automatically
So you can run them overnight and react very quickly to what's happening sometimes while it's happening but at least immediately after it has happened and that could cut your losses and could make things easily addressable
also for police and stuff Customer attrition Channel optimization So what customers should I take care for because they are bound to leave or they are kind of like behaving in a way that seems they are considering other alternatives
to my product or to my services and the idea is to prevent the loss of high value customers and if somebody is not profitable anyway well your competition is absolutely welcome to those customers of course Channel optimization
whether you do TV advertisements, email spamming or like fanciful banners in the city center or whatever you do it has to fit to the customer, it has to fit the product and this is what you find out via business intelligence
So what you basically do is you have the data warehouse at the center of your company you have some data sources which is basically productive systems but can also be customer relationship management systems or key account systems
and the ETL process ETL is basically bringing all that data into your data warehouse and from this data you have the business users mainly analysts that work with the data, that work on the data
and the most important point here are dashboards where you can see things that are automatically generated some things are reports like we did with our OLAP queries but more often than not you will find that there are some data mining algorithms that give you some hint what is going on
or what is interesting, what are trends you should look at and what are trends that you can safely forget about because they are not interesting or they are not and finally it goes to the managers or to the decision makers
that will develop the strategies or adopt strategies as shown by the analysts and the interesting thing is here in the future component intelligent systems, so this is where mainly data mining algorithms work on their own directly on the data warehouse and also deliver the data what is normal, what is exciting, what is new
to the user interface so the analysts can really see what is happening The one thing to find out what's happening is automated decision tools
so for example fraud detection mainly happens automatically very often you don't have a human in the loop anymore but the algorithms run and immediately when your credit card seems fishy
you get a letter, your credit card or the account has been closed or has been at least put on ice at the moment did you really do these three transactions that look fishy to us? and then you respond yes I was buying something at Greenland yesterday or something
or you respond no definitely that was not me and then the process is invoked for credit card misuse but all this closing down the account or freezing the account
sending out the letter for the potential fraud victim all that can be done automatic and this is kind of interesting you have just rule based systems that have a certain solution if the case arises
fraud or possible fraud, probable fraud at this point you don't know whether it is fraud or whether it will be perfectly harmless but better safe than sorry so you should really consider freezing the account at a very early stage
before all the money is gone and then the customer says well I wasn't that and you should have detected it so you have to be reliable for the revenue same goes for loan approvals
also that is mainly automatically these days you just hand in what you earn and what you want and what you basically have in terms of securities and then the decision can be made automatically same goes for business performance management so just getting the indicators of how your business is running
is a very important thing mostly today based on so called balanced scorecards which is an economic system that shows you the key performance indicators
of your business and that can be done automatically that can be performed in an automated way directly on the data in your data warehouse and then you get the balanced scorecards and you can see what performance indicators are up and running
and what performance indicator should need a little bit of attention very good one of the possibilities to get this to the people, to the decision makers are so called dashboards or business cockpits
because they look like the dashboard in your car a little bit you may have some meters showing whether you are in the green area or whether you are in the red area with your revenues or what you gained or some regression analysis
showing whether the trend is upwards or downwards or stagnation at the moment and you can see all these pictures and derive the information and the knowledge from it that you might need to find out what is happening in your business well, so how to do that?
That's the kind of interesting thing How do we get all this wonderful data, all this wonderful information that we want to visualize in our dashboard or in our business cockpit? The answer of course is data mining also known as knowledge discovery in databases
there was an early term for that the life sciences then thought knowledge discovery in databases sounds a little bit too bizarre let's just call it data mining like you go into the mine and you pick out all the data all the information that you need from your data mountain
kind of interesting and this is exactly what it is you find the interesting information or patterns that occur in your database given that the database is so large that you can't do it by hand
that you just don't see it I mean if I can look through my account books it's very easy to find out whether I've been profitable or not if I'm a multinational company with thousands of holdings it's not so easy anymore
and you have to look very closely what you need for having interesting information is that it is non-trivial it's obvious that when you earn a lot of money that things are good I don't need somebody to tell me that
but maybe somebody should tell me about a certain segment of customers being more and more dissatisfied with my services because that is not trivial it's mostly implicit so I mean it's not just a single number but it's information that puts a lot of numbers together
that are in some factual connection with each other and it should of course be previously unknown because I don't like a system telling me what I already know more of the same I could have McKinsey for that but that is basically what I mean when I'm talking about interesting information
and of course there are other ways to get to that information so for example there's deductive query processing, expert systems, statistical programs that also run we have a different lecture this semester
which is about expert systems actually it's a knowledge-based systems lecture and there are a lot of techniques mostly rule-based or I don't know there are a lot of logical ways of addressing information of mining out interesting information
but that's not what we want we want purely statistical algorithms running over the data and then we want to find out what's interesting in our data the applications of data mining are then on one hand database analysis so find out what is in the data
and of course decision support find out what to do when something happens in the data and as in the case of business intelligence these are all applications that are valuable there are a lot of more applications especially in the data
in the life sciences there's a lot of data around they have to find some characteristics or some patterns in this data they use a lot of data mining other things go for text mining so there are a lot of products at the moment exploiting blog posts or finding out how people think about a certain product
by looking at posts on the internet or email analysis or whatever you see a lot of new services at Google every five minutes to find something out that you didn't previously know
so these are typical data mining applications. The one that we are focusing on is mainly market analysis: you want to know how the market is segmented, how your customers fit into the profiles you address with your products
you want to find purchasing patterns is it worthwhile to close down the shop over the weekend because nobody buys something over the weekend especially your product or whatever
do you just want to get into the Christmas business and forget about the rest of the year cross market analysis so what happens between different products how they are related so maybe up selling, down selling, cross selling
would be a good idea if a customer buys something maybe he wants the extension or something that fits very well to this product that would be a nice thing to have also summary information so again the topic is reporting
I want to know about what's happening in my business I want to have the reports and that of course includes all the statistics second thing is corporate analysis and risk management finance planning what is my company worth
what is the cash flow that's happening at the moment can I predict some interesting trends are the sales going up or down is some product running out or is some product rather getting hip at the moment also resource planning so what do I have to put into resources
do I need more of something or should I slow down production a little bit because the market doesn't take it at the moment also very good to know what your competitors are doing so how's their market
how's their segmentation how's their revenues and their sales that's kind of interesting pricing strategies of course what is the stuff worth that I produce which is not only dependent on what I pay for it but rather dependent on what the market is willing to give for it
there might be high quality product that the market is just not taking if they are too expensive and there might be some which happens very often actually some amazingly cheap product that the market is prepared to pay everything for so why not take it then
that's basically what you do the architecture of data mining systems is basically as I just showed you for the business intelligence you have the data warehouse that is basically built from the productive systems and resident data in the data warehouse
and on top of that you put a data mining engine that is the main component that runs all the algorithms for data mining that you are interested in finds the pattern and then you need something to evaluate the pattern very often this is rule based very often this is based on some constraints
that you can fiddle with which is part of knowledge base and then you have a graphical user interface to get the information across or build nice charts that you can show to your boss and say well here is a trend channel that is sloping down and we don't want that do we
we have to do something good the major parts in data mining that we will cover in these lectures are association we will do that today association means that
things may be correlated with each other because they are causal so if one thing depends on some other product that is bought or some other event that has happened then those should be considered together
obviously but how do I know which things are dependent on each other that is association mining second thing classification and prediction how do the segments fall apart
how can I predict which segment is stronger or is getting weaker it is very important to find models that kind of allow me to predict a lot of what is going on in my business and there are a lot of ways to present
all that information, so decision trees, classification rules, we will go into some of them during the next lectures and of course we will also look a little bit into the future with prediction algorithms and find out what will happen next. Cluster analysis is a very big
topic in data mining so you might have different classes and you have to label them you have to find out how to group your products or your services together that you can manage them more efficiently
for example produce similar product together or outsource a certain kind of service to some subcontractor or whatever same goes for the advertisement if I know what cluster of people are buying a certain product I can directly advertise and maybe if I want to address people
that are rich I don't go into the slums or the poor parts of the city and distribute leaflets, but I will just go to the hip suburbs where all these beautiful people live and try my advertising there
Then already talked about outlier mostly fraud detection and stuff like that something happens that is out of the normal maybe a black friday some market crash or something that is definitely out of the normal better detected early maybe some fraudulent use of a credit card
again better detected early this is rather useful if you have rare events So today's topic is association rule mining as I said associations between objects and the idea is
that whether objects or products or customers or whatever it is that you are looking at occur together there has to be some hidden relationship between them Why are exactly these five people buying that product?
What common idea do they have? And the classical application for that is so called market basket analysis very often done by the big vendors the big grocers so for example walmart
was one of the very big contenders of market basket analysis for what reason actually so what it actually does is you have a supermarket and you have your market basket you know like this little basket and you put in stuff and you buy that stuff
and then you find out for example some association rule: everybody who buys cheese also buys wine, or it's very probable that if somebody buys cheese he will also buy wine. It's obvious where the rule comes from, because very many people like to have wine and cheese
like a dessert or something like that and you will have a certain support for the rules so for example 10% of the customers buy cheese and wine together and 80% who buy cheese will also buy wine
this is a rule that is intuitively clear interesting thing why should I know? being walmart
special advertising my beer and wine no my cheese and wine special buy the things together and get 15% off or something like that other reasons?
silly as it sounds that's one of the major reasons planning where to put your products if I know there's a relationship between two products
it's much more probable that I get customers to buy exactly these two products or to look at promotions that I'm doing on these products if I put them together very close to each other because then people will think well here I bought the wine oh there's cheese wonderful that goes very well together
also grab some cheese and this is one of the reasons I mean with cheese and wine it's kind of a classical example that is very obvious but for example walmart has found out that nappies so pampers and stuff like that are very often bought together with
exactly with beer six-packs it's not obvious at all well if you think about it it's kind of like it's the daddies that go Saturday afternoon shopping
and then they have toddlers at home so they can't go out for a beer they have to care for their toddlers they have to stay home with the wife so I need a lot of beer well put good cold beer next to the pampers and you're fine because people will buy it so this is kind of like just getting things more efficient
let's go into the algorithms for association rule mining: how do you find out, given a large set of data, how do you find out efficiently and correctly what are the associations that are valid for your data. We have two basic components
one the items that I'm selling the products everything that is in my supermarket is a product so I1 is a product and I2 is a product and so on and two
what is bought and I consider this as a market basket as a transaction so everybody takes his market basket or his little shopping carts puts everything on the till and all the stuff that is bought together
by a single customer is one such transaction the data is very easy to get because all the supermarkets now have electronic cash registers and scanners for everything that is bought
so at the end of the day you know exactly that there was a customer. You don't care about the identity of the customer, and as long as the customer does not use Payback cards or Deutschland cards
or how it's called you know like these profiling tools you know that some customer whoever he may be bought these items together today an association rule is an implication where you say
somebody who bought these items also bought these items and they should be different I mean I don't want tautologies somebody who bought wine also bought wine haha that's not very novel but typical things are somebody who bought wine also bought cheese
somebody who bought pampers also bought beer this is what we are trying to find out and if we take the items sold in the store I1 might be beef I2 chicken I3 cheese and so on
everything that you sell and then you take you look into the basket of people at the cash register and say well there was one guy who bought beef, chicken and milk together and there was a second guy who bought beef and cheese and the third one bought cheese and wine and so on ok?
that's the basic idea behind it if somebody buys beef and chicken then it's very probable that he also will buy milk rules of that type allow me to put the fridge with the milk next to the beef and chicken
good of course I could make thousands of such rules because people will buy anything they need and I might come up with oh this guy bought shoe polish or something like that
bought wine so wine and shoe polish do have something in common there must be a hidden connection well the hidden connection might have been that he just needed both things which can always happen so the rules can be rather weak like the shoe polish and wine example or they can be strong
and what makes them strong? basically that it occurs very often that people buy these things and if I consider something like shoe polish people will buy that when they need it when they need a new can doesn't really matter what they bought together
it's just that every two months you will buy a box of shoeshine and that's about it there are two basic measures for the strength of a rule which is the support
and the confidence in the rule that you have the support deals with the data how strong is the data? and the confidence deals with the semantics support basically means does it happen very often
that people buy shoe polish or that people buy wine or is it just a one-off I have this one guy I've never sold shoe polish in my shop and there comes one guy and he buys shoe polish and a pineapple wow, there's a new rule
no, obviously there's no new rule it has to happen a certain amount of times that people buy shoe polish until I can state some sensible information about what is often bought together with shoe polish okay? that is basically the support so the support of a rule is the percentage
of transactions that contain both parts of the rule okay? then I can say they happen together basically this is the probability that all these things occur in a single transaction
how often does it happen? well the support for some rule is just you take all the items in X and in Y and you count the number of transactions
where all these items in X and Y are bought together and you divide it by the number of transactions that you did during the whole day or that you did in your shop okay? then you find out how often do people buy this product
normalized by how often people buy in your shop anyway and you find out whether something is interesting or not the confidence of some rule deals with the semantics of X is somehow the cause of buying Y
so if something, if somebody buys cheese very probable that he also buys wine how about the other way around? somebody buys wine
is he also bound to buy cheese? well it could differ, couldn't it? they could be bought together very often but one might be the cause for the other so if I buy wine, cheese would go very well with it but there might be a lot of anti-alcoholics
that don't like wine at all but rather like cheese so they will buy cheese and not buy wine so the rule seems to work rather the way somebody who buys wine also is bound to buy cheese then somebody who buys cheese also is bound to buy wine
how do I find out? I count the occurrences where wine and cheese have been bought together and look how many times one of the products
has been bought individually okay? so if somebody buys if I wanted to state the rule if somebody buys cheese he is bound to buy wine I count the number of times wine and cheese were bought together
and divide them by the number that cheese was bought with or without wine then I found out how often is it really that they are both bought together or that one of the things is bought individually okay? so the confidence of the rule is basically
take the number of times they are bought together, so basically the support, and divide it by the number of times that the cause has been bought at all yes? clear?
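Just to make those two measures concrete, here is a minimal Python sketch, not from the lecture and with purely illustrative item names, that counts support and confidence for a rule X => Y over a list of transactions:

# Minimal sketch: support and confidence of a rule X => Y,
# where each transaction is a set of items.

def support(X, Y, transactions):
    # fraction of transactions containing every item of X and Y together
    both = sum(1 for t in transactions if X | Y <= t)
    return both / len(transactions)

def confidence(X, Y, transactions):
    # among the transactions containing the body X,
    # the fraction that also contains the head Y
    body = sum(1 for t in transactions if X <= t)
    both = sum(1 for t in transactions if X | Y <= t)
    return both / body if body else 0.0

# Illustrative toy data, not the lecture's numbers:
transactions = [{"cheese", "wine"}, {"cheese", "bread"},
                {"wine", "cheese"}, {"shoe polish", "pineapple"}]
print(support({"cheese"}, {"wine"}, transactions))     # 0.5
print(confidence({"cheese"}, {"wine"}, transactions))  # roughly 0.67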
good well, the rationale behind support and confidence is basically that if the support is too low we have no way of saying this is a typical rule this really has statistical relevance but it may be just like our shoeshine and pineapple idea
you know that happened once and nobody ever bought shoeshine again then this is not a rule, this is stupid so low support should be avoided same goes for low confidence if the confidence is low
and people buy anything with cheese then putting the wine next to the cheese is as random as putting shoeshine next to the cheese it doesn't help you because you don't have the confidence in your rule obviously
the association rule mining as an algorithm now has to discover all associations in the number of transactions that you have with a minimum support and a minimum confidence ok? so you set some values and say well I'm only interested in products that make 10% of my business
I'm not considering shoeshine get me to the big players but I only want rules that are reasonably believable maybe have a confidence over 80% or something
well that's what we do so let's try it I have my number of transactions here 7 customers came in today the first one bought beef, chicken and milk second one bought beef and cheese third one bought cheese and boots
and now you see it's not too easy to see all the associations in these transactions is it? because it could be possible that somebody who buys beef always also buys chicken and milk one possible rule
it could also be possible that somebody who buys beef and chicken always buys milk how about somebody who buys beef and milk always buys chicken there are lots of possibilities and I could put all my items that were sold together in any arbitrary order
that is an exponential explosion of possible rules so what do I do? I calculate the support and the confidence for every one of these combinations madness
I have 5 things here, cheese, boots, beef, chicken and milk, and with clothes 6 things here. Walmart has 20,000 in a single supermarket. Try building all the rules out of 20,000 objects
madness so let's consider we have a minimum support and a minimum confidence we are only interested in things that we sell more than 30% of our sales
and we are only interested in rules that have a minimum confidence of 80% so they are rather believable. So one thing would be a rule I just invented: whoever buys chicken and clothes also buys milk. How often does that occur? well how often do chicken, clothes and milk
occur together in my transactions? once, twice, three times. This is only chicken, this is only milk, this is chicken and clothes, and that's it, so these do not count
because they don't contain all the objects that I'm interested in, all the items, but these three do count and I have 3 out of 7 transactions dealing with it. 3 out of 7 is about 43%
which is definitely higher than my minimum support of 30% ok? so this obviously beats the minimum support. What is my confidence? well the rule states that whoever bought chicken and clothes will also buy milk so let's find out
who buys chicken and clothes: chicken and clothes here, here, here, and that's it ok? in how many of these cases
where somebody bought chicken and clothes did he also buy milk? 1, 2, 3, so 3 of 3 ok? this is 100% confidence. it never happened
that somebody bought chicken and clothes without buying milk ok? so the rule seems to be causal. it is not, because I invented it, but still it has a confidence of 100% and a support of about 43%
for my little supermarket here ok? everybody clear? good. now we could have all kinds of rules: somebody who buys clothes also buys milk and chicken, somebody who buys milk also buys clothes and chicken, blah blah blah, I can make thousands of these rules. how do I find out efficiently
what association rules are above the minimum support and what association rules beat the minimum confidence any way somebody can think of?
now if you had the idea you would have invented one of the most prestigious algorithms in data mining a very valid algorithm we will see the algorithm in a minute and it's actually quite simple once you have seen it
actually very believable of course this kind of association rule mining is a rather simplistic view of the shopping basket so we are not considering the quantity in which things are combined or what price is paid
so we would be affected by some special offers where you say ok this is amazingly cheap today so it will be bought with everything this week or something like that there are lots of problems with the basic thing but once you define your transactions and your items in the right way
you will find that it is definitely valid and it doesn't matter how you mine out these rules you can always compute the minimum confidence and the minimum support so it doesn't matter what algorithms you use
the rules are the same so there is a certain set of rules that is above minimum support and minimum confidence. Yeah, of the major algorithms we will look at these two today:
the Apriori algorithm and mining with multiple minimum supports, and there are also so-called class association rules, which we will go into very briefly in the detour at the end. But the best known algorithm of all definitely is the Apriori algorithm
a very efficient way of getting things done properly it basically consists of two steps the first one is so called frequent item set mining so I have to find out all the item sets
that are above the minimum support and the second step is generating candidate rules that have a chance of being above the confidence of being above the minimum confidence
from these frequent item sets and these are the candidate rules that will be generated and only these we will not look at all rules that are possible and still get the correct result and that is a rule and the basic idea is that when somebody has a frequent item
that is above the minimum support all the items contained in the set will have to be frequent themselves so if I have a minimum support of
I don't know 10% or something then I can say if milk, cheese and bread is a frequent item set then all of these three items individually
have to occur at least in the number of times they occur together on top of that they can occur individually but their support is definitely higher or equal
to the minimum support of the bigger set OK? Understood? So this is what is called a downward closure property any subset of a frequent item set is also frequent
a frequent item set. If I take chicken, clothes, milk, and I have chicken, clothes, milk here, here and here, then all parts of it, like chicken and clothes, chicken and milk, clothes and milk, occur at least in these three instances and can
occur some more. Okay? So this occurs three times. What about the others? How
about chicken and milk? Chicken, milk; chicken, milk; chicken, milk; chicken, milk. It occurs in those three instances but even occurs a fourth time without the clothes. But it
definitely occurs at least as many times as the big set. Okay? And this property is amazingly useful because it allows us to steer the candidate
generation from frequent item sets of low cardinality to frequent item sets of high cardinality. And that means we won't have to try all the different rules
that are possible but only those that definitely have a frequent item set that works. So how do we find frequent items? We start by looking at one element sets. Okay? Once we have found all one element sets, how can two
element sets be built? Well basically by putting together one element sets that are frequent. Because if part of it would be not frequent, the downward closure property wouldn't hold. Okay? So the bigger set cannot be frequent.
That's the basic idea. For each iteration, I look at the generated candidates of the last iteration and put them together such that one more
new item gets into the set. That means I go from a k-1 frequent set to a k item frequent set. Good? And as a brief optimization, I will assume that the items are sorted in a
lexicographic order. So I have a topological sorting of the items such that I can find out what, well, very easily find out what is the intersection of item sets. Because if they would be kind of like all twisted
round, I will always have to look when comparing two item sets what is contained in both of them. If I order them alphabetically or lexicographically, I will just have to look at the beginning and once they start diverging
then they cannot have a large intersection anymore. Good! Let's start with finding all the item sets of size 1, which basically is all products that are sold more often than the minimum support. Easy! Just go through your
transaction list and find them. Then I need, the next step, I need to build two-dimensional, three-dimensional, four-dimensional, five-dimensional frequent item sets. How do I do that? Well, I take the one-element item sets, put them
together to two-element item sets. This is a candidate. I still have to check in the data whether this really is a two-element item set. Because, consider,
I have an item I1 and an item I2, and say I1 occurs in two transactions and I2 occurs in two other transactions. Okay, and these are my transactions. So, it doesn't really matter what they were bought together with, you
know, like could have been bought with all different things. Consider now I have a minimum support of two. And both I1 and I2 are frequent items. But is I1 and I2 a frequent item set? In my little example here. Is it? Exactly! It's not.
Both items individually fulfill the minimum support, but the intersection in the transactions is empty. So, they are never bought together. Good! So, we find
the candidates that are possible and now we have to see which of them are actually valid. How do we find the candidates? This is the join step. We put together all the sets of cardinality k minus 1 such that one new
item enters the set. This is a very efficient thing to do because we have ordered the things lexicographically. So, they have to have the same head and in the end one single item should be differing. Because if I take those two
sets and put them together, then I get a set of cardinality k. Ok, the head has to be the same. The last element may differ. Putting these two together
results in, of course, the same head. But then two different items here. Again, sorted lexicographically and the cardinality of ik is k. Ok? Everybody sees that? Very
efficient to do. And that is important for this algorithm, of course. Good! Now I have a candidate. I have to look whether this candidate is valid. Just go through the data and look if it fulfills the minimum support. Ok? So, all the candidates that do not
respect the downward closure property get pruned. What does it mean to respect it? That all of the candidate's subsets are frequent item sets. Do I know the subsets? Well, yes. Because in the last
step I calculated the k-1 frequent sets. If there is a k-1 subset of any of my candidates that have k products which is not among those frequent k-1 sets, then I can immediately prune it. I can
immediately cut it out. Because then the downward closure property would be violated. Clear? Let's consider an example. It makes it easier to see. So, assume we have worked our way up to
three item sets. And these item sets are 123, 124, 134, 135 and 234. And now I want to put together four item sets. So, I want to go from F3 to F4. I have to join the candidates. How do I join candidates?
Well, I look at the first items and those with the same first items can be joined together because having different items in the last place would create a
new item set with four entries. Okay? So, what I have to do, I start here and look
what tuples could this one be joined with? This is 1, 2. This is 1, 2. So, definitely here's a join. This is 1, 2. This is 1, 3. No join. 1, 3. 2, 3. No join. Okay. So, those cannot be joined with it. Putting the first two together results in a four-tuple set. Clear? Good. Let's go to the next.
1, 2, 4. 1, 2. No, no, no. Doesn't create something new. Next, 1, 3. Oh yes. 1, 3 here. 1, 3 here.
So, this creates 1, 3, 4, 5. Okay? The next one, 1, 3, 5, doesn't create anything new, it was already used in that join. And 2, 3, 4 cannot be joined with anything because there's no second set starting with 2, 3. Okay? So, I get two
new candidates from my five 3-item sets. Right? And those are the only candidates that could be frequent item sets. There could be no frequent item set, I don't know, having a 6 in it or
something like that because 6 is not a frequent item. Good? Well, now I have to see whether those actually do the trick. Well, how can I prove they do the trick? I look at all the
possibilities to get three numbers out of them. So, I could have 1, 2, 3. 1, 2, 4. 1, 3, 4. 2, 3, 4.
These are all three element sets that are part of the four element sets. Right? And I have them here. Do they exist? Are they frequent items? Well, let's look about it. 1, 2, 3. 1, 2, 4. 1, 3, 4. 2, 3, 4.
We have them here. And for your convenience, I put them here. Okay. 1, 2, 3. Got it. 1, 2, 4. Got it. 1, 3, 4. Got it. 2, 3, 4. Got it. All the subsets of my bigger frequent item set are itself frequent item sets.
The downward closure property holds, so this candidate survives the pruning. Let's look at our second candidate. 1, 3, 4. 1, 3, 5. 1, 4, 5. 3, 4, 5. Those are all its three-element subsets. Look at it. 1, 3, 4 is there. 1, 3, 5 is there.
The other two, 1, 4, 5 and 3, 4, 5, are not there. They are not frequent item sets. The downward closure property is violated.
Prune it. It's not a valid candidate. Okay? So the candidate list for 4-item frequent item sets contains just a single candidate, 1, 2, 3, 4.
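As a small aside, the join and prune step we just walked through fits into a few lines of Python. This is only a sketch of the idea, assuming item sets are kept as lexicographically sorted tuples; it is not the lecture's own code:

from itertools import combinations

def generate_candidates(frequent_k_minus_1):
    """Join frequent (k-1)-item sets that share the same prefix,
    then prune candidates that have an infrequent (k-1)-subset."""
    prev = sorted(frequent_k_minus_1)      # item sets as sorted tuples
    prev_set = set(prev)
    candidates = []
    for i in range(len(prev)):
        for j in range(i + 1, len(prev)):
            a, b = prev[i], prev[j]
            # join step: same head (all but the last item), last item differs
            if a[:-1] == b[:-1] and a[-1] < b[-1]:
                cand = a + (b[-1],)
                # prune step: every (k-1)-subset must itself be frequent
                if all(sub in prev_set for sub in combinations(cand, len(cand) - 1)):
                    candidates.append(cand)
    return candidates

# the five frequent 3-item sets from the example
F3 = [(1, 2, 3), (1, 2, 4), (1, 3, 4), (1, 3, 5), (2, 3, 4)]
print(generate_candidates(F3))   # [(1, 2, 3, 4)], the candidate (1, 3, 4, 5) gets pruned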
Of course, if we do that, we have to scan through the list time and again. Let's look at an example with a minimum support of 0.5. We have four transactions here, which means I need two occurrences to make 50%. Okay?
Well, let's look at the individual items. Here's item 1. Here's item 1. So I note two occurrences, which makes the minimum support of 50%. Item 2. Here's item 2. Here's item 2. Here's item 2. So I note 3 for item 2.
Minimum support of 50% done. Item 3. Here's item 3. Here's item 3. Here's item 3.
Note 3. Now it's kind of getting bizarre. Item 4. Oh, only one occurrence. Good. Item 5. 1, 2, 3. Okay? Now, these are the candidates which actually have the minimum support of 50%.
Everything that occurs two times or more. So the 4 is cut out. Clear? Good.
Now, I have to join them together. Well, joining one-element sets to two-element sets is very easy because they all fit together, they all have the same (empty) beginning: 1, 2. 1, 3. 1, 5. 2, 3. 2, 5. 3, 5.
The 4 does not occur anymore because it in itself is not a frequent item set. I'm actually closing down the space here, the search space, still getting the correct result. Makes the algorithm efficient. Good. Now I have to look at what actually happens.
Well, in this case, everything belongs to F1, so there's nothing pruned yet, because all these things are frequent item sets. I don't have the 4 in it. And now I have to scan through the things, which of these candidates actually have a minimum support
of 50%. It's all possible that they have a minimum support of 50%, but where does it actually occur? So let's look at 1, 2. 1, 2 happens once. OK. Note a 1. 1, 3. Here's 1, 3. Here's 1, 3. Happens twice.
1, 5. Here's 1, 5. Happens only once. OK. You see how it's working. Calculate the minimum support. You need 2 or more.
1, 5. Definitely not in the game. 1, 2. Definitely not in the game. All the others are frequent item sets of size 2. Now, how do we join them to item sets of size 3? Well, can I join something with 1? No, I don't have 1 anymore.
OK. Can I join anything with a 2? Yes. Here's a second 2. Join them. 2, 3, 5. OK. Can I join anything with a 3? No, there's no 3 to join. Good.
Only single candidate for 3 items, frequent item sets that I have to look through. OK. How about the subsets? 2, 3, 5. 2, 3 is here. 3, 5. Oh, yeah. 2, 3 is here.
3, 5 is here. 2, 3, 3, 5. 2, 5 is here. OK. Or there. No need to prune it. Is it really supporting? Well, let's look about it.
We have 2, 3, 5. It's a valid candidate, as we said. 2, 3, 5 happens here. Happens here. That's it. Two occurrences. Support of 50% holds. Definitely is in the set.
Can we join that with something? Well, no. We don't have anything anymore. I considered 5 one-element item sets. I considered 5 two-element item sets. I considered a single three-element item set.
That's much less as the combination of the 5 items I have here and all the subsets you can gain from that. OK? Very easy, very efficient. Everybody clear about the step 1 of the a priori algorithm?
Good. Then, let's make it a little bit more difficult. No, not at all. Let's go to step 2. In step 2, we will generate rules from these frequent item sets and evaluate the confidence that we have.
Well, the frequent item sets have to be distributed because a frequent item set is kind of 1, 2, 3. An association rule looks like who bought 1 and 2 also bought 3.
So there are several ways to split the items in the frequent item set to gain association rules. So what we basically do is we distribute the items in the frequent item set
such that we need a support. How often do the items occur together divided by how many transactions do I have? We need a confidence, which is basically how often do the items occur together
divided by how often does the body of the rule occur on its own because if it occurs on its own, it's not the cause for y. And this is what our confidence wants to measure.
Basically, this is support y by support x. This is the confidence that we want to calculate. So what can we do? We have frequent item set 2, 3, 5 and it has a support of 50% as we calculated. What does it have in terms of subsets?
We have 2, 3, 2, 5, 3, 5, 2, 3, 5. All of those could be the body of the rule and the head of the rule would follow from what is missing. If the body of the rule is 2, 3, then of course the 5 is missing
would be put in the head of the rule. Good.
We can calculate the supports for all the different things like we did in the last step. So for example, 2, 3 has a support of 50%, 2, 5 has a support of 75% and so on. And now we can generate all the rules that are possible.
2, 3 makes 5, 2, 5 makes 3, 3, 5 makes 2, 2 makes 3 and 5, 3 makes 2 and 5 and 5 makes 3 and 2. And now we need to know the confidence. The confidence is basically the occurrence of all items divided by the occurrence of the items that make the body of the rule.
Easy to calculate. So for example, how often are 2, 3 and 5 bought together? 2, 3 and 5, 2, 3 and 5, 2 times. How often are 2 and 3 bought together?
Also 2 times. Meaning we have confidence of 100% that this rule is correct. 2 and 3 are never bought unless also 5 is bought.
Might be a cause. Very high confidence. 100%. Do this for all the others and we find we have actually two rules with 100% and all the other rules have a confidence of 2, 3 which for our toy example here is very high.
Usually you won't get so good rules in an association of rule mining. The confidence is usually much lower. And the support for all the rules is kind of the same of course because it's the same set. We're generated from this single set.
Clear? Easy to calculate. You make that for all the different sets that you have, all the frequent item sets, you get your association rules only considering a minimal number of candidates.
Good? Break? Okay, let's reconvene at 5 past. So, let's cut to the chase.
To summarize the algorithm, we want association rules of that type and we need to know the support for the body of the rule
and the support for the whole frequent item set. And of course we already got these values when calculating the frequent item set. So testing the confidence is not a problem
and the support we know anyway. For every step of the algorithm, for every extension of the frequent item sets, we just made one pass through the data.
So if our largest item set is of size k, we pass over the data only k times. Given that there's exponentially many association rules, that's quite efficient.
It can be done in linear time. It's kind of nice. The mining that we're using exploits the sparsums of data and high minimum support and minimum confidence thresholds makes it especially rare because the frequent item sets will break down nicely
and the candidate generation can be restricted to only a few ones. On the other hand, it's kind of interesting sometimes to focus on rare items. So what about the shoe polish? I will never get any rules with shoe polish because it's not bought too often in the supermarket,
as least when compared to milk or to yogurt or whatever it is that people buy every day in the supermarket. They buy shoe polish once a month or something. That's different. So all my rules will be considering milk and yogurt and bread and whatever
and no rule will consider shoe polish, shoe shine because it's just a rare item. What can we do about that? That is a variant of the ARPURA algorithm. So basically it's mining with multiple minimal supports.
So you say, well, the minimum support is not bound to the set of items in my shop. I don't have a global minimum support but for every product I have a special minimum support that reflects how often this product as a whole is bought.
Good. Sounds easy, poses some problems. Well, the basic point is that it really introduces the rare items into the association rules. If you have, for example, cooking pans or frying pans or something like that
and you are sure that they are bought much less frequently in your supermarket than bread or milk, just lower the minimum support for these products and they have a chance of getting into the frequent item sets and then you're done.
This is exactly the rare item problem. If the minimum support is set to high, you will never find rules involving this. If you need a rule that involves both frequent and rare items and you use a global minimum support,
then you would have to set it very low, generating a lot of rules about bread, sugar, milk, blah, you know, like everything because everything is over that minimum support basically. It doesn't help you either. A global support cannot do the trick. You need individual support. That is what I'm stating here.
If you have a minimal support for each item, then you need to find out what the rules are. Making a rule of shoeshine and bread is not very reasonable because one is a frequently bought item that will very often occur without the shoeshine.
The confidence of rules like that is definitely very low. You don't want them. How about things that are also rare?
Well, that could work. So maybe people buy shoeshine with laces or with shoes, yes, right. Very rarely, but the confidence could be very high. So what you basically do, you restrict the frequent item sets to those where objects
have not too far diverging minimal supports. You don't want bread and shoeshine, but you might want shoeshine and shoes. Or you might want shoeshine and caviar.
Both things are very rarely bought. They could have something to do with each other. Well, probably don't, but still. Both rare items and this is what you do. You basically look at the maximum support for any item in your set
and you look at the minimum support for any item in your set and if they diverge too much, if you have one that has a minimum support of 50% and one that has a minimum support of 10%, just don't consider it. It's not a sensible rule.
Good. This is basically what you do. Once you have chosen your frequent item sets, every item in the frequent item set may have a different minimum support. What do you do for the rules that you generate?
The basic idea is take the minimum of support values that you have and that is the support value for the whole rule. So for example, if you have the user-specified minimum values for bread, shoes and clothes, 2%, 0.1%, 0.2%
and we consider it works, you know, like 2% is not too far from 0.1% or 0.2%, it doesn't diverge too much, then I could have a rule, everybody who buys clothes also will buy bread,
has support of 0.15 and a confidence of 70%, but how about the minimum support? Clothes and bread have 0.2% and 2%. This is too low.
This rule doesn't get the minimum support value. On the other hand, if you have clothes and shoes taken from the same frequent item set, so the support and the confidence are the same, but this time the 0.15 work because it's larger than 0.1.
Clear? Depending on what items are in your rule, you define the minimum support for the rule as the minimum minimum support of all the items in the rule. Good. The problem with multiple minimal supports
is that the downward closure property breaks. And why is this? Consider we have four items in the database, one, two, three, four, and have different minimum supports, 10%, 20%, 5%, 6%.
Then we could again join them, so for example one and two has a support of 9%, the minimum of 10% and 20% is 10%, so this is not a frequent item anymore,
it doesn't cut the mustard. What is with one, two, three? Well, if we take one, two, three, we get 10%, 20%, and 5%.
Now the minimal support for this set is 5%, not 10% as was the case here. So this set has a minimal support of 10%, this has a min-sub of 5%.
Okay? What I know from the downward closure property is that the frequency of one, two, three
definitely is smaller or equal than the frequency of one, two. But now that I also have a smaller min-support, it could be sufficient. And if we wouldn't have created the one, two,
we would never have thought of the one, two, three with our a priori algorithm. Okay? We have to work on that. That's a little bit difficult. Anyway, basically we will this time sort all the items again,
but not according to their lexicographic order, but according to their minimal support values. So we have total order of items. The smallest minimal support starts and then the minimal support grows.
What is the idea behind that? If we do so, and then take the beginnings of the list of items and frequent item sets, the minimum will be the same. Okay? It can only happen with one item that is taken out
that will change it. The minimal support algorithm is a straightforward extension of the a priori algorithm. Again, we have as a step one frequent item set generation. Only that the candidates are now generated
with respect to the new multiple minimal supports. And we need a slightly different pruning step because we still have to consider some of the sets that don't make the minimal support.
But maybe they could be useful in some other sets, in some bigger sets where they will meet the minimal support. And step two, the rule generation works exactly as in the a priori algorithm. So what do we do?
For the frequent item set generation, we take the set and we take the minimal supports for each. 10%, 20%, 5%, 6%. Now we order the item such starting with the lowest minimal support
going to the highest. So 5% here, item 3 is the first one, item 4 is the second one, item 1, 10%, and item 2, the 20% comes last. Okay? Just ordered. Nothing happened.
Now I scan the data and count each item. 3 occurs 6 times, 4 occurs 3 times, 1 occurs 9 times, and let 2 occurs 25 times. So this is a very frequent item. Assume we have 100 transactions.
So the support for 2 is already 25%. Support for 3 is 6%, 4 is 3%, 1 is 9%. Good. Having said that, we go through the items and find the first item that needs its minimal support.
And this item is the seed for generating joint partners. Okay? Let's do it. So, we said start with 3 and the minimum support for item 3 is 5%,
but we have 6 occurrences in 100 transactions makes 6%. So, that is valid. Okay? Next one. 4. We have 3 occurrences in 100 transactions.
The minimum support for 4 was 6%. Argh! Doesn't work with 4. Okay. The 1. For the 1, we need 10%. We have 9.
So, it doesn't need its minimal support, but it needs the minimum support of the first item. So, it's a valid partner. It's not in itself a frequent item, but it's a valid partner together with a rare item.
Okay? So, we use it here to build a new frequent item. Okay? We do the same with the 2. Has a minimum support of 20%. Happens 25 times.
Not only beats its own support, but also specifically beats the 5% here. Good. So, these are 3 candidates. 3, 1, and 2. And now, we will find out that the item set 1
really has a support that is lower than itself and should be excluded. So, this is not a candidate for 1 element sets. So, if it's not a candidate for 1 element sets, why do I consider here the minimum support
of the first item that I started with? Well, because the 1 could be interesting for building higher order frequent item set, though in itself it is not a frequent item set. And how would it be interesting?
Well, I could combine it with the 3. It has 9%. The 3 has a minimum support of 5%. Okay? So, it's above the minimum support for the 3 if it's put together with it.
Right? This is why we keep it. Okay. Well, now we have to do 2 element sets. We have found out that item 2 and 3
meet their own minimum supports so they can be mixed with others and at least 1 is kind of like not a good candidate but still a valid candidate and 4 was definitely out of the race.
Okay? It doesn't even meet the lowest standards. Let us assume we don't want any rules where the spread is too far. Okay? Then we take the first evidence and now use the candidate set.
So 1, 2, 3 as elements and not those meeting their actual minimum supports because of the downward closure property. So we know the 4 is definitely out of the game.
Those 2 are good contenders and this one is not a frequent item set item but is a contender for joint. Okay? Still I don't have to join with the 4.
So, what do we do? We have the support of 3 which is obviously larger than the minimum support and we could form level 2 candidates with the 3. Two possibilities, 3, 1, 3, 2.
Okay? 3, 1 is a candidate. The support of 1 is larger than the minimum support and the support of 3 minus the support of 1 is smaller than the 5
because this has a support of 9. This has occurrence of 6. That's quite close together. Okay? Let's look at 3, 2. Well, 25 and 6.
It's very far apart. It differs by 10%, by more than 10%. So, cut it out. Okay? It doesn't have to do anything with the support or the minimum support or stuff like that.
It's only because of the divergence rule. But see that we have created a candidate matching all the minimum support requirements by some item that in itself is not a frequent item set.
Okay? Good. Understood. We do the things. We take the next seed from error. So, 1, 2 is the next probability. The support of 1 is smaller than the minimum support.
So, we cannot use that as a seed anyway. And after the candidate generation is completed, we have only a single candidate, 3, 1. Now we look over the data again, read the transaction list and find out that the support for 3, 1 is 6,
which is larger than the minimum support of 3 and 2. Minimum support of 3 was 5%. Okay? So, this is 5% here. And thus, this is a valid candidate. This is a valid frequent item set. Okay?
That's basically how it works. If we want to have bigger sets, it works exactly like before. But we have to look at this clause, the divergence clause. This is another check. Otherwise, we will do exactly the same thing, which means look at the front.
If the heads of the two lists are identical up to the last thing, these make the new joint partners. Okay? Good. And then we need the prune part.
So, now the point is really that we cannot go to the k-1 subsets to find out whether this is a valid set, because they might have been pruned away due to a higher minimum support. The downward closure property doesn't hold anymore.
Still, if we find all the subsets, everything's good. We're still valid. But it could be that the head of the rule, the first item having the smallest support, is missing in the rule.
That would increase the minimum support because we have ordered the list so that the smallest minimum support is in the front. And that would break it. So this is the exception that we definitely need here. Look at the head item.
If it does include the head item, then it has to be in the candidate list that has been calculated before of the smaller sets. If it does contain the first item in the list,
it has to be in the smaller candidates. And if it does not contain it, it does not have to be in the smaller candidates list. And so this is what we do. So for example, if we have a couple of three element sets here, and we join them,
here's 1-2, here's 1-2, makes 1-2-3-5. Here's 1-3, here's 1-3, makes 1-2-3-5. Here's 1-4, here's 1-4, makes 1-2-5-6. Only joins we can do.
Then, after pruning, we might get 1-2-3-5, 1-3-5-4-5. Let's try to figure out what's in the set. So we need to have all the sets that can be built out of
all the three element sets that can be built out of this four element set. 1-2-3 is there. 1-2-5 is there. 1-3-5, there.
And 2-3-5, 2-3-5, there. Or there, valid candidate. Let's look at the second one. 1-3-4, 1-3-4, let's do it in a different color. 1-3-4 is there.
1-3-4-5 is missing. Does it matter that it's missing? No, it does not matter, because 1-3-4-5 does not include the minimum support item. This is the one that presses down the minimum support.
So we don't need it in the list before. All the other things, this is 1-4-5. 1-4-5 is there. And 1-3-5, 1-3-5 is there.
So they are both there, and thus we have that. Then for the last one, 1-4-5-6. How can we build things out of that?
Let me get rid of my little drawings here. We can have 1-4-5 is there. We could have 4-5-6, doesn't have to be there because the first item is not in the set. We could have 1-5-6,
oops, that is missing, and the first item is included in the list. So the whole thing goes down the drain. Okay? Clear? Similar as before,
very easy. Just, you don't have to find those sets where the first item is not in the list, and that's all. Okay? Well, for the rule generation,
we found that the downward closure property is not valid anymore. That means that we have frequent k order items that contain k-1 non-frequent items.
So we may have built something whose subgroups are not frequent items in itself. And, well, that is a problem because we don't have the support values. We have to compute the support values for those.
It's not a problem, but we have to go over the data again. It's an efficiency problem, not the effectivity. Well, that is called the so-called head item problem, so we might end up with some sets where we have to look at supports
but don't have the support values because the set of smaller cardinality is not a frequent item set in itself. Well, easiest way to deal with that is just if something is missing, go over the data, find out what it is.
Of course, also a very inefficient way to deal with it. There are some other ways to there are some other ways to deal with it. We will come to that in a minute. So, to give you an example, if we have frequent item sets, shoe closes bread,
then we could have the minimum support is the minimum of all the three of them. So if we have bread closes shoes, then 0.1 definitely is the minimum. So this is here. Okay? Now we have, for example, closes bread, has a support of 0.15.
Shoe closes bread has a support of 0.12. And the minimum support holds. But what about closes bread? Well, for closes bread, the minimum here is 0.2.
So the minimum support for this set does not hold anymore. I don't have the confidence for that. All the rules I can build that have the closes and bread I cannot compute.
Okay? They might be true. I just cannot compute them without accessing the data. Good? Clear? Okay. Not too difficult. So this is what I call the
the head item problem. Basically I can calculate the support of some item or the confidence in some rules and in some I cannot do it without reading the data again. well, basically there is a
possible solution that at least heightens the probability without reading the data again. If I take the non all the probabilities where one non-frequent item set
is taken out. So if I take those and take out one non-frequent item set then I will get to this and that was the probability that I didn't record in the first call. But if I record all the probabilities also for those items it's not guaranteed that I get
all the things I need but it's probable. So also calculating them when you run through the data is a good idea. Anyway Good! Advantages of multiple minimal support.
It's very realistic for practical application because bread and shoeshine are sold together but are not the same thing. One is a convenience article that is bought every day. The other is an article that is bought a luxury article basically that is bought once a month or something. You cannot
deal with them with a global threshold. If we use multiple minimal support and consider the divergence we will not end up with all rules you know like people like bread because they like shoeshine or something like that. But we rather have rules of
the type people buy laces because they buy shoeshine because they think at that point that they also need new laces or something like that. So this is kind of like rare item rules and that is what we set out to do.
Basically if we set the support values to 100% we can prevent items to occur in rules anyway. Also a nice idea. I am definitely not interested in anything that has something to do with bread. Well I just ask for a minimum support
for bread of 100%. If there is a single transaction where bread was not bought that is it for the bread. Okay. So I can effectively exclude items from frequent items. Maybe a good idea.
Yeah and that is it for today in terms of algorithms. Yes. This is basically how how many it is kind of like a steering wheel setting the
minimal support for the granularity of rules that you are interested in. If you really only want rules that have something to do with the top products you set raise the bar. If you are interested in all
your products and their relationships you will get thousands of rules use a small minimal support. That is basically it. So it is kind of like it sets the granularity how you look at your data.
How you look at your product. So in the detour we will look at another algorithm the class association rule mining and that will do. Silvio. Okay. So we have seen the classical approaches. We have seen the cases
where we are going to perform some market basket analysis. But this kind of analysis is not targeted. So I am not expecting something specific in the right side of my rules. Therefore there was the idea of class association rule
mining to introduce specific semantics to this kind of rules by exploiting classes. The idea was we will define a set of classes and we try to identify those rules which explain the
important items co-occurring inside those classes. And for this algorithm I have taken an example from the area of text mining. It is I think the best field for class association rules.
Okay. So let's imagine our data as a set of documents. Here I have taken seven documents each speaking about different topics. The first three documents speak about education and the last four about sports. We have then the nouns.
Well, of course for the sake of an example I have taken a very small number of nouns for each document. You should imagine about 300 to 400 unique nouns. I have taken here three nouns for education for the first document and so on.
we apply classical a priori algorithm and we calculate of course rules. With support and confidence as we've seen before. So for example, for the education, for student school, we find the support of two divided by seven.
And this has to fulfill the minimum support condition and the minimum confidence condition. Clearly, the minimum confidence condition is not such a problematic situation.
The association rules that I am getting out of here, as you can observe, have a classical structure. Namely, the words are in the left side, the class at the right side. This is the whole idea of class association rule mining.
This is why the advantage here is that association rule mining with classes can be performed in just one step. I am going to perform the candidates to join and prune, identify the frequent item sets.
But then I don't need to perform any combinations to extract the rules, because I already know that the classes are at the right of my rule system, yeah? So then I have each transaction as before,
resulting in rules of type condition set. These are my words or items and labels of my classes. It's pretty simple. Therefore, the a priori algorithm can be further used to calculate class association rules,
just as before, just adding semantics through classes. Of course, also the idea of multiple minimum supports can be used here. I can add two different classes, different minimum supports. A classical example for this is when I define two classes,
one positive, one negative, with the corresponding items. And then I say, okay, I'm really interested in what makes the positive class. So I'm going to use a lower minimum support, and I'm going to use a higher
support for the negative class, since I'm not that interested, or it comes a lot in my data anyway. If I want, I can also exclude it entirely from my data by using a minimum class support of 100%. Good, some classical tools for performing association rule mining,
all the three algorithms we've discussed about today, or variations we've discussed about today, are offered as well as in open source projects, as in commercial projects. The most known open source projects for association rule mining and data mining as a whole are RapidMiner and Veca.
You can download them and play with them, test them, and see what kind of functionality they offer. They are completely free for student use. As commercial solutions, they are a bit more powerful. They scale very well with data, are the intelligent miner offered by IBM.
There's a product from SPSS, and of course, from Oracle, they also have their ODM miner for the Oracle database. I wanted to bring you a practical example.
So I've tested association rule mining with the a priori algorithm on a data set which I have obtained from the California University. Of course, you can find there a lot of data sets for performing exactly association rule mining.
And here I've downloaded a data set with regard to customer preferences when it comes to buying a car. The classes are unacceptable. The client found a certain product unacceptable, acceptable, good,
up to very good. And the attributes which the customers had to consider when deciding upon the class value was the cost of the car, which varied from high to low, very high to very low, the maintenance cost, the number of doors, luggage room, and so on.
So there were more attributes for considering the acceptability of a car. I've performed this study with VECA, and in the VECA interface, one can set how many rules one wants to obtain. I've set here, I think, 100 rules.
One can set the support intervals, the confidence, the class index. In the case, one wants to use the class association rule mining. In this case, I've performed classical a priori algorithm with classical association rule mining, so no class involved.
And I wanted to see what comes out. And I've observed that among, so the first rule, for example, was that coupe, so a two person car, was found as unacceptable by most of the clients.
This is a pretty convincing rule. It has a confidence of 100%. The same can be said, for example, about cars with two persons and small luggage compartment, again unacceptable, and so on.
These are pretty simple rules. VECA displays at first the rules with just a small number of items in the left part of our rule. So I waited a bit longer and received also a bit more comprising rules.
Like, for example, this is a sedan, a four seat car, which was found unacceptable due to its low safety. So if a four seat car is found unacceptable, then it is most surely because it's unsafe. And if you wait some more or define a larger threshold of allowed rules,
then you'll get even complexer rules which decrease also in confidence due to the large number of records in your data set. Good, I then wanted to try something larger. And I've taken a car accidents database, which actually is pretty interesting.
It is quite big, so 350,000 throws and 54 attributes. And I was also curious to see what kind of attributes. And then there's information about the hour the accident took place, where the accident took place, how the intersection,
if it's an intersection, looked like. What are the meteorological weather conditions? All this information is there in the attribute. So I wanted to see, okay, how do the rules look like in order to detect what kind of conditions are more frequent for such accidents?
I started the VECA and for only 54 attributes and 350,000 throws, it died pretty fast. So you can imagine the complexity of the algorithm can go up pretty fast. And the four giga of memory are fast, not enough.
Of course, there are improvements of this algorithm, which achieve this kind of performance and can deal with more amount of data. And I then also was interested to see how Walmart can do such analysis. And I found out that actually what they are doing is kill it with iron.
So they just put a lot of power when performing something like this. They try to prune as much as possible. They have a tree-based algorithm and then it works. So it's not a good idea to do this on a laptop as I've tried it.
Good, so what I would like you to take from today's lecture, we've spoken about the business intelligence. We've spoken about the importance of segmenting the customers, about the propensity of the customers to buy. How and what are they disposed to buy?
What is the profitability based on the customer? And how may I keep the customer to the company? We've spoken about an overview regarding data mining. What is data mining in general? What are the interesting algorithms?
And we've started with association rule mining. We've introduced the a priori algorithm, which we can measure the strength or the weaknesses of rules through support and confidence. The most important part of the algorithm
is the very intuitive downward closure property, which minimizes the intermediate results by a high order of magnitude. We've discussed about the association rule mining by considering multiple minimum supports to solve the rare item problem.
And we've discussed about the head item problem, which is derived from using multiple minimum supports. After the holiday, we'll continue throughout the data mining field
as an application to data warehouses with time series, trend and similarity search analysis, as well as sequence patterns. Thank you and have a nice holiday.