Course detail

Knowledge Discovery in Databases

FIT-ZZNAcad. year: 2009/2010

Basic concepts concerning knowledge discovery in data, relation of knowledge discovery and data mining. Data sources for knowledge discovery. Principles and techniques of data preprocessing for mining. Systems for knowledge discovery in data, data mining query languages. Data mining techniques – characterization and discrimination, association rules, classification and prediction, clustering. Complex data type mining. Trends in data mining. Working-out a data mining project by means of an available data mining tool.

Language of instruction

Czech

Number of ECTS credits

5

Mode of study

Not applicable.

Learning outcomes of the course unit

Students get a broad, yet in-depth overview of the field of data mining and knowledge discovery. They are able both to use and to develop knowledge discovery tools.

Prerequisites

There are no prerequisites

Co-requisites

Not applicable.

Planned learning activities and teaching methods

Not applicable.

Assesment methods and criteria linked to learning outcomes

Duty credit consists of working-out the project and of obtaining at least 25 points for activities during semester.

Course curriculum

  1. Introduction - motivation, fundamental concepts, data source and knowledge types.
  2. Data Warehouse and OLAP Technology for knowledge discovery.
  3. Data Preparation.
  4. Mining frequent patterns and associations - basic concepts, efficient and scalable frequent itemset mining methods.
  5. Multi-level association rules, association mining and correlation analysis, constraint-based association rules.
  6. Classification and prediction - basic concepts, decision tree, Bayesian classification, rule-based classification.
  7. Classification by means of neural networks, SVM classifier, other classification methods, prediction.
  8. Cluster analysis - basic concepts, types of data in cluster analysis, partitioning and hierarchical methods.
  9. Other clustering methods.
  10. Mining stream, time-series and sequence data.
  11. Graph mining, social network analysis, multirelational data mining.
  12. Mining object , spatial and multimedia data, text mining, mining the Web.
  13. Applications and Trends in Data Mining.

Work placements

Not applicable.

Aims

To familiarize students with knowledge discovery in data sources, to explain useful knowledge types and the steps of the knowledge discovery process, and to familiarize them with techniques, algorithms and tools used in the process.

Specification of controlled education, way of implementation and compensation for absences

A mid-term test, formulation of a data mining task, presentation of the project.

Recommended optional programme components

Not applicable.

Prerequisites and corequisites

Not applicable.

Basic literature

Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Third Edition. Morgan Kaufmann Publishers, 2012, 703 p., ISBN 978-0-12-381479-1.Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Second Edition. Elsevier Inc., 2006, 770 p., ISBN 1-55860-901-3.    

Recommended reading

Not applicable.

Classification of course in study plans

  • Programme IT-MSC-2 Master's

    branch MBI , 2 year of study, winter semester, compulsory
    branch MBS , 0 year of study, winter semester, compulsory-optional
    branch MGM , 2 year of study, winter semester, elective
    branch MGM , 2 year of study, winter semester, elective
    branch MIN , 2 year of study, winter semester, compulsory
    branch MIN , 2 year of study, winter semester, compulsory
    branch MIS , 2 year of study, winter semester, compulsory-optional
    branch MIS , 2 year of study, winter semester, elective
    branch MMI , 0 year of study, winter semester, elective
    branch MMM , 0 year of study, winter semester, elective
    branch MPS , 0 year of study, winter semester, elective
    branch MPV , 1 year of study, winter semester, compulsory-optional
    branch MSK , 2 year of study, winter semester, compulsory-optional

Type of course unit

 

Lecture

39 hod., optionally

Teacher / Lecturer

Syllabus

  1. Introduction - motivation, fundamental concepts, data source and knowledge types.
  2. Data Warehouse and OLAP Technology for knowledge discovery.
  3. Data Preparation.
  4. Mining frequent patterns and associations - basic concepts, efficient and scalable frequent itemset mining methods.
  5. Multi-level association rules, association mining and correlation analysis, constraint-based association rules.
  6. Classification and prediction - basic concepts, decision tree, Bayesian classification, rule-based classification.
  7. Classification by means of neural networks, SVM classifier, other classification methods, prediction.
  8. Cluster analysis - basic concepts, types of data in cluster analysis, partitioning and hierarchical methods.
  9. Other clustering methods.
  10. Mining stream, time-series and sequence data.
  11. Graph mining, social network analysis, multirelational data mining.
  12. Mining object , spatial and multimedia data, text mining, mining the Web.
  13. Applications and Trends in Data Mining.

Project

13 hod., optionally

Teacher / Lecturer