Data Mining

Data is displayed for academic year: 2023./2024.

Course Description

Data mining - definitions and areas of application. Types of data. Data sources and their acquisition. Data preprocessing - data manipulation, data filtering, data transformation, feature selection. Imbalanced datasets, concept drift. Classifier ensembles. Models with clear interpretation based on induction rules and decision trees. Model explainability. Association rules. Time series analysis. Deep learning in data mining. Deep learning architectures in applications. Specifics of data mining in different fields of application. Use of freely available tools for data mining. Data mining project.

Study Programmes

University graduate
[FER3-EN] Data Science - profile
Recommended elective courses (2. semester)

Learning Outcomes

  1. identify any potential shortcomings of the analyzed data set
  2. evaluate the suitability of the used sequence of machine learning methods in various fields of application
  3. combine feature selection methods on a given problem
  4. analyze the given data set using a suitable sequence of machine learning methods in at least one existing software tool
  5. develop your own software to analyze a particular dataset
  6. classify machine learning techniques by the type of problem they are solving
  7. analyze time series from different domains with predictive analytics techniques
  8. construct explainable machine learning models to facilitate reaching decisions in specific domain

Forms of Teaching


Lectures - theory

Independent assignments

Data mining project

Grading Method

Continuous Assessment Exam
Type Threshold Percent of Grade Threshold Percent of Grade
Seminar/Project 40 % 60 % 40 % 60 %
Final Exam: Written 40 % 40 %
Exam: Written 40 % 40 %

Week by Week Schedule

  1. Course administration. Introduction to data mining. Description of the field. Data mining process models. References.
  2. Data preparation for data mining: data preparation process, problems in data and their solutions. Examples. Project.
  3. Data transformation, dimensionality reduction and feature extraction. Project.
  4. Feature selection: filter methods, wrapper methods, embedded methods, hybrid methods. Examples. Project.
  5. Imbalanced data, concept drift. Algorithms for solving these problems. Project.
  6. Classification and regression ensembles. Ensemble algorithms. Ensemble models' explanation methods. Project.
  7. Interpretable machine learning. Rules induction. Algorithms for induction rules. Project.
  8. -
  9. Frequent pattern mining and association rules. High-utility itemset mining. Applications in recommender systems. Algorithms. Project.
  10. Time series data mining: introduction and terminology. Time series analysis components. Feature extraction-based time series analysis. Project.
  11. Time series data mining: classification and prediction algorithms. Project.
  12. Deep learning in data mining: introductory topics. Project.
  13. Deep learning in data mining: architectures in application areas: natural language processing, time series classification, image classification, image generation from text. Deep learning model explainability. Project delivery.
  14. Project presentations
  15. Final exam


Witten IH, Frank E, Hall MA, Pal CJ. (2016.), Data Mining: Practical Machine Learning Tools and Techniques. 4th ed., Morgan Kaufmann
Fuernkranz J, Gamberger D, Lavrač N. (2012.), Foundations of Rule Learning, Springer
James G, Witten D, Hastie T, Tibshirani R. (2014.), An Introduction to Statistical Learning: with Applications in R, Springer
Raschka S, Mirjalili V. (2017.), Python Machine Learning. 2nd ed., Packt Publishing, Birmingham UK
Ryza S, Laserson U, Owen S, Wills J. (2017.), Advanced Analytics with Spark: Patterns for Learning from Data at Scale. 2nd ed., O'Reilly Media, Sebastopol CA, USA
Mitchell, R. (2018.), Web Scraping with Python: Collecting more data from the Modern Web. 2nd ed., O'Reilly Media, Sebastopol CA, USA
Masis, S. (2023.), Interpretable Machine Learning with Python, Packt Publishing, Birmingham UK

For students


ID 223066
  Summer semester
L1 English Level
L1 e-Learning
45 Lectures
0 Seminar
0 Exercises
18 Laboratory exercises
0 Project laboratory
0 Physical education excercises

Grading System

88 Excellent
75 Very Good
63 Good
50 Sufficient