Course detail
Multidimensional Analysis of Biomedical Data
FEKT-MPA-VMMAcad. year: 2025/2026
The course is oriented on commonly used methods in the field of multivariate data analysis: cluster analysis, principal component method, t-SNE, UMAP, etc. Both theoretical (basic principles of each method) and practical (applications in display and analysis of multivariate data) aspects are discussed. The theory is discussed in direct connection with practical examples. All computational techniques are practiced using the Python environment. The course prepares students to independently use the given methods for data analysis in their own scientific or routine work.
Language of instruction
Number of ECTS credits
Mode of study
Guarantor
Entry knowledge
The student should have knowledge of basic statistical data analysis and linear algebra. Knowledge of Python is required during PC excercise.
Rules for evaluation and completion of the course
1) Team project (max. 20 points):
- Developing an original solution to the team project and defending it at the end of the semester (according to the guidelines)
Notes:
- The completion of the assignment and the quality of the presentation of the results by all team members will be evaluated
- Plagiarism will result in no credit being awarded
- At least one team consultation with the advisor is mandatory!
2) Final exam (max. 80 points):
- oral form
- two parts in total, each for a maximum of 40 points
Conditions for credit and admission to the final exam:
- obtaining a non-zero number of points for the team project.
- maximum of two excused absences (in exceptional cases the course supervisor will decide on the solution)
Conditions for successful completion of the course:
- obtaining credit
- obtaining at least 20 points in each of the two parts of the exam
- obtaining a total (i.e. from the project and the exam) of at least 50 points
Aims
The aim of the course is to provide students with knowledge in the field of multivariate data analysis and to present the possibilities of using selected procedures in the processing and analysis of biomedical data. The student will acquire basic knowledge and skills in the use of multivariate analysis methods. The student will be able to apply the most commonly used methods in practice in order to process and analyse data.
The examination verifies that the graduate of the course is able to:
- explain the basic concepts of multivariate analysis,
- describe the basic methods in this field, discuss the advantages and disadvantages of each method,
- select and use appropriate tools for a given problem in this area,
- evaluate the quality of obtained results and present them in an appropriate form,
- interpret the obtained results.
Study aids
Prerequisites and corequisites
Basic literature
J.H. McDonald: Handbook of Biological Statistics, Sparky House Publishing, 2008 (CS)
Recommended reading
Classification of course in study plans
Type of course unit
Lecture
Teacher / Lecturer
Syllabus
1. Introduction to multivariate analysis of data. Goals of multivariate analysis. Classification of methods.
2. Basics of linear algebra - review.
3. Multivariate statistical distributions and tests.
4. Data preprocessing methods. Transformation and standardization. The issue of missing data.
5. Relationships between variables in multivariate space. Similarity and distance metrics. Regression, correlation, covariance.
6. Clustering data analysis. Hierarchical and non-hierarchical methods. Determining the optimal number of clusters. Validation of clustering results.
7. Ordination analysis. Principal component analysis (PCA). Matrix decomposition principles.
8. Dual forms and kernels - kernel PCA.
9. Ordination analysis. Factor analysis. Factor rotation.
10. Non-linear dimensionality reduction. t-SNE.
11. Non-linear dimensionality reduction. UMAP.
12. Examples of using the multivariate analysis: visualization, feature selection and extraction, analysis of relationships between variables.
Exercise in computer lab
Teacher / Lecturer
Syllabus
1. Basics of linear algebra - review.
2. Exploratory data analysis: visualization, statistical descriptive analysis.
3. Exploratory data analysis: data processing.
4. Relationships in multivariate space: correlation analysis.
5. Relationships in multivariate space: regression analysis, logistic regression.
6. Relationships in multivariate space: MANOVA.
7. Ordination analysis: PCA.
8. Ordination analysis: kernel PCA.
9. Clustering analysis: k-means, UPGMA.
10. Clustering analysis: evaluation of clustering quality.
11. Visualization of multivariate data: t-SNE.
12. Visualization of multivariate data: UMAP.