Course detail

Practics of Bioinformatics

FEKT-APBIAcad. year: 2013/2014

The course is focused on practical application of basic bioinformatical analyses of DNA and amino acids sequences. Primarily, it is oriented on global, local and multi alignment algorithms and algorithms for RNA and protein sequence secondary structure prediction. The signal processing methods for genomic and proteomic data analyses are studied. Further, practical application of phylogenetic tree construction is applied on suitable dataset of DNA sequences.

Language of instruction

Czech

Number of ECTS credits

3

Mode of study

Not applicable.

Learning outcomes of the course unit

The student is able to:
- find protein-coding DNA sequences in GenBank database and load the data in desired format
- find protein sequence, which is coded by the DNA sequence, in Uniprot database
- find coding regions in DNA sequences
- program dotmatrix plot for nucleotide and aminoacid sequences and suitably select filtering parameters
- use alignment online tools and suitably choose scoring parameters according data type
- program algorithms for global and local alignment
- predict secondary structure of RNA sequences with online tools
- predict secondary structure of protein sequences with online tools
- program numerical representation of DNA sequences in 1. and 4. quadrant and compare the numerical signals by calculation of Eucledian distance
- program discrete Fourier transform for DNA sequences with fixed size of window and with moving window
- program calculation of DNA spectrograms
- construct phylogenetic tree from DNA sequences by online tools

Prerequisites

Student should have knowledge equivalent to completion of the course ABIN. Student should be able to work with Matlab or other programming language. Student have to know the basics of molecular biology of nucleotide and protein sequences and understand to principles of global and local sequence alignment and principle of protein secondary structure prediction.

Co-requisites

Not applicable.

Planned learning activities and teaching methods

Teaching methods depend on the type of the course unit as specified in the article 7 of BUT Rules for Studies and Examinations.

Assesment methods and criteria linked to learning outcomes

Student can get maximally 30 points for activity in lessons and maximally 70 points for elaboration of analysis report of DNA and protein sequences. It is necessary to get minimally 35 points for the report and in sum minimally 50 points to successfully pass the course.

Course curriculum

1. Searching in sequence databases.
2. Prediction of protein coding regions and alignment of sequences.
3. Multialignment of sequences.
4. RNA secondary structure prediction.
5. Protein secondary structure prediction.
6. Numerical representations.
7. Discrete Fourier transform of DNA and protein sequences.
8. Spectrograms of DNA and protein sequences.
9. EIIP values for numerical representation and its spectrograms.
10. Phylogenetic tree construction.

Work placements

Not applicable.

Aims

The goal of this course is to teach the students how to search in the basic genomic and proteomic databases like GenBank and Uniprot, analyse data from the databases with commonly used bioinformatic online tools, and the student are also teached how to program some basic bioinformatics algorithms.

Specification of controlled education, way of implementation and compensation for absences

Computer exercises are mandatory, properly excused missed lectures can be compensated individually after discussion with teacher.

Recommended optional programme components

Not applicable.

Prerequisites and corequisites

Not applicable.

Basic literature

Cvrčková F: Úvod do praktické bioinformatiky, Academia, 2006 (CS)

Recommended reading

Not applicable.

Classification of course in study plans

  • Programme BTBIO-A Bachelor's

    branch A-BTB , 3 year of study, summer semester, compulsory

  • Programme EEKR-CZV lifelong learning

    branch EE-FLE , 1 year of study, summer semester, compulsory

Type of course unit

 

Exercise in computer lab

39 hod., compulsory

Teacher / Lecturer

Syllabus

1. Bioinformatic data. Genome sequences of organisms., Databases accessible to the public. Reading, data imaging in Matlab. Basic problems from bioinformatics solved in the programming environment Matlab
2. Processing of geonome sequences according to the principal statistical standards.
3. Comparison of sequences. Levelling of sequences. Coincidence rate.
4. Seeking patterns in sequences
5. Non-linear methods for comparison of samples, method of dynamic time warping
6. Hidden Markov models in resolution methods
7. Seeking patterns by means of non-linear methods for classification problems.
8. Cluster analysis using non-linear comparative approaches
9. Statistical evaluation of classification procedures, volumes of processed data.
10. Expert system as a classifier.
11. Presentation of individual work.
12. Presentation of individual work.
13. Test.