Course detail
Practics of Bioinformatics
FEKT-APBIAcad. year: 2015/2016
The course is focused on practical application of basic bioinformatical analyses of DNA and amino acids sequences. Primarily, it is oriented on global, local and multi alignment algorithms and algorithms for RNA and protein sequence secondary structure prediction. The signal processing methods for genomic and proteomic data analyses are studied. Further, practical application of phylogenetic tree construction is applied on suitable dataset of DNA sequences.
Language of instruction
Number of ECTS credits
Mode of study
Guarantor
Learning outcomes of the course unit
- find protein-coding DNA sequences in GenBank database and load the data in desired format
- find protein sequence, which is coded by the DNA sequence, in Uniprot database
- find coding regions in DNA sequences
- program dotmatrix plot for nucleotide and aminoacid sequences and suitably select filtering parameters
- use alignment online tools and suitably choose scoring parameters according data type
- program algorithms for global and local alignment
- predict secondary structure of RNA sequences with online tools
- predict secondary structure of protein sequences with online tools
- program numerical representation of DNA sequences in 1. and 4. quadrant and compare the numerical signals by calculation of Eucledian distance
- program discrete Fourier transform for DNA sequences with fixed size of window and with moving window
- program calculation of DNA spectrograms
- construct phylogenetic tree from DNA sequences by online tools
Prerequisites
Co-requisites
Planned learning activities and teaching methods
Assesment methods and criteria linked to learning outcomes
30 points for programming,
30 points for test,
40 points for practical analysis.
Course curriculum
2. Prediction of protein coding regions and alignment of sequences.
3. Multialignment of sequences.
4. RNA secondary structure prediction.
5. Protein secondary structure prediction.
6. Numerical representations.
7. Discrete Fourier transform of DNA and protein sequences.
8. Spectrograms of DNA and protein sequences.
9. EIIP values for numerical representation and its spectrograms.
10. Phylogenetic tree construction.
Work placements
Aims
Specification of controlled education, way of implementation and compensation for absences
Recommended optional programme components
Prerequisites and corequisites
Basic literature
Recommended reading
Classification of course in study plans
Type of course unit
Exercise in computer lab
Teacher / Lecturer
Syllabus
2. Processing of geonome sequences according to the principal statistical standards.
3. Comparison of sequences. Levelling of sequences. Coincidence rate.
4. Seeking patterns in sequences
5. Non-linear methods for comparison of samples, method of dynamic time warping
6. Hidden Markov models in resolution methods
7. Seeking patterns by means of non-linear methods for classification problems.
8. Cluster analysis using non-linear comparative approaches
9. Statistical evaluation of classification procedures, volumes of processed data.
10. Expert system as a classifier.
11. Presentation of individual work.
12. Presentation of individual work.
13. Test.