An Introduction To Sequence Comparison and Database Search

UPF, October 2016

Cedric Notredame


This course is an 8 hours primer on sequence alignments. Its goal is to present an overview of the basic concepts of sequence alignments and some of their applications. The first two hours will be dedicated to molecular evolution. We will focus on the implications of molecular evolution on sequence variation. We will use these concepts to define homology. We will then see how specific mathematical models (the substitution matrices) have been derived in order to quantify the evolutionary relationship between sequences. The next two hours will be used to introduce the Needleman and Wunsch algorithm (Dynamic programming), a very basic algorithm that makes it possible to derive pairwise alignments from the sequences while using the substitution matrices. Over the following 2 hours, we will see how these pairwise alignment methods can be applied to database searches and we will develop the main concepts behind the BLAST algorithm. I will finally introduce the notion of multiple sequence alignment and show how a group of related sequences can be compared in order to infer common properties. We will then see the main principles behins two multiple sequence alignment package: ClustalW and T-Coffee.

Send your Questions to:

1UPFLECTUREPairwise comparisons in an evolutionary context -1L
2UPFLECTUREPairwise comparisons in an evolutionary context -2L
3UPFLECTURESubstitution Matrices -1 L
4UPFLECTURESubstitution Matrices -2L
5UPFLECTUREIntroduction to Dynamic Programming -1L
6UPFLECTUREIntroduction to Dynamic Programming -2L
7UPFLECTUREDatabase Searches with BLAST -1 L
8UPFLECTUREDatabase Searches with BLAST -2 L
9UPFLECTUREMultiple Sequence Alignments -1 L
10UPFLECTUREMultiple Sequence Alignments -2L
UPF PRACTICALDatabase SearchesP
UPFPRACTICALIntroduction to Dynamic ProgrammingP


1. Claverie and Notredame, Bioinformatics for Dummies, 2007, Wiley

2. Durbin et al., Biological Sequence Analysis, 1999, Oxford Press

3. Tisdall, Begining Perl for Bioinformatics, 2001, O'Reilley

4. Patthy, Protein Evolution, 2007, Blackwell

This Entire Course Was Automatically Generated Using BED, the Bioinformatics Exercise Database. BED is a freeware available on request Cedric Notredame