An Introduction To Sequence Comparison and Database Search

UPF, October 2018

Cedric Notredame


This course is an 8 hours primer on sequence alignments. Its goal is to present an overview of the basic concepts of sequence alignments and some of their applications. The first two hours will be dedicated to molecular evolution. We will focus on the implications of molecular evolution on sequence variation. We will use these concepts to define homology. We will then see how specific mathematical models (the substitution matrices) have been derived in order to quantify the evolutionary relationship between sequences. The next two hours will be used to introduce the Needleman and Wunsch algorithm (Dynamic programming), a very basic algorithm that makes it possible to derive pairwise alignments from the sequences while using the substitution matrices. Over the following 2 hours, we will see how these pairwise alignment methods can be applied to database searches and we will develop the main concepts behind the BLAST algorithm. I will finally introduce the notion of multiple sequence alignment and show how a group of related sequences can be compared in order to infer common properties. We will then see the main principles behins two multiple sequence alignment package: ClustalW and T-Coffee.

Send your Questions to:

1UPFLECTURESPairwise comparisons in an evolutionary context -1L
2UPFLECTURESPairwise comparisons in an evolutionary context -2L
3UPFLECTURESSubstitution Matrices -1 L
4UPFLECTURESSubstitution Matrices -2L
5UPFLECTURESIntroduction to Dynamic Programming -1L
6UPFLECTURESIntroduction to Dynamic Programming -2L
7UPFLECTURESDatabase Searches with BLAST -1 L
8UPFLECTURESDatabase Searches with BLAST -2 L
9UPFLECTURESMultiple Sequence Alignments -1 L
10UPFLECTURESMultiple Sequence Alignments -2L
UPF PRACTICALSDatabase SearchesP
UPFPRACTICALSIntroduction to Dynamic ProgrammingP


1. Claverie and Notredame, Bioinformatics for Dummies, 2007, Wiley

2. Durbin et al., Biological Sequence Analysis, 1999, Oxford Press

3. Tisdall, Begining Perl for Bioinformatics, 2001, O'Reilley

4. Patthy, Protein Evolution, 2007, Blackwell

This Entire Course Was Automatically Generated Using BED, the Bioinformatics Exercise Database. BED is a freeware available on request Cedric Notredame