An Introduction To Sequence Comparison and Database Search

UPF, January 2010

Cedric Notredame


This course is an 6 hours primer on sequence alignments. Its goal is to present an overview of the basic concepts of sequence alignments and some of their applications. The first two hours will be dedicated to molecular evolution. We will focus on the implications of molecular evolution on sequence variation. We will use these concepts to define homology. We will then see how specific mathematical models (the substitution matrices) have been derived in order to quantify the evolutionary relationship between sequences. The next two hours will be used to introduce the Needleman and Wunsch algorithm (Dynamic programming), a very basic algorithm that makes it possible to derive pairwise alignments from the sequences while using the substitution matrices. Over the following 2 hours, we will see how these pairwise alignment methods can be applied to database searches and we will develop the main concepts behind the BLAST algorithm. I will finally introduce the notion of multiple sequence alignment and show how a group of related sequences can be compared in order to infer common properties. We will then see the main principles behins two multiple sequence alignment package: ClustalW and T-Coffee.

Send your Questions to:

1UABLECTUREEvolution and Comparison ModelsL
2UABLECTUREDynamic Programming Based Sequence ComparisonL
3UABLECTUREBlast and Database SearchesL
4UABLECTUREMultiple Sequence Alignments: Algorithms and StrategiesL
Paper BIBLIOA Review on Multiple Sequence AlignmentsP


1. Claverie and Notredame, Bioinformatics for Dummies, 2007, Wiley

2. Durbin et al., Biological Sequence Analysis, 1999, Oxford Press

3. Patthy, Protein Evolution, 2007, Wiley

This Entire Course Was Automatically Generated Using BED, the Bioinformatics Exercise Database. BED is a freeware available on request Cedric Notredame