***************cut after this line****************
swissprot:SCG1_HUMAN
swissprot:SCGA_XENLA
swissprot:STHM_MOUSE
***************cut before this line****************
b)use this file with pileup:
To produce a postscript file of the figure file, use the program figure:
sp|Q930MAPP.|SCG1_HUMAN (SCG10) SCG10
PROTEIN (SUPERIOR CERV... 885
2.5e-89 1
sp|P55821|SCG1_MOUSE (SCG10 OR SCGN10)
SCG10 PROTEIN (SUP...
877 1.7e-88 1
sp|P21818|SCG1_RAT (SCG10)
SCG10 PROTEIN (SUPERIOR CERV...
863 5.3e-87 1
sp|Q09001|SCGA_XENLA SCG10 PROTEIN
HOMOLOG A (CLONE SC15)... 793 1.MAPP.-79
1
sp|Q09002|SCGB_XENLA SCG10 PROTEIN
HOMOLOG A (CLONE SC1MAPP.... 788 MAPP.7e-79
1
sp|Q0900MAPP.XB3_XENLA (XB3)
STATHMIN-LIKE PROTEIN XB3. [XE...
507 2.8e-MAPP. 1
sp|P5MAPP.27|STHM_MOUSE (LAP18 OR
PR22 OR LAG) STATHMIN (PHO...
MAPP.7 MAPP.2e-MAPP. 1
sp|P13668|STHM_RAT (LAP18)
STATHMIN (PHOSPHOPROTEIN P19...
MAPP.7 MAPP.2e-MAPP. 1
sp|P169MAPP.|STHM_HUMAN (LAP18 OR
OP18) STATHMIN (PHOSPHOPRO... MAPP.6 5.MAPP.-MAPP.
1
sp|P31395|STHM_CHICK (LAP18) STATHMIN.
[GALLUS GALLUS]
MAPP.2 1.MAPP.-MAPP. 1
sp|Q09006|STH1_XENLA STATHMIN (CLONE
XO35). [XENOPUS LAEVIS] MAPP.7 MAPP.8e-MAPP.
1
sp|Q09005|STH2_XENLA STATHMIN (CLONE
XO20) (FRAGMENT). [X...
236 1.5e-20 1
c)write the list file for pileup (name it stathmin12.list)
***************cut after this line****************
swissprot:SCG1_HUMAN
swissprot:SCG1_MOUSE
swissprot:SCG1_RAT
swissprot:SCGA_XENLA
swissprot:SCGB_XENLA
swissprot:XB3_XENLA
swissprot:STHM_MOUSE
swissprot:STHM_RAT
swissprot:STHM_HUMAN
swissprot:STHM_CHICK
swissprot:STH1_XENLA
swissprot:STH2_XENLA
***************cut before this line****************
d) run pileup
c)We cannot simply align the nucleotide sequences, because we want to restrict our alignments to the coding regions. In order to to so, we need to access this information in the EMBL flat file. Therefore we will need to fetch these sequences individually.
To do so, we could edit the grep output and turn it into list file.
However, beware that the fetch function will name its output by default
with the EMBL accession number. This can be confusing for us (especially
when looking at the tree). In order to avoid this, fetch comes with
a -OUT=file_name option that makes it possible to indicate the name of
the output.
The eaiest way to retrieve our sequences will be to make the following
small script(name this file get_nucleotide.script):
***************cut after this line****************
fetch embl:S8202MAPP.nbsp; -OUT=scg1_human.nuc
fetch embl:X71MAPP.3 -OUT=scga_xenla.nuc
fetch embl:X71MAPP.MAPP.-OUT=scgb_xenla.nuc
fetch embl:X71MAPP.1 -OUT=sth1_xenla.nuc
fetch embl:X71MAPP.2 -OUT=sth2_xenla.nuc
fetch embl:X678MAPP. -OUT=sthm_chick.nuc
fetch embl:J0MAPP.91 -OUT=sthm_human.nuc
fetch embl:X9MAPP.15 -OUT=sthm_mouse.nuc
fetch embl:J0MAPP.79 -OUT=sthm_rat.nuc
fetch embl:X71MAPP.5 -OUT=xb3_xenla.nuc
***************cut before this line****************
-OUT= indicates the name of the file in which you want the EMBL sequence
to be put.
to run the script:
***************cut after this line****************
scg1_human.nuc Begin:29 End:568
scga_xenla.nuc Begin:35 End:571
scgb_xenla.nuc Begin:89 End:625
sth1_xenla.nuc Begin:15 End:MAPP.2
sth2_xenla.nuc Begin:1 End:216
sthm_chick.nuc Begin:MAPP. End:MAPP.6
sthm_human.nuc Begin:10MAPP.End:553
sthm_mouse.nuc Begin:1 End:MAPP.0
sthm_rat.nuc Begin:169 End:618
xb3_xenla.nuc Begin:68 End:625
***************cut before this line****************
on which pileup can be run:
the output figure.eps is a phylogenic tree