Sciences and methods > Biosciences > Programs > SeaView > Seaview Manual
 
Tehdyt toimenpiteet

Seaview Manual

About seaview

Program seaview was published in:
Galtier, N., Gouy, M. and Gautier, C. (1996) SEAVIEW and PHYLO_WIN: two graphic
tools for sequence alignment and molecular phylogeny. Comput. Appl. Biosci.,
12:543-548.

Seaview is available by anonymous FTP at pbil.univ-lyon1.fr in directory
/pub/mol_phylogeny/seaview for several unix platforms and for Windows95/98.

Seaview uses the FLTK library by Bill Spitzak and others.

Alignment panel

Mouse Use:

  • Click on a site to position cursor (current cursor position -- sequence number, positions in alignment and in sequence -- appears at panel top center)
  • Click or drag mouse on several sequence names to group them (names will appear in black background). Alignment operations (gap insertions/deletions) will operate on all grouped sequences simultaneously.
  • Double-click on any name to group all sequences.
    Ctrl-Click somewhere in sequence names to move grouped sequences to another place in alignment.
  • Shift-Click to deactivate the current grouping; names return in normal print.
  • Click with middle mouse button somewhere in sequence names to move grouped sequences to another place in alignment.
  • Click with right mouse button to deactivate the current grouping; all names return in normal print.

Active keys:

  • The 4 keyboard arrows move the cursor
  • > < move the view by 50 sites to right and left (other keys can be used)
  • o ] [ move the view by 5 sites to right and left (other keys can be used) see "Miscellaneous options" for how to use other keys for same purpose
  •  - insert a gap in the cursor sequence and members of its group if any
  • or delete a gap at left of cursor
  • + inserts a gap in all sequences but that under the cursor and members of its group if any
  • _ ("underscore") deletes a gap in all sequences but that under the cursor and members of its group if any
  • 1234567890 (digit keys) allow to compose a number (appearing under mult=number at bottom left of panel) used as a multiplier to an operation to follow (e.g., keys 4, 5, and - in sequence will insert 45 gaps)
  • Alt+S (both keys together) save the alignment in the current filename using the current file saving format displayed by menu "Props". If format of the last loaded file does not match the current saving format, this key is not active.

Mouse use on the site line (when there is one, white line at bottom):

  • Click mouse to select/unselect a site.
  • Drag with the mouse button an extremity of a block of sites to extend or reduce it.
  • Ctrl-Click to extend the block of sites at left.
  • Shift-Click on a block of sites to delete it.
  • Click with the left mouse button to select/unselect a site.
  • Drag with the left mouse button an extremity of a block of sites to extend or reduce it.
  • Click with the middle mouse button to extend the block of sites at left.
  • Click with the right mouse button on a block of sites to delete it.

File menu

Open Mase, Open Phylip, Open Clustal, 
Open MSF, Open Fasta, Open NEXUS

to load an alignment in one of these formats.
Mase and Nexus formats have the useful feature of allowing extra data beyond sequences and names (comments, accession numbers are really useful!). They can also store site sets [see item `Sites menu'].

S Save   

when active, saves the alignment in the current file (which name appears as window title), and indicates that keys + S pressed together will perform this operation.

Save as...    

allows to save the alignment under a filename to be chosen in the file selector appearing next [Pressing button Rescan therein may be necessary to have all files appear].

Save current sites            

only active when a site line is displayed; allows to save in a file only those sites of the sequences where an X is in the site line. If some sequences are selected, only these are saved.

Prepare printout

Prepares a file adequate for alignment printout on a printer. The file selector appearing next allows to specify 3 parameters:

block size

alignment lines may be divided in blocks (e.g. of size 10 for DNA/RNA sequences or 3 for protein coding sequences) to ease alignment reading.

chars/line

the max number of characters that fit on a printer line (not the desired number of residues per line, which is set to the largest fitting value by the program).

lines/page

the number of lines that fit on a printer page (in order that blocks of alignment lines are not divided by page borders).

Quit           

guess what?

Props menu

Fontsize >   Tiny, Small, ..., Huge

Sets the font size used to display sequences.

Save format > Save as Mase, as Phylip, 
as Clustal, as MSF, as Fasta, as NEXUS

Sets the file format used to save alignments [`Miscellaneous options' topic explains how to set default format]

Colors >    DNA/RNA colors

Forces sequences to be colored as DNA/RNA in case they were erroneously recognized as protein data.

Colors >    Protein colors

Forces sequences to be colored as proteins in case they were erroneously recognized as DNA/RNA data.

Colors >    Alt. colors

Uses alternate coloring rules defined for protein sequences [`Coloring schemes' topic below explains how to define them].

Colors >    Inverted colors

Display residues as black letters on colored background instead of as colored letters on grey background.

Allow seq. edition

When ON, residues can be deleted/inserted. When OFF (normally), only gaps can be deleted/inserted.

by Reference 

To display the alignment by reference to a particular sequence: residues identical to that of the ref. seq. are displayed as dots. Possible only when one sequence, the reference, is selected.

Alignment edition is impossible while in the reference mode. When used, File:Prepare printout produces a printout by reference.

Combination +R is a shortcut for calling/exiting this mode.

DNA keys     

Allows to use keys "hjkl" to enter DNA sequences. "Allow seq. edition" should be ON for this item to be usable.

Choose in the submenu what bases will be mapped to these keys
> hjkl => GATC, hjkl => TCGA, hjkl => ACGT

Also, the space bar will be mapped to base N.

Clustalw options

Allows to specify options sent to the clustalw alignment program in order to set, e.g., specific alignement parameter values.

Go to submenu "Edit options" and type desired options in the dialog window that appears.

An example is to type /gapopen=5

An example is to type -gapopen=5

When an option has been typed it is active unless you switch it off by calling again "Clustalw options" and sliding to the check box.

Thus a set of options can be memorized and turned on and off.

Consensus option

Allows to set the percent of identical residues needed to define a residue when building a consensus sequence. Sites where this fraction is not reached are filled with N or X.

Fast-Rough   

Allows to switch between smooth-slow or rough-fast display.

Allow lowercase

If set, lowercase residues will be apparent. If not set, lowercase residues in sequence files display as uppercase.

Sites menu

Site sets allow to specify parts of a multiple alignment to be retained for further analyses (typically, those parts of the alignment taken as reliably aligned). Retained sites are depicted as series of Xs on a special line at bottom of the alignment panel. On color screens, they are also displayed by a dark background. Mouse clicks and drags on this line allow to construct/alter the set.

Several sets of sites can be created and stored with the alignment if the Mase or NEXUS formats of alignment files are used. Each set has a name chosen by the user. One set of sites at most can be displayed at any time through this menu.

Item "Save current sites" of menu "File" allows to save in an alignment file only those sites of the alignment pertaining to the currently displayed set. The resulting file may then be processed by, e.g., tree building programs.

Create set      

creates a line at bottom of alignment to hold the location of alignment sites.

Save set        

stores changes done to the current set of sites

Rename set      

stores the current set of sites under a new name

Hide set        

hides (but remembers except unsaved changes) the current set of sites

Delete set      
  • removes the currently displayed set of sites from list of known sets
  • shows a sites line of name constructed from data stores in the alignment, or previously created


Species menu

Species sets can be created and stored with the alignment if the Mase or NEXUS formats of alignment files are used.

To select one or several species, click or drag on their names; they will appear in black background.

To memorize the current set of selected species, choose "Create set" from this menu. The program will ask for a name for this set.

Delete set	

deletes (just from memory) the current set of species.

Displays with black background the set of species memorized under that name.

Footers menu

Comment lines can be created and displayed at bottom of the screen. These lines can contain any text and the program will maintain the vertical alignment between this text and sequences. This text can be saved using the mase or NEXUS file formats only.

To edit this text, click on the line name, position the cursor, and type text.

Click again on the line name to stop editing this text.

Show / Hide footers

 To show / hide all footer lines

Create footer             

To create a new footer line

Delete footer             

To delete the currently selected footer line

Search button & box

Type a string in box at right and strike key or push button to position the cursor in the next occurence of this string from its current place.

Push button to position the cursor at next occurence of the current search string.

Sequence gaps are ignored by the search procedure.

Goto button and box

Moves the cursor to desired position or sequence
Position:
Type a number in box at right and hit key or push button to position the cursor at that position of the alignment.
Sequence:
Type a (fragment of) sequence name in box at right and hit key or push button to position the cursor in that sequence. Case is not significant.

Edit menu

Rename sequence          

To rename the currently selected (= name in black background) sequence.

Edit comments            

To see or change comments of the currently selected sequence (Comments can only be saved in mase/NEXUS).

Delete sequence(s)       

Deletes all selected sequences from the alignment.

Create sequence          

Allows to create a new-empty sequence in the alignment; set "Allow seq. edition" from "Props" menu ON to be able to type the sequence in.

Load sequence            

Allows to load a new sequence in the alignment. The sequence can be typed in or pasted from a selection made in another window: drag on the sequence data, and paste it with the middle mouse button in the `Sequence loading' window.

Duplicate sequence       

Allows to duplicate the currently selected sequence wih prefix D_ in its name.

Complement sequence      

Creates a new sequence equal to the complementary strand of the currently selected sequence with prefix C_ in its name.

Reverse sequence        

Creates a new sequence by reading 3' -> 5' the currently selected sequence and named with prefix R_

Exchange Us and Ts       

Exchange bases Us and Ts in all currently selected sequences.

Align sites              

Runs the clustalw alignment program on the block of selected sites and the set of selected sequences. A window will ask for choosing the reference sequence: gaps present before clustalw alignment in the chosen sequence will be preserved in the new alignment.

Dot plot                 

Performs a "dot plot" analysis of the two selected sequences. See "Dot plot" item of the help data.

Consensus sequence       

Computes the consensus of all currently selected sequences. At any site, the consensus residue is the most frequent one if its frequency is above a threshold value. This threshold (60 % by default) can be changed through item "Consensus threshold" of menu "Props".

Below threshold, N or X is used.

Dot plot

Performs a "dot plot" analysis of two sequences. Enter desired values for the window size and # of matches/window, and click on button "Compute", the dot plot will appear.

Click in the dot plot, the corresponding sequence regions appear in the alignment panel above the dot plot. Use "Magnify" to take a close look.

Click on arrows at left to move the hit point by one residue in either of six directions.

Move the slider below the alignment panel to control the number of displayed residues.

Fit to window, Reduce, Magnify 

perform zoom in and out operations

Write Postcript 

saves the dot plot in a file called from the alignment name with extension .ps

Close

closes the dot plot window.


To perform a dot plot-guided alignment

Click on a diagonal representing two sequence parts to be aligned, then click on button "align".

Repeat this for each sequence parts you want to be aligned to each other.

To transfer into the multiple alignment the result of these operations, click on button "Record alignment".

Most often, one of the two compared sequences is already aligned to other sequences of the multiple alignment, while the other one is not. Choose which of the two sequences is already aligned by clicking on the arrow next to "Ref. sequence".

Mase file format

Mase files follow the following format:

Zero or more header lines each beginning with ;;

Next, for each sequence in the alignment:

One or more comment lines each beginning with ;

Sequence name alone on a line (may be long and may contain spaces)

Sequence data in free form, possibly with numbers and spaces ignored while reading the file. Dashes denote gaps.

Header lines may contain any text and also contain descriptions of site sets and of species groups when such data have been defined.

Site sets are written as in this example:

;;# of segments=10 mychoice

;; 14,74 221,256 416,449 990,1148 1363,1384 1474,1483 1556,1668 2034,2062

;; 2114,2139 2756,2859

where "mychoice" is the name of the set of sites and where the series of pairs of numbers lists the endpoints of successive block of sites.


Species groups are written as in this example:

;;@ of species = 4 distant outgroup

;; 2, 3, 4, 5

where "distant outgroup" is the name of the species group and where the series of numbers lists the ranks of sequences members of the species group.

>>>Program arguments

#ifdef WIN32

Arguments can be used to associate files with the seaview program.

#endif

Program is run by command

seaview [options] [filename]

where options may be

-save 

[format_name, either mase, clustal, phylip, msf, nexus or fasta sets the initial file format for saving operations and for opening the file given on command line; default is mase]

-fast

[sequences will be displayed faster but less smoothly]

-inverted

[residues will appear as black letters on a colored background]

and filename 

is an optional alignment file to be loaded. The expected format for this file is that of the default format for saving operations.

See `Program customization' topic to know how to further customize the program.

Program customization

Seaview can be customized through several options to be put in a text file called "seaview.ini" located in the same directory as program seaview itself.

Next help items describe all these options.

Each option is specified by a line in this file of the form =

stdcolorgroups  =  AC,DEFGHIK,LMNPQRS,WY,TV
save = clustal
printoutcpl = 90
inverted = True
#else

Seaview can be customized by several options specified as program resources.

Next help items describe all these options.

Each resource is specified by a line of the form Seaview.:

Seaview.dnacolors:		blue,yellow,green,255 0 0
Seaview.altcolorgroups: AC,DEFGHIK,LMNPQRS,WY,TV
Seaview.save: clustal
Seaview.printoutcpl: 90
Seaview.inverted: True

Resource lines are added by any of 4 means:

1) in a file called Seaview in directory /usr/lib/X11/app-defaults

2) in a file called Seaview in any directory pointed to by environment variable XAPPLRESDIR

3) in your $HOME/.Xdefaults file

(note that this is effective only if this file is read by your login procedure)

4) interactively loaded by command

xrdb -merge fname

where "fname" is a file containing resource lines and checked by command

xrdb -query

Miscellaneous options

See 'Program customization' topic for how to set these options.

                       MISCELLANEOUS OPTIONS

Purpose (note)| Name | Default value | Example of possible value
---------------------|-----------|---------------|-----------------------------
saving format (1)| save | mase | clustal
name of help file (2)| helpfile | seaview.hlp | seaview.aide
name of help file (2)| helpfile | seaview.help | seaview.aide
fast, rough display | fast | false | true
colored background | inverted | false | True
keys to move view | movekeys | ][>< | }{.,
allow lowercase | lowercase | false | TRUE
window size | window | 900x550 | 800x400
window size | window | 1000x700 | 800x400
emul. 3-button mouse |emulate_3_b| false | True
(3)| | |

Notes:

1) Possible file formats are mase, clustal, phylip, msf, fasta, nexus.

2) The help file is sought in the directory containing program seaview.

3) The help file is sought in all directories listed in the PATH variable

and in the current directory.

4) if yes, use CTRL-click for middle button & SHIFT-click for right button

Coloring schemes

See 'Program customization' topic for how to set these options.

SETTING GROUPS OF AMINO ACIDS DISPLAYED WITH THE SAME COLOR

By default, amino acids are split in 5 families each displayed in a different
color. These families can be customized, up to a maximum of 10 families. Also, an alternative coloring scheme can be defined, and seaview will allow to switch between the first and the alternative coloring schemes.

Purpose             | Name           | Default value   | Example of value
--------------------|----------------|-----------------|-----------------
standard coloring | stdcolorgroups | EDQNHRKBZ,ILMV, | DEBZ,ACWY,FGH
scheme of aminoacids| | APSGT,FY,WC | IK,LMNPQRSTV
| | |
alternate coloring | altcolorgroups | | ILMV,ACGPST,DEN
scheme of aminoacids| | | QBZ,HKR,FWY

Note: Default colors are red, green, yellow, blue, cyan, magenta, salmon, purple, aquamarine, and dark-gray. These can be changed (see 'Defining colors').

White is used for gaps and for unlisted residues.

The above 2 options allow to set which aminoacids will be painted in what of the above colors. The format for these is, e.g., EDQNHRKBZ,ILMV,APSGT,FY,WC to specify

aminoacids | EDQNHRKBZ | ILMV  |  APSGT |  FY  |   WC   | others + gaps |
color | red | green | yellow | blue | cyan | white |

The number of listed groups of aminoacids determines the number of colors used.

Defining colors

See 'Program customization' topic for how to set these options.

                 CHANGING COLORS USED TO DISPLAY SEQUENCES

Purpose | Name | Default value | Example of value
----------------|------------|----------------------------|---------------------
set DNA colors | dnacolors | red,green,yellow,blue | 100 0 0,0 100 0,0 0
| | | 100,100 100 0
| | |
set prot. colors| protcolors | red,green,yellow,blue, | 100 0 0,0 100 0,
| | cyan,magenta,salmon,purple,| 0 0 100,100 100 0,
| | aquamarine,dark-gray | 0 100 100,100 0 100
set DNA colors | dnacolors | red,green,yellow,blue | blue,yellow,255 0 0,
| | | green
| | |
set prot. colors| protcolors | red,green,yellow,blue, | salmon,154 255 154,
| | cyan,magenta,salmon,purple,| lightyellow,lavender
| | aquamarine,dark-gray |

Notes:

Colors are defined by 3 RGB numbers in range 0-255.

Colors are defined either by a name from your rgb.txt file or by 3 RGB numbers in range 0-255.

For DNA/RNA, the 4 listed colors apply to A, C, G, and T/U, respectively.

For proteins, at most 10 colors can be listed.

Printout options

See 'Program customization' topic for how to set these options.

                     CONTROLING PRINTOUT OPERATION

Purpose | Name | Default value | Example of possible value
--------------|---------------|---------------|--------------------------
block size | printoutblock | 10 | 3
chars/line | printoutcpl | 80 | 90
lines/page | printoutlpp | 66 | 60

Note: the value of printoutcpl stands for the full page width.