Sciences and methods > Biosciences > Programs > EMBOSS
 
Tehdyt toimenpiteet

EMBOSS at CSC

General sequence analysis package EMBOSS.

Version


Version 6.3.1 is installed in hippu.csc.fi. ( ChangeLog for EMBOSS 6.3.1)

Version 6.3.1 is installed in vuori.csc.fi

Version 6.3.1 is installed in murska.csc.fi.

Description

EMBOSS (European Molecular Biology Open Software Suite) package contains over 200 programs for sequence analysis. Examples of application areas of EMBOSS tools are given below.

  • Sequence alignment
  • Phylogeny
  • Hidden Markow models
  • Rapid database searching with sequence patterns
  • Protein motif identification, including domain analysis
  • EST analysis
  • Nucleotide sequence pattern analysis, for example to identify CpG islands.
  • Simple and species-specific repeat identification
  • Codon usage analysis for small genomes
  • Rapid identification of sequence patterns in large scale sequence sets.
  • Presentation tools for publication
  • RNA secondary structure prediction


Usage


Graphical EMBOSS interfaces

Most EMBOSS progams can be used via the Embster server. To use Embster you need a user account at CSC.


    EMBOSS in Hippu

    To make EMBOSS programs available in Hippu, give command:

    module load emboss

    After this you can start any of the EMBOSS programs by typing its name. For example:

    wossname

    If you can open an X-term connection from  hippu.csc.fi  to your local machine, you can use the graphical Jemboss user interface. In addition to the EMBOSS programs, Jemboss includes Jalview and Jemboss sequence alignment editorts and Jemboss DNA Editor. You can start Jemboss with commands:

    module load emboss
    jemboss


    EMBOSS in Vuori and Murska

    To make EMBOSS programs available in Vuori or Murska, give command:

    module load emboss

    After this you can start any of the EMBOSS programs by typing its name. For example:

    wossname


    EMBOSS databases in Hippu

    The databases listed bellow are indexed for EMBOSS in Hippu. This means that in Hippu and Scientist's Interface you can access sequences from these databases using the USA sequence address: database:access_code. For example:

      infoseq swiss:CASA1_HUMAN

    Name database source files
    Nucleotides
    embl EMBL database excluding WGS divisions EMBL dat files
    emblwgs WGS divisions of EMBL database EMBL wgs files
    emblnew EMBL updates EMBL cum files



    Proteins
    pdb_seq PDB protein structure database pdb_seqres.txt
    swiss Uniprot/Swiss database uniprot_sprot.fasta.gz
    trembl Uniprot/TrEMBL database uniprot_trembl.fasta.gz
    uniref100
    Uniref100 database uniref100.fasta.gz
    uniref90
    Uniref90 database uniref90.fasta.gz
    uniref50 Uniref50 database uniref50.fasta.gz



     Genome databases


     ensembl  gene and protein sequences in Ensembl database
     Ensembl MySQL server




    Help

    Tutorial

    Korpelainen Eija +358 9 457 2030 Eija.Korpelainen at csc.fi
    Mattila Kimmo +358 9 457 2708 Kimmo.Mattila at csc.fi
    Saren Ari-Matti +358 9 457 2282 Ari-Matti.Saren at csc.fi