Sciences and methods > Biosciences > Programs > SHRiMP
 
Tehdyt toimenpiteet

SHRiMP at CSC

Version

SHRiMP 2.2.2 is available in Hippu ans Vuori and as a grid implementation.

Description

SHRiMP is a software package for mapping reads from a donor genome against a target (reference) genome. SHRiMP was primarily developed to work with short reads produced by Next Generation Sequencing (NGS) machines. SHRiMP sopports both letter space (Illumina/Solexa) and colour space (AB SOLiD) reads. Both single-end and paired end reads can be analysed.


SHRiMP is considered to be more sensitive but significantly slower than Bowtie or BWA read mapping tools.


Usage


To initialize the program you need to give command:

module load shripm

After this you could send a shrimp alignment job with command gmapper-ls (for letter space reads) or gmapper-cs ( for colour space reads).  For example

gmapper-ls reads.fasta genome.fasta > result.sam

 You can use optio --help to see all the command line options of the gmapper commands.

gmapper-ls --help

By default SHRiMP assumes that both the query reads to be mapped and the genome sequence files are in fasta format. Fastq formatted read files can be used with option -Q.  Please note that your $HOME directory is often too small for working with complete genomes. In stead you should do the analysis in temporary directories like $WRKDIR, $METAWRK or $FCWRKDIR.


Running SHRiMP alignments utilizing grid computing

Aligning millions of reads to a large reference genome can take several hours or even days. Using grid computing through grid_gmapper_cs and  grid_gmapper_ls commands you can speed up the alignment process ten fold or more. grid_gmapper commands split the alignment task into several subtasks that it submits to be simultaneously executed in the FGI grid environment. When all the subtasks are ready they are collected and combined into a single result alignment.

To be able to use grid_ gmapper commands you should have:

  1.  A valid grid certificate installed in the hippu.csc.fi server.
  2.  Membership of FGI Virtual Organization


For detailed instructions, see the document below:


Once you have the certificate installed and the Virtual organization membership is approved,  you can submit a SHRiMP jobs to grid with for example following commands:

module load shrimp 
module load nordugrid-arc
grid-proxy-init -valid 72:00
grid_gmapper-ls -query seq_set-fasta -ref ref_genome.fasta -out results.sam
For more information about grid_gmapper-ls and grid_gmapper-cs, run command
grid_gmapper-ls --help
or
grid_gmapper-cs --help


More information

More information about SHRiMP can be found from: