CSC keeps up-to-date copies of the following databases on its servers. The databases contain sequences, protein motifs, 3D structures etc, and they are directly linked to different programs at CSC. For example, sequence similarity searches with BLAST, sequence analysis with EMBOSS, or molecular modeling with Discovery Studio. For more detailed description of database content and usage, please click on the database name below.
Sequence databases
- EMBL: nucleotide sequences.
- Ensembl: Genome databases.
- RefSeq: nucleotide sequences (mRNA and human genomic contigs).
- nt: nucleotide sequences, composite of RefSeq, PDB and EMBL/GenBank/DDBJ (no EST,HTG,GSS,STS).
- UniProt: protein sequences.
- nr: protein sequences, composite of SwissProt, GenPept, PIR, PRF and PDB.
- PairsDB: a database of BLAST and PSI-BLAST results for protein sequences W
Protein motif databases
- PROSITE: patterns and profiles for protein families, domains and sites.
- PRINTS: fingerprints for protein families and domains.
- Pfam: HMMs for protein families and domains.
Structure databases
- PDB: 3D structures of proteins, nucleic acids and carbohydrates.