SBC logo

Supplementary material

Multi-domain proteins in the three kingdoms of life - Orphan domains and other unassigned regions

Ekman D, Björklund Å, Frey-Skött J and Elofsson A, JMB 2005....

Domain assignments to 21 complete genomes used in "Multi-domain proteins in the three kingdoms of life - Orphan domains and other unassigned regions" by Ekman et. al.

Two files for each genome is provided, one with Pfam-A and Pfam-B domain assignments named "genome.pfam.dat" and one with SCOP and MAS assignments named "genome.scop_mas.dat". Both files also contain the sequences with predicted secondary structure. All assigned are described in methods section of the article, Pfam version 12 and SCOP version 1.63 were used. For more details about the files see the README file. All files can be downloaded as a zip archive in "assignments.tar.gz"

README

assignments.tar.gz

EUKARYOTA

Homo_sapiens.pfam.dat.gz
Mus_musculus.pfam.dat.gz
Arabidopsis_thaliana.pfam.dat.gz
Caenorhabditis_elegans.pfam.dat.gz
Drosophila_melanogaster.pfam.dat.gz
Saccharomyces_cerevisiae.pfam.dat.gz
Schizosaccharomyces_pombe.pfam.dat.gz
  Homo_sapiens.scop_mas.dat.gz
Mus_musculus.scop_mas.dat.gz
Arabidopsis_thaliana.scop_mas.dat.gz
Caenorhabditis_elegans.scop_mas.dat.gz
Drosophila_melanogaster.scop_mas.dat.gz
Saccharomyces_cerevisiae.scop_mas.dat.gz
Schizosaccharomyces_pombe.scop_mas.dat.gz

BACTERIA

Escherichia_coli.pfam.dat.gz
Bacillus_subtilis.pfam.dat.gz
Mycoplasma_pulmonis.pfam.dat.gz
Treponema_pallidum.pfam.dat.gz
Prochlorococcus_marinus.pfam.dat.gz
Pseudomonas_aeruginosa.pfam.dat.gz
Rickettsia_conorii.pfam.dat.gz
  Escherichia_coli.scop_mas.dat.gz
Bacillus_subtilis.scop_mas.dat.gz
Mycoplasma_pulmonis.scop_mas.dat.gz
Treponema_pallidum.scop_mas.dat.gz
Prochlorococcus_marinus.scop_mas.dat.gz
Pseudomonas_aeruginosa.scop_mas.dat.gz
Rickettsia_conorii.scop_mas.dat.gz

ARCHAEA

Aeropyrum_pernix.pfam.dat.gz
Nanoarchaeum_equitans.pfam.dat.gz
Methanococcus_jannaschii.pfam.dat.gz
Pyrococcus_abyssi.pfam.dat.gz
Thermoplasma_volcanium.pfam.dat.gz
Archaeoglobus_fulgidus.pfam.dat.gz
Methanosarcina_mazei.pfam.dat.gz
  Aeropyrum_pernix.scop_mas.dat.gz
Nanoarchaeum_equitans.scop_mas.dat.gz
Methanococcus_jannaschii.scop_mas.dat.gz
Pyrococcus_abyssi.scop_mas.dat.gz
Thermoplasma_volcanium.scop_mas.dat.gz
Archaeoglobus_fulgidus.scop_mas.dat.gz
Methanosarcina_mazei.scop_mas.dat.gz