Home
Compare
Find functional equivalents
Help
Download

Help



1. What is FACT?

2. Which features will be annotated?

3. How does the scoring work?

4. Example output

5. How can I cite FACT

6. Data sources


What is FACT?
FACT compares the architecture of features such as functional domains, secondary structure motifs and compositional properties between pairs of proteins. A feature dotplot (FDP) allows for a rapid and intuitive assessment to what extent two proteins agree in their feature architecture, and thus may share a similar function. An automated scoring routine complements the FDP and is used to search entire proteomes for proteins with potentially similar function to a given query sequence.

Which features will be annotated?
We annotate the features listed below. Unless otherwise noted, the programs used for feature prediction are embedded into the SFINX package (Sonnhammer and Wootton (2001) Protein: Structure, Function, and Genetics 45: 262-273).



How does the similarity scoring work?
FACT calculates the similarity between a query protein and every protein from the chosen search proteome. The similarity is based on the feature architectures of the proteins. Four such scoring schemata are implemented and the scores will be calculated simultanously. Details about the FACT, MS_uni and MS_st scoring will follow soon. The Lib score is a modified version from Lin et. al.(2006 Bioinformatics 22(17):2081-2086).

Example output
To illustrate FACT we have used the human glutathione S-transferase to search for a functionally equivalent protein in the proteome of yeast Saccharomyces cerevisiae. For the query protein and all proteins in the search-proteome the FACT, MS_uni, MS_st and Lin scores will be computed. The sequences from the search-proteome are ranked according to their score. From the resulting list, any pair-wise comparison can be extracted and displayed in the feature dotplot. Finally, a histograms of all FACT scores are displayed.

Click the links below to view the example output.

If you want to performe the search yourself go to FACT search page and enter the following sequence in FASTA format into the textbox:

>HUMAN_glutathione_S_transferase
RWSFAAAVFATMPPYTVVYFPVRGRCAALRMLLADQGQSWKEEVVTVETWQEGSLKAS
CLYGQLPKFQDGDLTLYQSNTILRHLGRTLGLYGKDQQEAALVDMVNDGVEDLRCKYI
SLIYTNYEAGKDDYVKALPGQLKPFETLLSQNQGGKTFIVGDQISFADYNLLDLLLIH
EVLAPGCLDAFPLLSAYVGRLSARPKLKAFLASPEYVNLPINGNGKQ


Choose Saccharomyces cerevisiae from the menu and click the "run scoring" button.

Another example is to search for a functional equivalent to the human GolgA5 protein in Trypanosoma brucei. The individual feature architecture similarity scores and BLAST show different T. brucei proteins as top hit. Comparing the different query/top-hit pairs with the feature dotplot gives further insights which T. brucei protein is most likely a functional equivalent to the human GolgA5.

>Human_GolgA5_ENSP00000163416
MSWFVDLAGKAEDLLNRVDQGAATALSRKDNASNIYSKNTDYTELHQQNTDLIYQTGPKSTYISSAADNIRNQKATILAG
TANVKVGSRTPVEASHPVENASVPRPSSHFVRRKKSEPDDELLFDFLNSSQKEPTGRVEIRKEKGKTPVFQSSQTSSVSS
VNPSVTTIKTIEENSFGSQTHEAASNSDSSHEGQEESSKENVSSNAACPDHTPTPNDDGKSHELSNLRLENQLLRNEVQS
LNQEMASLLQRSKETQEELNKARARVEKWNADHSKSDRMTRGLRAQVDDLTEAVAAKDSQLAVLKVRLQEADQLLSTRTE
ALEALQSEKSRIMQDQSEGNSLQNQALQTFQERLHEADATLKREQESYKQMQSEFAARLNKVEMERQNLAEAITLAERKY
SDEKKRVDELQQQVKLYKLNLESSKQELIDYKQKATRILQSKEKLINSLKEGSGFEGLDSSTASSMELEELRHEKEMQRE
EIQKLMGQIHQLRSELQDMEAQQVNEAESAREQLQDLHDQIAGQKASKQELETELERLKQEFHYIEEDLYRTKNTLQSRI
KDRDEEIQKLRNQLTNKTLSNSSQSELENRLHQLTETLIQKQTMLESLSTEKNSLVFQLERLEQQMNSASGSSSNGSSIN
MSGIDNGEGTRLRNVPVLFNDTETNLAGMYGKVRKAASSIDQFSIRLGIFLRRYPIARVFVIIYMALLHLWVMIVLLTYT
PEMHHDQPYGK

FACT output page for the GolgA5 example GolgA5 result.


How can I cite FACT?


T. Koestler, A. von Haeseler, and I. Ebersberger (2010) FACT: Functional annotation transfer between proteins with similar feature architecture. BMC Bioinformatics, 11(1):417 (PMID: 20696036)

Data sources


Protein sequences and the feature annotations can be downloaded here.
Speciessource
Arabidopsis thalianaUNIPROT 1.0http://www.uniprot.org/taxonomy/3702
Batrachochytrium dendrobatidisJGI May08 5.0www.jgi.doe.gov/Batrachochytrium
Branchiostoma floridaeJGI 1.0http://genome.jgi-psf.org/Brafl1/Brafl1.home.html
Caenorhabditis elegansEnsembl 52http://www.ensembl.org/Caenorhabditis_elegans/Info/Index
Ciona intestinalisEnsembl 5http://www.ensembl.org/Ciona_intestinalis/Info/Index
Cryptococcus neoformansuniprot integr8http://www.uniprot.org/taxonomy/5207
Dictyostelium discoideumDictybase Jan2010http://dictybase.org
Drosophila melanogasterEnsembl 52http://www.ensembl.org/Drosophila_melanogaster/Info/Index
Encephalitozoon cuniculiIntegr8 79http://www.ebi.ac.uk/integr8/OrganismSelection.do?action=makeCurrent&proteomeId=79
Gallus gallusEnsembl 52http://www.ensembl.org/Gallus_gallus/Info/Index
Homo sapiensEnsembl 51http://www.ensembl.org/Homo_Sapiens/Info/Index
Homo sapiensEnsembl 51
Lottia giganteaJGI 1.0http://genome.jgi-psf.org/Lotgi1/Lotgi1.home.html
Nematostella vectensisJGI 1.0http://genome.jgi-psf.org/Nemve1/Nemve1.home.html
Ornithorhynchus anatinusEnsembl 52http://www.ensembl.org/Ornithorhynchus_anatinus/Info/Index
Oryzias latipesEnsembl 52http://www.ensembl.org/Oryzias_latipes/Info/Index/
Ostreococcus lucimarinusJGI 2.0http://genome.jgi-psf.org/Ost9901_3/Ost9901_3.home.html
Phycomyces blakesleeanusJGI May08 1.0www.jgi.doe.gov/Phycomyces
Physcomitrella patensJGI 1.1http://genome.jgi-psf.org/physcomitrella/physcomitrella.home.html
Saccharomyces cerevisiaeEnsembl 52http://www.ensembl.org/Saccharomyces_cerevisiae/Info/Index
Schistosoma mansoniSanger Center 4.0ghttp://www.sanger.ac.uk/resources/downloads/helminths/schistosoma-mansoni.html
Strongylocentrotus purpuratusNCBI 2.1http://www.ncbi.nlm.nih.gov/projects/genome/guide/sea_urchin/
Takifugu rubripesEnsembl 52http://www.ensembl.org/Takifugu_rubripes/Info/Index/
Tetrahymena thermophilaciliate.org 2007http://ciliate.org/index.php/home/welcome
Trypanosoma bruceiSanger Tb927http://www.sanger.ac.uk/resources/downloads/protozoa/trypanosoma-brucei.html
Ustilago maydisBroad 1http://www.broadinstitute.org/annotation/genome/ustilago_maydis
Yarrowia lipolyticauniprot integr8http://www.uniprot.org/taxonomy/4952