Subcellular Localization
min:
: max
Winner_takes_all: nucleus
Predictor Summary:
Predictor Summary:
- nucleus 4
- mitochondrion 1
Predictors | GFP | MS/MS | Papers | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
PPI
Inferred distinct locusB in Crop
Inferred from Arabidopsis experimental PPI
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
KRH71872 | Soybean | nucleus | 69.79 | 70.16 |
Solyc03g063220.1.1 | Tomato | nucleus | 54.58 | 69.65 |
KRH38118 | Soybean | nucleus | 69.3 | 69.4 |
CDY08401 | Canola | nucleus | 63.69 | 65.19 |
CDY52874 | Canola | nucleus | 63.84 | 64.95 |
Bra034727.1-P | Field mustard | nucleus | 63.69 | 64.73 |
AT3G12810.1 | Thale cress | nucleus | 64.47 | 64.38 |
OQU85588 | Sorghum | nucleus | 58.38 | 59.1 |
GSMUA_Achr3P26510_001 | Banana | nucleus | 45.27 | 57.1 |
Zm00001d051507_P007 | Maize | nucleus | 57.31 | 57.06 |
Zm00001d017660_P021 | Maize | nucleus | 57.31 | 57.03 |
Os02t0689800-01 | Rice | nucleus | 28.85 | 55.53 |
TraesCS7A01G544700.1 | Wheat | nucleus | 57.7 | 54.92 |
TraesCS7B01G468100.1 | Wheat | nucleus | 57.99 | 54.81 |
TraesCS7D01G531100.1 | Wheat | nucleus | 57.99 | 54.81 |
VIT_04s0023g01610.t01 | Wine grape | cytosol | 11.65 | 31.49 |
VIT_01s0010g02080.t01 | Wine grape | cytosol | 10.77 | 30.36 |
VIT_05s0020g01780.t01 | Wine grape | nucleus | 13.69 | 26.02 |
VIT_05s0020g02960.t01 | Wine grape | nucleus | 13.6 | 25.29 |
VIT_15s0021g02180.t01 | Wine grape | cytosol | 9.8 | 23.51 |
VIT_08s0007g09020.t01 | Wine grape | nucleus | 16.13 | 21.26 |
VIT_04s0008g05880.t01 | Wine grape | nucleus | 14.13 | 19.7 |
VIT_06s0009g03750.t01 | Wine grape | nucleus | 14.91 | 17.36 |
VIT_01s0011g01480.t01 | Wine grape | nucleus | 15.45 | 14.7 |
VIT_15s0046g02290.t01 | Wine grape | plastid | 14.77 | 13.39 |
VIT_05s0020g02020.t01 | Wine grape | nucleus | 12.57 | 9.65 |
Protein Annotations
MapMan:12.4.4.1 | Gene3D:3.40.50.10810 | Gene3D:3.40.50.300 | ProteinID:CCB55255 | ProteinID:CCB55255.1 | ncoils:Coil |
UniProt:F6HLJ9 | EMBL:FN595991 | GO:GO:0000166 | GO:GO:0003674 | GO:GO:0005488 | GO:GO:0005524 |
GO:GO:0005575 | GO:GO:0005618 | GO:GO:0005623 | GO:GO:0006950 | GO:GO:0008150 | GO:GO:0009605 |
GO:GO:0009607 | GO:GO:0016020 | GO:GO:0016021 | GO:GO:0030312 | GO:GO:0042742 | GO:GO:0046686 |
InterPro:HSA_dom | InterPro:Helicase_ATP-bd | InterPro:Helicase_C | InterPro:IPR001650 | InterPro:IPR014001 | InterPro:IPR014012 |
InterPro:IPR017877 | InterPro:IPR038718 | InterPro:Myb-like_dom | InterPro:P-loop_NTPase | PFAM:PF00176 | PFAM:PF00271 |
PFAM:PF07529 | PFscan:PS50090 | PFscan:PS51192 | PFscan:PS51194 | PFscan:PS51204 | PANTHER:PTHR10799 |
PANTHER:PTHR10799:SF935 | SMART:SM00487 | SMART:SM00490 | SMART:SM00573 | InterPro:SNF2-like_sf | InterPro:SNF2_N |
SUPFAM:SSF52540 | UniParc:UPI00021087FB | ArrayExpress:VIT_08s0007g06370 | EnsemblPlantsGene:VIT_08s0007g06370 | EnsemblPlants:VIT_08s0007g06370.t01 | SEG:seg |
Description
No Description!
Coordinates
chr8:+:20149390..20170671
Molecular Weight (calculated)
234332.0 Da
IEP (calculated)
5.087
GRAVY (calculated)
-0.615
Length
2052 amino acids
Sequence
(BLAST)
(BLAST)
0001: MASKGPRSKL DHETRARRQK ALEAPREPRR PKTHWDHVLD EMVWLSKDFE SERKWKLAQA KKVALRASKG MLDQATRGEK RVKEEEQRLR KVALTISKDV
0101: KKFWIKIEKL VLYKHQMELD EKKKKALDKQ LEFLLGQTER YSTMLAENLA DTYQPTQQYL PKERCSIQYK EVDDPGFKEV PQSGIADVDE DYDMQSEEEL
0201: EDDEHTIEED EALITEEERQ EELEALHNEI DLPLEELLKR YAMKKVLSVS SGSSQDKDEE EAEPTSVGDD HFGGEGQDLS DTCKIDKNSS LTVIGRRCGE
0301: SNGSLSISEH HLLEVDTCQA KNVSEISRES DEESKVYDFN DEQEDGDFVL ATGEEKDDET TLLEEEELAK EESNDPIDEI ALLQKESEIP LEELLARYKK
0401: DADEDVEDDS DYASASEDFL DSPAHQDTEL NQQPGCVDDD DDEPGGRQPF VQSVTEEHAE GSEKQSDEAR ESENRIADAA AAARSAQPTG NTFSTTKVRT
0501: KFPFLLKHSL REYQHIGLDW LVTMYEKRLN GILADEMGLG KTIMTIALLA HLACEKGIWG PHLIVVPTSV MLNWETEFLK WCPAFKILTY FGSAKERKFK
0601: RQGWLKPNSF HVCITTYRLV IQDSKVFKRK KWKYLILDEA HLIKNWKSQR WQTLLNFNSK RRILLTGTPL QNDLMELWSL MHFLMPHIFQ SHQEFKDWFC
0701: NPISGMVEGQ EKVNKEVIDR LHNVLRPFLL RRLKRDVEKQ LPMKFEHVIY CRLSKRQRNL YEDFIASSET QATLASANFF GMISVIMQLR KVCNHPDLFE
0801: GRPIVSSFDM GGIDIQLSSS VCSMLSPGPF STVDLRDLGF LFTHLDFSMA SWESDEVQAI ATPTSLIKGR ADPDNLAEIG FGFKHQRKSQ GTNIFEEIRK
0901: AILEVRLTEA KERAASIAWW NSLRCRKKPM YSTTLRDLVT VKHPVHDIHR QKSDRLSYMY SSKLADIVLS PVELFKRMIG QVECFMFAIP AARAPTPVCW
1001: CSKTNHSVFL QPTYKEKCTE TLSPLLSPIR PAIVRRQVYF PDRRLIQFDC GKLQELAVLL RKLKSEGHRA LIFTQMTKML DVLEAFINLY GYTYMRLDGS
1101: TQPEERQTLM QRFNTNPKIF IFILSTRSGG VGINLVGADT VIFYDSDWNP AMDQQAQDRC HRIGQTREVH IYRLISESTI EENILKKANQ KRALDDLVIQ
1201: SGGYNTEFFK KLDPMELFSG HRALPNKNMQ KEKNHNIGIE GSVSVADVEA ALKYAEDEAD YMALKKVEQE EAVENQEFTE DAIGRVEDDE LVNEDDMKPD
1301: EAVEQVGCTT SSKDSGLMLI GSDPNEERAL TFAGKEDDVD MLADVKQMAA AAAAAGQAIS SFESQLRPID RYAIRFLELW DPIIDKAAME SQATFEEAEW
1401: ELDRIEKFKE DMEAEIDNDE EPFVYERWDS DFATEAYRQQ VEALAQHQLM EELECEAKEK DDADDENNGS TRNDMASDPK PKSKKKPKKA KFKSLKKGSL
1501: ASDSKAVKEE PLMEPMSIDD EDIFHGMVTF SDMMSSHSSM QKKRKKAEAT ADGEEDRIMK KRSKKFKKAP EIGPLSFETN LSNKQHDESK ESNPCESAVV
1601: DLELKSASRG KMGGKISITV MPVKRILMIK PEKLKKGNIW SRDCVPSPDF WFPQEDAVLC AVVHEYGPHW SLVSETLYGM TAGGFYRGRY RHPVHCCERF
1701: RELVQRYVLS APENPNNEKV SNTGSGKALL KVTEDNIRML LDVAIDLPDS ELLLQKHFTA LLTSVWRMTS RVHHRQNHLP YRNGQYSTGR FFSSTVNQIS
1801: WNSVREPTER TNWNNFGYSS SRLVAAALHD ANNKQHDDSA FLSNRREEVS TVPEQLEIRL EIERDFCDSM IPLPSVINLS ILGSEPPSAV NNPIEESQIL
1901: KSSQDMAENR FRAASRACFD GTLDWASSAF PTSDIKPRSA IKSHSLGKHK ICTSDSIRPS KSKFKKVAVE PSEMHHLILS PLPKPTVAFN DSNPRFDLGS
2001: PVSLDAGIST PSFNEELCWE PESLELFSHH YSPNLISDLD DFSLLPEYID IG
0101: KKFWIKIEKL VLYKHQMELD EKKKKALDKQ LEFLLGQTER YSTMLAENLA DTYQPTQQYL PKERCSIQYK EVDDPGFKEV PQSGIADVDE DYDMQSEEEL
0201: EDDEHTIEED EALITEEERQ EELEALHNEI DLPLEELLKR YAMKKVLSVS SGSSQDKDEE EAEPTSVGDD HFGGEGQDLS DTCKIDKNSS LTVIGRRCGE
0301: SNGSLSISEH HLLEVDTCQA KNVSEISRES DEESKVYDFN DEQEDGDFVL ATGEEKDDET TLLEEEELAK EESNDPIDEI ALLQKESEIP LEELLARYKK
0401: DADEDVEDDS DYASASEDFL DSPAHQDTEL NQQPGCVDDD DDEPGGRQPF VQSVTEEHAE GSEKQSDEAR ESENRIADAA AAARSAQPTG NTFSTTKVRT
0501: KFPFLLKHSL REYQHIGLDW LVTMYEKRLN GILADEMGLG KTIMTIALLA HLACEKGIWG PHLIVVPTSV MLNWETEFLK WCPAFKILTY FGSAKERKFK
0601: RQGWLKPNSF HVCITTYRLV IQDSKVFKRK KWKYLILDEA HLIKNWKSQR WQTLLNFNSK RRILLTGTPL QNDLMELWSL MHFLMPHIFQ SHQEFKDWFC
0701: NPISGMVEGQ EKVNKEVIDR LHNVLRPFLL RRLKRDVEKQ LPMKFEHVIY CRLSKRQRNL YEDFIASSET QATLASANFF GMISVIMQLR KVCNHPDLFE
0801: GRPIVSSFDM GGIDIQLSSS VCSMLSPGPF STVDLRDLGF LFTHLDFSMA SWESDEVQAI ATPTSLIKGR ADPDNLAEIG FGFKHQRKSQ GTNIFEEIRK
0901: AILEVRLTEA KERAASIAWW NSLRCRKKPM YSTTLRDLVT VKHPVHDIHR QKSDRLSYMY SSKLADIVLS PVELFKRMIG QVECFMFAIP AARAPTPVCW
1001: CSKTNHSVFL QPTYKEKCTE TLSPLLSPIR PAIVRRQVYF PDRRLIQFDC GKLQELAVLL RKLKSEGHRA LIFTQMTKML DVLEAFINLY GYTYMRLDGS
1101: TQPEERQTLM QRFNTNPKIF IFILSTRSGG VGINLVGADT VIFYDSDWNP AMDQQAQDRC HRIGQTREVH IYRLISESTI EENILKKANQ KRALDDLVIQ
1201: SGGYNTEFFK KLDPMELFSG HRALPNKNMQ KEKNHNIGIE GSVSVADVEA ALKYAEDEAD YMALKKVEQE EAVENQEFTE DAIGRVEDDE LVNEDDMKPD
1301: EAVEQVGCTT SSKDSGLMLI GSDPNEERAL TFAGKEDDVD MLADVKQMAA AAAAAGQAIS SFESQLRPID RYAIRFLELW DPIIDKAAME SQATFEEAEW
1401: ELDRIEKFKE DMEAEIDNDE EPFVYERWDS DFATEAYRQQ VEALAQHQLM EELECEAKEK DDADDENNGS TRNDMASDPK PKSKKKPKKA KFKSLKKGSL
1501: ASDSKAVKEE PLMEPMSIDD EDIFHGMVTF SDMMSSHSSM QKKRKKAEAT ADGEEDRIMK KRSKKFKKAP EIGPLSFETN LSNKQHDESK ESNPCESAVV
1601: DLELKSASRG KMGGKISITV MPVKRILMIK PEKLKKGNIW SRDCVPSPDF WFPQEDAVLC AVVHEYGPHW SLVSETLYGM TAGGFYRGRY RHPVHCCERF
1701: RELVQRYVLS APENPNNEKV SNTGSGKALL KVTEDNIRML LDVAIDLPDS ELLLQKHFTA LLTSVWRMTS RVHHRQNHLP YRNGQYSTGR FFSSTVNQIS
1801: WNSVREPTER TNWNNFGYSS SRLVAAALHD ANNKQHDDSA FLSNRREEVS TVPEQLEIRL EIERDFCDSM IPLPSVINLS ILGSEPPSAV NNPIEESQIL
1901: KSSQDMAENR FRAASRACFD GTLDWASSAF PTSDIKPRSA IKSHSLGKHK ICTSDSIRPS KSKFKKVAVE PSEMHHLILS PLPKPTVAFN DSNPRFDLGS
2001: PVSLDAGIST PSFNEELCWE PESLELFSHH YSPNLISDLD DFSLLPEYID IG
0001: MASKGGKSKP DIVMASKSGK SKPDNESRAK RQKTLEAPKE PRRPKTHWDH VLEEMAWLSK DFESERKWKL AQAKKVALRA SKGMLDQASR EERKLKEEEQ
0101: RLRKVALNIS KDMKKFWMKV EKLVLYKHQL VRNEKKKKAM DKQLEFLLGQ TERYSTMLAE NLVEPYKQGQ NTPSKPLLTI ESKSDEERAE QIPPEINSSA
0201: GLESGSPELD EDYDLKSEDE TEDDEDTIEE DEKHFTKRER QEELEALQNE VDLPVEELLR RYTSGRVSRE TSPVKDENED NLTSVSRVTS PVKDENQDNL
0301: ASVGQDHGED KNNLAASEET EGNPSVRRSN DSYGHLAISE THSHDLEPGM TTASVKSRKE DHTYDFNDEQ EDVDFVLANG EEKDDEATLA VEEELAKADN
0401: EDHVEEIALL QKESEMPIEV LLARYKEDFG GKDISEDESE SSFAVSEDSI VDSDENRQQA DLDDDNVDLT ECKLDPEPCS ENVEGTFHEV AEDNDKDSSD
0501: KIADAAAAAR SAQPTGFTYS TTKVRTKLPF LLKHSLREYQ HIGLDWLVTM YEKKLNGILA DEMGLGKTIM TIALLAHLAC DKGIWGPHLI VVPTSVMLNW
0601: ETEFLKWCPA FKILTYFGSA KERKLKRQGW MKLNSFHVCI TTYRLVIQDS KMFKRKKWKY LILDEAHLIK NWKSQRWQTL LNFNSKRRIL LTGTPLQNDL
0701: MELWSLMHFL MPHVFQSHQE FKDWFCNPIA GMVEGQEKIN KEVIDRLHNV LRPFLLRRLK RDVEKQLPSK HEHVIFCRLS KRQRNLYEDF IASTETQATL
0801: TSGSFFGMIS IIMQLRKVCN HPDLFEGRPI VSSFDMAGID VQLSSTICSL LLESPFSKVD LEALGFLFTH LDFSMTSWEG DEIKAISTPS ELIKQRVNLK
0901: DDLEAIPLSP KNRKNLQGTN IFEEIRKAVF EERIQESKDR AAAIAWWNSL RCQRKPTYST SLRTLLTIKG PLDDLKANCS SYMYSSILAD IVLSPIERFQ
1001: KMIELVEAFT FAIPAARVPS PTCWCSKSDS PVFLSPSYKE KVTDLLSPLL SPIRPAIVRR QVYFPDRRLI QFDCGKLQEL AMLLRKLKFG GHRALIFTQM
1101: TKMLDVLEAF INLYGYTYMR LDGSTPPEER QTLMQRFNTN PKIFLFILST RSGGVGINLV GADTVIFYDS DWNPAMDQQA QDRCHRIGQT REVHIYRLIS
1201: ESTIEENILK KANQKRVLDN LVIQNGEYNT EFFKKLDPME LFSGHKALTT KDEKETSKHC GADIPLSNAD VEAALKQAED EADYMALKRV EQEEAVDNQE
1301: FTEEPVERPE DDELVNEDDI KADEPADQGL VAAGPAKEEM SLLHSDIRDE RAVITTSSQE DDTDVLDDVK QMAAAAADAG QAISSFENQL RPIDRYAIRF
1401: LELWDPIIVE AAMENEAGFE EKEWELDHIE KYKEEMEAEI DDGEEPLVYE KWDADFATEA YRQQVEVLAQ HQLMEDLENE AREREAAEVA EMVLTQNESA
1501: HVLKPKKKKK AKKAKYKSLK KGSLAAESKH VKSVVKIEDS TDDDNEEFGY VSSSDSDMVT PLSRMHMKGK KRDLIVDTDE EKTSKKKAKK HKKSLPNSDI
1601: KYKQTSALLD ELEPSKPSDS MVVDNELKLT NRGKTVGKKF ITSMPIKRVL MIKPEKLKKG NLWSRDCVPS PDSWLPQEDA ILCAMVHEYG PNWNFVSGTL
1701: YGMTAGGAYR GRYRHPAYCC ERYRELIQRH ILSASDSAVN EKNLNTGSGK ALLKVTEENI RTLLNVAAEQ PDTEMLLQKH FSCLLSSIWR TSTRTGNDQM
1801: LSLNSPIFNR QFMGSVNHTQ DLARKPWQGM KVTSLSRKLL ESALQDSGPS QPDNTISRSR LQETQPINKL GLELTLEFPR GNDDSLNQFP PMISLSIDGS
1901: DSLNYVNEPP GEDVLKGSRV AAENRYRNAA NACIEDSFGW ASNTFPANDL KSRTGTKAQS LGKHKLSASD SAKSTKSKHR KLLAEQLEGA WVRPNDPNLK
2001: FDFTPGDREE EEEQEVDEKA NSAEIEMISC SQWYDPFFTS GLDDCSLASD ISEIE
0101: RLRKVALNIS KDMKKFWMKV EKLVLYKHQL VRNEKKKKAM DKQLEFLLGQ TERYSTMLAE NLVEPYKQGQ NTPSKPLLTI ESKSDEERAE QIPPEINSSA
0201: GLESGSPELD EDYDLKSEDE TEDDEDTIEE DEKHFTKRER QEELEALQNE VDLPVEELLR RYTSGRVSRE TSPVKDENED NLTSVSRVTS PVKDENQDNL
0301: ASVGQDHGED KNNLAASEET EGNPSVRRSN DSYGHLAISE THSHDLEPGM TTASVKSRKE DHTYDFNDEQ EDVDFVLANG EEKDDEATLA VEEELAKADN
0401: EDHVEEIALL QKESEMPIEV LLARYKEDFG GKDISEDESE SSFAVSEDSI VDSDENRQQA DLDDDNVDLT ECKLDPEPCS ENVEGTFHEV AEDNDKDSSD
0501: KIADAAAAAR SAQPTGFTYS TTKVRTKLPF LLKHSLREYQ HIGLDWLVTM YEKKLNGILA DEMGLGKTIM TIALLAHLAC DKGIWGPHLI VVPTSVMLNW
0601: ETEFLKWCPA FKILTYFGSA KERKLKRQGW MKLNSFHVCI TTYRLVIQDS KMFKRKKWKY LILDEAHLIK NWKSQRWQTL LNFNSKRRIL LTGTPLQNDL
0701: MELWSLMHFL MPHVFQSHQE FKDWFCNPIA GMVEGQEKIN KEVIDRLHNV LRPFLLRRLK RDVEKQLPSK HEHVIFCRLS KRQRNLYEDF IASTETQATL
0801: TSGSFFGMIS IIMQLRKVCN HPDLFEGRPI VSSFDMAGID VQLSSTICSL LLESPFSKVD LEALGFLFTH LDFSMTSWEG DEIKAISTPS ELIKQRVNLK
0901: DDLEAIPLSP KNRKNLQGTN IFEEIRKAVF EERIQESKDR AAAIAWWNSL RCQRKPTYST SLRTLLTIKG PLDDLKANCS SYMYSSILAD IVLSPIERFQ
1001: KMIELVEAFT FAIPAARVPS PTCWCSKSDS PVFLSPSYKE KVTDLLSPLL SPIRPAIVRR QVYFPDRRLI QFDCGKLQEL AMLLRKLKFG GHRALIFTQM
1101: TKMLDVLEAF INLYGYTYMR LDGSTPPEER QTLMQRFNTN PKIFLFILST RSGGVGINLV GADTVIFYDS DWNPAMDQQA QDRCHRIGQT REVHIYRLIS
1201: ESTIEENILK KANQKRVLDN LVIQNGEYNT EFFKKLDPME LFSGHKALTT KDEKETSKHC GADIPLSNAD VEAALKQAED EADYMALKRV EQEEAVDNQE
1301: FTEEPVERPE DDELVNEDDI KADEPADQGL VAAGPAKEEM SLLHSDIRDE RAVITTSSQE DDTDVLDDVK QMAAAAADAG QAISSFENQL RPIDRYAIRF
1401: LELWDPIIVE AAMENEAGFE EKEWELDHIE KYKEEMEAEI DDGEEPLVYE KWDADFATEA YRQQVEVLAQ HQLMEDLENE AREREAAEVA EMVLTQNESA
1501: HVLKPKKKKK AKKAKYKSLK KGSLAAESKH VKSVVKIEDS TDDDNEEFGY VSSSDSDMVT PLSRMHMKGK KRDLIVDTDE EKTSKKKAKK HKKSLPNSDI
1601: KYKQTSALLD ELEPSKPSDS MVVDNELKLT NRGKTVGKKF ITSMPIKRVL MIKPEKLKKG NLWSRDCVPS PDSWLPQEDA ILCAMVHEYG PNWNFVSGTL
1701: YGMTAGGAYR GRYRHPAYCC ERYRELIQRH ILSASDSAVN EKNLNTGSGK ALLKVTEENI RTLLNVAAEQ PDTEMLLQKH FSCLLSSIWR TSTRTGNDQM
1801: LSLNSPIFNR QFMGSVNHTQ DLARKPWQGM KVTSLSRKLL ESALQDSGPS QPDNTISRSR LQETQPINKL GLELTLEFPR GNDDSLNQFP PMISLSIDGS
1901: DSLNYVNEPP GEDVLKGSRV AAENRYRNAA NACIEDSFGW ASNTFPANDL KSRTGTKAQS LGKHKLSASD SAKSTKSKHR KLLAEQLEGA WVRPNDPNLK
2001: FDFTPGDREE EEEQEVDEKA NSAEIEMISC SQWYDPFFTS GLDDCSLASD ISEIE
Arabidopsis Description
PIE1Protein PHOTOPERIOD-INDEPENDENT EARLY FLOWERING 1 [Source:UniProtKB/Swiss-Prot;Acc:Q7X9V2]
SUBAcon: [nucleus]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.