Subcellular Localization
min:
: max
Winner_takes_all: nucleus
Predictor Summary:
Predictor Summary:
- nucleus 5
- cytosol 1
- mitochondrion 1
Predictors | GFP | MS/MS | Papers | ||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
CDY66146 | Canola | cytosol | 23.36 | 30.48 |
VIT_14s0068g01650.t01 | Wine grape | nucleus | 42.07 | 23.9 |
PGSC0003DMT400006082 | Potato | cytosol | 18.29 | 23.7 |
PGSC0003DMT400028024 | Potato | nucleus | 38.58 | 23.64 |
Solyc05g015920.2.1 | Tomato | nucleus | 38.69 | 23.45 |
Solyc05g015930.2.1 | Tomato | cytosol | 37.95 | 23.25 |
AT1G27430.1 | Thale cress | cytosol | 36.58 | 23.19 |
AT1G24300.1 | Thale cress | cytosol | 36.47 | 23.08 |
Bra010950.1-P | Field mustard | cytosol | 25.16 | 18.8 |
Bra010951.1-P | Field mustard | cytosol | 19.77 | 17.66 |
CDY11817 | Canola | cytosol | 21.04 | 17.43 |
CDY11819 | Canola | cytosol | 19.56 | 16.11 |
VIT_18s0001g12350.t01 | Wine grape | cytosol | 22.09 | 11.38 |
CDY66147 | Canola | nucleus | 1.27 | 7.1 |
Protein Annotations
EntrezGene:100254102 | wikigene:100254102 | Gene3D:3.30.1490.40 | MapMan:35.2 | ProteinID:CBI38156 | ProteinID:CBI38156.3 |
ncoils:Coil | UniProt:D7U5Y7 | EMBL:FN596513 | GO:GO:0003674 | GO:GO:0005488 | GO:GO:0005515 |
InterPro:GYF | InterPro:GYF-like_dom_sf | InterPro:IPR003169 | InterPro:IPR035445 | EntrezGene:LOC100254102 | wikigene:LOC100254102 |
PFAM:PF02213 | PFscan:PS50829 | PANTHER:PTHR14445 | PANTHER:PTHR14445:SF41 | SMART:SM00444 | SUPFAM:SSF55277 |
UniParc:UPI0001BE4EDC | ArrayExpress:VIT_01s0146g00140 | EnsemblPlantsGene:VIT_01s0146g00140 | EnsemblPlants:VIT_01s0146g00140.t01 | RefSeq:XP_002264994 | RefSeq:XP_002264994.2 |
SEG:seg | : | : | : | : | : |
Description
No Description!
Coordinates
chr1:-:21924403..21931349
Molecular Weight (calculated)
104601.0 Da
IEP (calculated)
5.151
GRAVY (calculated)
-0.780
Length
946 amino acids
Sequence
(BLAST)
(BLAST)
001: MAESKLDLPD DLISTKPSDQ FWTATVVASG GNDDEKALMG LADESKDQLA SESSIPLSPQ WLYSKPNETK METRAPNSAA LGNSTDPNQK EGWRLDASED
101: KKDWRKIATD TESNRRWREE ERETGLLGGR RNLRKVDRRV DTVSIRESID SRALPTSERW HDGSNRNSVH ETRRDSKWSS RWGPEEREKE SRTEKRPDVD
201: KEDAHSDNQS FVGSNRPAPE RDSDSRDKWR PRHRMELHSG GPTSYRAAPG FGIERARLEG SHVGFAIGRG RSTALGSTPV LRSSSAGPIG GAQFERNGNV
301: TGKLNLLDDT LCYPRGKLLD IYRRKKLDPS FATMPENMEE TPHITLGDFI EPLAFVAPDA EEEVILRDIW KGKITSSGVV YNSFRKGRTT ENVTGIEDLE
401: SPKEKQGILP SITTKEIADT FPEGVNDGAY QDDDSGISFN YNMTKNMIDE MDANQGEGKY SVAGMDDMIS TVSKGSSLCG VSEMSGANRT ASQLKAVENE
501: HLANSDFTKH DKLDNITSAA SFDIGCGLPD ISNSIFALPS PKHSLSSNMQ HLNSTGGTNL LGRGIPPEDF SLHYLDPQGE IQGPFLGVDI ISWFKQGFFG
601: IDLPVRLSDA PEGIPFQDLG EIMPHLKTKD GANSTDASSE LEHAGILGAN LEASSPAPGP VPVPDIADTT ALNDHHWSLS EFDGLSSQNF QQRKSEREGP
701: LQLSYSDGQS FHDFSPQDEE IVFPGRPGSG GGGYPIGKPS RSTQDPLANP ITYSSLPNEL TEPVMANQND NKLHQFGLLW SELEGAHPTH AQPSNLSSSI
801: GRLGPLGAMA GSTPDAEAFS DVYRRNILSN PNSYQDATAT RHLSHIEQDS NRFDLAEQLM RQQFQQQLQQ RQLQQQNLLS SHAHLNESLL EQVASRNHMH
901: HQRLANQPFH QKQMLLQEQK QAQARQALLE QYNSVPTILR GMLIHL
101: KKDWRKIATD TESNRRWREE ERETGLLGGR RNLRKVDRRV DTVSIRESID SRALPTSERW HDGSNRNSVH ETRRDSKWSS RWGPEEREKE SRTEKRPDVD
201: KEDAHSDNQS FVGSNRPAPE RDSDSRDKWR PRHRMELHSG GPTSYRAAPG FGIERARLEG SHVGFAIGRG RSTALGSTPV LRSSSAGPIG GAQFERNGNV
301: TGKLNLLDDT LCYPRGKLLD IYRRKKLDPS FATMPENMEE TPHITLGDFI EPLAFVAPDA EEEVILRDIW KGKITSSGVV YNSFRKGRTT ENVTGIEDLE
401: SPKEKQGILP SITTKEIADT FPEGVNDGAY QDDDSGISFN YNMTKNMIDE MDANQGEGKY SVAGMDDMIS TVSKGSSLCG VSEMSGANRT ASQLKAVENE
501: HLANSDFTKH DKLDNITSAA SFDIGCGLPD ISNSIFALPS PKHSLSSNMQ HLNSTGGTNL LGRGIPPEDF SLHYLDPQGE IQGPFLGVDI ISWFKQGFFG
601: IDLPVRLSDA PEGIPFQDLG EIMPHLKTKD GANSTDASSE LEHAGILGAN LEASSPAPGP VPVPDIADTT ALNDHHWSLS EFDGLSSQNF QQRKSEREGP
701: LQLSYSDGQS FHDFSPQDEE IVFPGRPGSG GGGYPIGKPS RSTQDPLANP ITYSSLPNEL TEPVMANQND NKLHQFGLLW SELEGAHPTH AQPSNLSSSI
801: GRLGPLGAMA GSTPDAEAFS DVYRRNILSN PNSYQDATAT RHLSHIEQDS NRFDLAEQLM RQQFQQQLQQ RQLQQQNLLS SHAHLNESLL EQVASRNHMH
901: HQRLANQPFH QKQMLLQEQK QAQARQALLE QYNSVPTILR GMLIHL
0001: MAEGKFDLPD DLIFSKSSDQ LKELASDNSI PLSPQWLYTK SSEYKMDVRS PTPVPMGNPS DPNPKDAWRL DAPEDKKDWK KIVHENETSR RWREEERETG
0101: LLGARKVDRR KTERRIDSVS SRETGDIKNA AASDRWNDVN SRAAVHEPRR DNKWSSRWGP DDKEKEARCE KVDINKDKEE PQSESQSVVS NVRATSERDS
0201: DTRDKWRPRH RMESQSGGPS SYRAAPGFGL DRGRAEGPNL GFTVGRGRAS TIGRGSSTSL IGAGSALSPV FRYPRGKLLD MYRKQKPDSS LGRILTEMDE
0301: VASITQVALI EPLAFIAPDA EEEANLNGIW KGRIISSEVY TSSGEESLGG NSLLKCRIPE SGETKVDGAL LGFMNGDNGS MKNNDSGLLG SHNGGLGAAS
0401: SVPRLNSVAS ESYGSGGAGY QLSHGSPEAV RSVFTKSSVL DGSESVVGSF EQAYTGKLQQ PDTEVDHSEG AMPPEEFLFL YIDPQGVIQG PFIGSDIISW
0501: FEQGFFGTDL QVRLASAPEG TPFQDLGRVM SYIKAESVHA HISDQKSELE ETSLKANSEA GGSVAHVAES NDSSSLTGIS RSFSVYNNPS GQDNFQRKSE
0601: SEVYGRPPHA EDQSFLDFSA QDEEIVFPGR ARVSGYASSV KSSTSMHDAL MEFSGHSDIP VEVTTAATRN QNENKLHPFG VLWSELEGGS TPVNPLPNRS
0701: SGAMGEPSCS IENRPINSRR NSQIDPNISL DALSGNRMSQ FEHESNFFNH GDQLPSNQHH QQHFQNRDML SHLHIGDQDL EHLITLQLQQ QQKIQMQQQQ
0801: KIQLQQQQKI QLQQHQLEQE HQLHQKLLQE QQQSHARQLH FQQILQGQTP DTRFGQSHDF PRSNSVDQML LEQQMLNELQ KSSGHPSQNF APYIEQHAAG
0901: NFGRFTHEGH QRELLEQLFS TQMQSQYGQK QSQYGQMQSQ HGQLQSEPIR SLEYQLLQQE QLMQLANGVR HNTLLEEQRH IDPLWPSDHS DQLLRTHPGI
1001: HRSHSSAGFR PLDFHQQQQR PHFEDQFSQL ERNRSYQQQL RLELLEHGLP FERSASGLNL DAVNGLGLSQ GLELRDATAH MQSSGRLGNS TPGFSHQNPR
1101: IPLGESHFSH LEPTEGRWSG ADTQLAGDWA ESQFRRSNMD TEHDKMRSEI RRLGEDPNSW MVGGSTDDKS KQLFMELLHQ RPGHQSAESP NMNRGYPYDR
1201: MVPSGLTPGI QTLGGLSDHG GNQNVSSAFG DRSFSDEQVN RVPGYGNNMG SLHHNSSLLS GIIDAGRSTQ NETQAFSNMF GMNKDANDIN TWNNVPPKNE
1301: GMGRMMSYDA QDRMGKQAVL DSLIQEELPV GTPGQQSSFN ISDRYSDNLV GEDRRKDRLV VPSHGQNSVL LKRPPSSHSS SSHEGLLERM SDTASRAAAS
1401: SYSGIEGGVR RESGAAGNKG STSEAASFSE MLKKSNSMKK VAAESTDATE GSKGGGGKKK GKKGRQIDPA LLGFKVTSNR ILMGEIHRAD DF
0101: LLGARKVDRR KTERRIDSVS SRETGDIKNA AASDRWNDVN SRAAVHEPRR DNKWSSRWGP DDKEKEARCE KVDINKDKEE PQSESQSVVS NVRATSERDS
0201: DTRDKWRPRH RMESQSGGPS SYRAAPGFGL DRGRAEGPNL GFTVGRGRAS TIGRGSSTSL IGAGSALSPV FRYPRGKLLD MYRKQKPDSS LGRILTEMDE
0301: VASITQVALI EPLAFIAPDA EEEANLNGIW KGRIISSEVY TSSGEESLGG NSLLKCRIPE SGETKVDGAL LGFMNGDNGS MKNNDSGLLG SHNGGLGAAS
0401: SVPRLNSVAS ESYGSGGAGY QLSHGSPEAV RSVFTKSSVL DGSESVVGSF EQAYTGKLQQ PDTEVDHSEG AMPPEEFLFL YIDPQGVIQG PFIGSDIISW
0501: FEQGFFGTDL QVRLASAPEG TPFQDLGRVM SYIKAESVHA HISDQKSELE ETSLKANSEA GGSVAHVAES NDSSSLTGIS RSFSVYNNPS GQDNFQRKSE
0601: SEVYGRPPHA EDQSFLDFSA QDEEIVFPGR ARVSGYASSV KSSTSMHDAL MEFSGHSDIP VEVTTAATRN QNENKLHPFG VLWSELEGGS TPVNPLPNRS
0701: SGAMGEPSCS IENRPINSRR NSQIDPNISL DALSGNRMSQ FEHESNFFNH GDQLPSNQHH QQHFQNRDML SHLHIGDQDL EHLITLQLQQ QQKIQMQQQQ
0801: KIQLQQQQKI QLQQHQLEQE HQLHQKLLQE QQQSHARQLH FQQILQGQTP DTRFGQSHDF PRSNSVDQML LEQQMLNELQ KSSGHPSQNF APYIEQHAAG
0901: NFGRFTHEGH QRELLEQLFS TQMQSQYGQK QSQYGQMQSQ HGQLQSEPIR SLEYQLLQQE QLMQLANGVR HNTLLEEQRH IDPLWPSDHS DQLLRTHPGI
1001: HRSHSSAGFR PLDFHQQQQR PHFEDQFSQL ERNRSYQQQL RLELLEHGLP FERSASGLNL DAVNGLGLSQ GLELRDATAH MQSSGRLGNS TPGFSHQNPR
1101: IPLGESHFSH LEPTEGRWSG ADTQLAGDWA ESQFRRSNMD TEHDKMRSEI RRLGEDPNSW MVGGSTDDKS KQLFMELLHQ RPGHQSAESP NMNRGYPYDR
1201: MVPSGLTPGI QTLGGLSDHG GNQNVSSAFG DRSFSDEQVN RVPGYGNNMG SLHHNSSLLS GIIDAGRSTQ NETQAFSNMF GMNKDANDIN TWNNVPPKNE
1301: GMGRMMSYDA QDRMGKQAVL DSLIQEELPV GTPGQQSSFN ISDRYSDNLV GEDRRKDRLV VPSHGQNSVL LKRPPSSHSS SSHEGLLERM SDTASRAAAS
1401: SYSGIEGGVR RESGAAGNKG STSEAASFSE MLKKSNSMKK VAAESTDATE GSKGGGGKKK GKKGRQIDPA LLGFKVTSNR ILMGEIHRAD DF
Arabidopsis Description
GYF domain-containing protein [Source:UniProtKB/TrEMBL;Acc:F4HSW8]
SUBAcon: [cytosol]
SUBAcon: [cytosol]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.