Subcellular Localization
min:
: max
Winner_takes_all: plastid, nucleus
Predictor Summary:
Predictor Summary:
- nucleus 3
- plastid 3
- plasma membrane 1
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
KRH68075 | Soybean | nucleus, plastid | 90.27 | 90.27 |
VIT_08s0007g07150.t01 | Wine grape | cytosol, nucleus, plasma membrane | 64.89 | 66.03 |
CDY08477 | Canola | nucleus | 57.32 | 58.46 |
Solyc10g080630.1.1 | Tomato | nucleus | 58.26 | 58.0 |
AT3G11960.3 | Thale cress | nucleus | 57.39 | 57.72 |
CDY01074 | Canola | nucleus | 56.45 | 57.62 |
Bra001431.1-P | Field mustard | nucleus, plastid | 55.66 | 57.57 |
CDX73841 | Canola | nucleus, plastid | 55.66 | 57.57 |
GSMUA_Achr9P11080_001 | Banana | nucleus, plasma membrane, plastid | 54.79 | 56.3 |
CDY00750 | Canola | nucleus | 54.0 | 55.94 |
Bra034797.1-P | Field mustard | nucleus | 56.38 | 53.86 |
Os07t0203700-01 | Rice | cytosol, nucleus | 23.79 | 49.55 |
KXG34639 | Sorghum | plastid | 47.22 | 48.13 |
TraesCS2B01G258400.1 | Wheat | cytosol, nucleus | 41.89 | 48.06 |
Zm00001d019115_P048 | Maize | plastid | 46.58 | 47.64 |
TraesCS2A01G243300.3 | Wheat | plastid | 46.0 | 46.88 |
TraesCS2D01G245600.6 | Wheat | plastid | 46.0 | 46.88 |
HORVU2Hr1G055870.14 | Barley | plastid | 45.78 | 46.69 |
Os07t0203800-01 | Rice | cytosol, plastid | 6.71 | 33.1 |
KRH43086 | Soybean | nucleus | 3.39 | 23.15 |
KRH59213 | Soybean | cytosol, plasma membrane, plastid | 7.21 | 20.0 |
KRH59212 | Soybean | cytosol, nucleus | 7.21 | 18.76 |
KRH25071 | Soybean | nucleus | 7.86 | 18.26 |
KRH36214 | Soybean | nucleus | 15.93 | 18.2 |
KRG92964 | Soybean | nucleus | 15.79 | 18.04 |
Protein Annotations
EntrezGene:100799711 | Gene3D:2.130.10.10 | MapMan:35.1 | UniProt:A0A0R0F241 | EMBL:ACUP02012151 | InterPro:Cleavage/polyA-sp_fac_asu_C |
EnsemblPlantsGene:GLYMA_19G204200 | GO:GO:0003674 | GO:GO:0003676 | GO:GO:0005488 | GO:GO:0005515 | GO:GO:0005575 |
GO:GO:0005622 | GO:GO:0005623 | GO:GO:0005634 | InterPro:IPR015943 | EnsemblPlants:KRG96340 | ProteinID:KRG96340 |
ProteinID:KRG96340.1 | PFAM:PF03178 | PFAM:PF10433 | PANTHER:PTHR10644 | PANTHER:PTHR10644:SF6 | UniParc:UPI0006EDC142 |
InterPro:WD40/YVTN_repeat-like_dom_sf | SEG:seg | : | : | : | : |
Description
hypothetical protein
Coordinates
chr19:+:46037890..46053993
Molecular Weight (calculated)
152460.0 Da
IEP (calculated)
5.509
GRAVY (calculated)
0.008
Length
1387 amino acids
Sequence
(BLAST)
(BLAST)
0001: MAVSEEECSS AKSGSSGPSS SSSSASRYYL SKCVFRGSVV LHVLHAHIRS PSSNDVVFGK ETSIELVVID EDGNVQSVFD QPVFGTLKDL AILPWNEKFR
0101: AARDPQLWGK DLLVATSDSG KLSLLTFCNE MHRFVPVTHI QLSNPGNQMD FPGRKLAVDS SGCFIAASAY EDRLALFSLS MSSGDIIDER IVYPSESEGT
0201: ASTSRSIQRT SISVTIWSIC FISQDSRQPS KEHNPVLALI INRREALLNE LLLLEWNVKA RKIFVISQYV EAGPLAHDIV EVPNSGGLAF LFRAGDVLLM
0301: DLRDHRNPSC VCKTNLNFLP HAMEEQTYVE DSCKLHDVDD ERFSVAACAL LELSDYDPMC IDSDNGGANS GYKYICSWSW EPENNRDPKM IFCVDTGEFF
0401: MIEVLFNSEG PKVNLSECLY KGLPCKALLW VEGGYLAALV EMGDGMVLKL EDGRLCYTNP IQNIAPILDM EVVDYHDEKH DQMFACCGVA PEGSLRIIRN
0501: GINVENLHRT ASIYQGVSGT WTVRMKVTDS HHSFLVLSFL DETRILSVGL SFTDVTDSVG FQPNVCTLAC GLVTDGLLVQ IHRSTVKLCL PTKASHSEGI
0601: PLSSPICTSW SPDNVGISLG AVGHNFIVVS TTNPCFLFIL GVRLLSVYQY EIYEMQHLVL QNELSCISIP GQEIEQKQSN SSISANNSSI SSFQSGVDIN
0701: KTFVIGTHKP SVEIWFFAPG GGITVVACGT ISLTNTIGSV KSDSIPQDVR LVSADKYYVL AGLRNGMLLR FEWPAEPCPS SPINMVDTAL SSTNLVNSVT
0801: NAFDKRNDLP SMLQLIAIRR IGITPIFLVP LGDTLDADII VLADRPWLLH SARQGLSYTS ISFQPATHVT PVSCVEFPKG ILFVAENSLH LVEMGHGKRL
0901: NVQKFHLEGT PRKVLYHDES KMLLVMRTEL NCGPCLSDIC CVDSLSGSVL SSFRLELGET GKSMELVRVG SEQVLVVGTS LSSGPHTMPT GEAESCKGRL
1001: LVLCLDHVQN SDSGSMTFCS KAGSSSQKTS PFHEIVTYAP ELLSSSSLGS SPDDNSSDGI KLHENEVWQF RLAYATKWPG VVLKICPYLD RYFLATAGNA
1101: FYVCGFPNDN PQRVRRYAMG RTRYMITSLT AHLTRIAVGD CRDGILLYSY HEEAKKLELL YNDPSQRIVA DCILMDADTA VVSDRKGSIA VLCSDHLEAS
1201: DNAGAQCNMT LSCAYFMAEI AMSIKKGSYS YRLPADDVLE GGNGPKTNVD SLQNTIIAST LLGSIMIFIP LSREEYELLE VVQARLVVHH LTAPVLGNDH
1301: HEFRSRENRV GVPKILDGDI LTQFLELTSM QQKMILSLEQ PDMVKPSLKP LLPSHVSVNQ NMEHVHAVVN NIVRQLLRAC LSKCCQI
0101: AARDPQLWGK DLLVATSDSG KLSLLTFCNE MHRFVPVTHI QLSNPGNQMD FPGRKLAVDS SGCFIAASAY EDRLALFSLS MSSGDIIDER IVYPSESEGT
0201: ASTSRSIQRT SISVTIWSIC FISQDSRQPS KEHNPVLALI INRREALLNE LLLLEWNVKA RKIFVISQYV EAGPLAHDIV EVPNSGGLAF LFRAGDVLLM
0301: DLRDHRNPSC VCKTNLNFLP HAMEEQTYVE DSCKLHDVDD ERFSVAACAL LELSDYDPMC IDSDNGGANS GYKYICSWSW EPENNRDPKM IFCVDTGEFF
0401: MIEVLFNSEG PKVNLSECLY KGLPCKALLW VEGGYLAALV EMGDGMVLKL EDGRLCYTNP IQNIAPILDM EVVDYHDEKH DQMFACCGVA PEGSLRIIRN
0501: GINVENLHRT ASIYQGVSGT WTVRMKVTDS HHSFLVLSFL DETRILSVGL SFTDVTDSVG FQPNVCTLAC GLVTDGLLVQ IHRSTVKLCL PTKASHSEGI
0601: PLSSPICTSW SPDNVGISLG AVGHNFIVVS TTNPCFLFIL GVRLLSVYQY EIYEMQHLVL QNELSCISIP GQEIEQKQSN SSISANNSSI SSFQSGVDIN
0701: KTFVIGTHKP SVEIWFFAPG GGITVVACGT ISLTNTIGSV KSDSIPQDVR LVSADKYYVL AGLRNGMLLR FEWPAEPCPS SPINMVDTAL SSTNLVNSVT
0801: NAFDKRNDLP SMLQLIAIRR IGITPIFLVP LGDTLDADII VLADRPWLLH SARQGLSYTS ISFQPATHVT PVSCVEFPKG ILFVAENSLH LVEMGHGKRL
0901: NVQKFHLEGT PRKVLYHDES KMLLVMRTEL NCGPCLSDIC CVDSLSGSVL SSFRLELGET GKSMELVRVG SEQVLVVGTS LSSGPHTMPT GEAESCKGRL
1001: LVLCLDHVQN SDSGSMTFCS KAGSSSQKTS PFHEIVTYAP ELLSSSSLGS SPDDNSSDGI KLHENEVWQF RLAYATKWPG VVLKICPYLD RYFLATAGNA
1101: FYVCGFPNDN PQRVRRYAMG RTRYMITSLT AHLTRIAVGD CRDGILLYSY HEEAKKLELL YNDPSQRIVA DCILMDADTA VVSDRKGSIA VLCSDHLEAS
1201: DNAGAQCNMT LSCAYFMAEI AMSIKKGSYS YRLPADDVLE GGNGPKTNVD SLQNTIIAST LLGSIMIFIP LSREEYELLE VVQARLVVHH LTAPVLGNDH
1301: HEFRSRENRV GVPKILDGDI LTQFLELTSM QQKMILSLEQ PDMVKPSLKP LLPSHVSVNQ NMEHVHAVVN NIVRQLLRAC LSKCCQI
0001: MAAPEDESSA QSQSSPATAA PTPPPSSSPS SAGDHYLAKC ILRPSVVLQV AYGYFRSPSS RDIVFGKETC IELVVIGEDG IVESVCEQYV FGTIKDLAVI
0101: PQSSKLYSNS LQMGKDLLAV LSDSGKLSFL SFSNEMHRFS PIQHVQLSTP GNSRIQLGRM LTIDSSGLFL AVSAYHDRFA LFSLSTSSMG DIIHQRISYP
0201: SEDGGNGSSI QAISGTIWSM CFISKDFNES KEYAPILAIV INRKGSLMNE LALFRWNVKE ESICLISEYV ETGALAHSIV EVPHSSGFAF LFRIGDVLLM
0301: DLRDPQNPCC LFRTSLDFVP ASLMEEHFVE ESCRVQDGDD EGCNVVVCAL LELRDHEVRD HDPMFIDTES DIGKLSSKNV SSWTWEPENN HNPRMIICLD
0401: NGDFFMFELI YEDDGVKVNL SECLYKGLPC KDILWIEGGF LATFAEMADG TVFKLGTEKL HWMSSIQNIA PILDFSVMDD QNEKRDQIFA CCGVTPEGSL
0501: RIIRSGINVE KLLKTAPVYQ GITGTWTVKM KLTDVYHSFL VLSFVEETRV LSVGLSFKDV TDSVGFQSDV CTFACGLVAD GLLVQIHQDA IRLCMPTMDA
0601: HSDGIPVSSP FFSSWFPENV SISLGAVGQN LIVVSTSNPC FLSILGVKSV SSQCCEIYEI QRVTLQYEVS CISVPQKHIG KKRSRDSSPD NFCKAAIPSA
0701: MEQGYTFLIG THKPSVEVLS FTEDGVGVRV LASGLVSLTN TMGTVISGCI PQDVRLVLVD QLYVLSGLRN GMLLRFEWAP FSNSSGLNCP DYFSHCKEEM
0801: DTVVGKKDNL PVNLLLIATR RIGITPVFLV PFSDSLDSDI IALSDRPWLL QTARQSLSYT SISFQPSTHA TPVCSFECPQ GILFVSENCL HLVEMVHSKR
0901: RNAQKFQLGG TPRKVIYHSE SKLLIVMRTD LYDTCTSDIC CVDPLSGSVL SSYKLKPGET GKSMELVRVG NEHVLVVGTS LSSGPAILPS GEAESTKGRV
1001: IILCLEHTQN SDSGSMTICS KACSSSQRTS PFHDVVGYTT ENLSSSSLCS SPDDYSYDGI KLDEAETWQL RLASSTTWPG MVLAICPYLD HYFLASAGNA
1101: FYVCGFPNDS PERMKRFAVG RTRFMITSLR TYFTRIVVGD CRDGVLFYSY HEESKKLHQI YCDPAQRLVA DCFLMDANSV AVSDRKGSIA ILSCKDHSDF
1201: GMKHLEYSSP ESNLNLNCAY YMGEIAMSIK KGCNIYKLPA DDVLRSYGLS KSIDTADDTI IAGTLLGSIF VFAPISSEEY ELLEGVQAKL GIHPLTAPVL
1301: GNDHNEFRGR ENPSQARKIL DGDMLAQFLE LTNRQQESVL STPQPSPSTS KASSKQRSFP PLMLHQVVQL LERVHYALH
0101: PQSSKLYSNS LQMGKDLLAV LSDSGKLSFL SFSNEMHRFS PIQHVQLSTP GNSRIQLGRM LTIDSSGLFL AVSAYHDRFA LFSLSTSSMG DIIHQRISYP
0201: SEDGGNGSSI QAISGTIWSM CFISKDFNES KEYAPILAIV INRKGSLMNE LALFRWNVKE ESICLISEYV ETGALAHSIV EVPHSSGFAF LFRIGDVLLM
0301: DLRDPQNPCC LFRTSLDFVP ASLMEEHFVE ESCRVQDGDD EGCNVVVCAL LELRDHEVRD HDPMFIDTES DIGKLSSKNV SSWTWEPENN HNPRMIICLD
0401: NGDFFMFELI YEDDGVKVNL SECLYKGLPC KDILWIEGGF LATFAEMADG TVFKLGTEKL HWMSSIQNIA PILDFSVMDD QNEKRDQIFA CCGVTPEGSL
0501: RIIRSGINVE KLLKTAPVYQ GITGTWTVKM KLTDVYHSFL VLSFVEETRV LSVGLSFKDV TDSVGFQSDV CTFACGLVAD GLLVQIHQDA IRLCMPTMDA
0601: HSDGIPVSSP FFSSWFPENV SISLGAVGQN LIVVSTSNPC FLSILGVKSV SSQCCEIYEI QRVTLQYEVS CISVPQKHIG KKRSRDSSPD NFCKAAIPSA
0701: MEQGYTFLIG THKPSVEVLS FTEDGVGVRV LASGLVSLTN TMGTVISGCI PQDVRLVLVD QLYVLSGLRN GMLLRFEWAP FSNSSGLNCP DYFSHCKEEM
0801: DTVVGKKDNL PVNLLLIATR RIGITPVFLV PFSDSLDSDI IALSDRPWLL QTARQSLSYT SISFQPSTHA TPVCSFECPQ GILFVSENCL HLVEMVHSKR
0901: RNAQKFQLGG TPRKVIYHSE SKLLIVMRTD LYDTCTSDIC CVDPLSGSVL SSYKLKPGET GKSMELVRVG NEHVLVVGTS LSSGPAILPS GEAESTKGRV
1001: IILCLEHTQN SDSGSMTICS KACSSSQRTS PFHDVVGYTT ENLSSSSLCS SPDDYSYDGI KLDEAETWQL RLASSTTWPG MVLAICPYLD HYFLASAGNA
1101: FYVCGFPNDS PERMKRFAVG RTRFMITSLR TYFTRIVVGD CRDGVLFYSY HEESKKLHQI YCDPAQRLVA DCFLMDANSV AVSDRKGSIA ILSCKDHSDF
1201: GMKHLEYSSP ESNLNLNCAY YMGEIAMSIK KGCNIYKLPA DDVLRSYGLS KSIDTADDTI IAGTLLGSIF VFAPISSEEY ELLEGVQAKL GIHPLTAPVL
1301: GNDHNEFRGR ENPSQARKIL DGDMLAQFLE LTNRQQESVL STPQPSPSTS KASSKQRSFP PLMLHQVVQL LERVHYALH
Arabidopsis Description
Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Source:UniProtKB/TrEMBL;Acc:Q84R20]
SUBAcon: [nucleus]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.