Subcellular Localization
Winner_takes_all: nucleus
Predictor Summary:
- nucleus 5
- cytosol 1
- mitochondrion 1
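The winner-takes-all call can be reproduced from the predictor counts with a simple plurality vote. This is a minimal sketch only: the page does not describe how CropPAL resolves the call, so the function name `winner_takes_all` and the alphabetical tie-break are assumptions made for illustration.

```python
# Predictor votes copied from the summary above.
votes = {"nucleus": 5, "cytosol": 1, "mitochondrion": 1}


def winner_takes_all(votes):
    """Return the location with the most predictor votes.

    Ties are broken alphabetically; this tie-break is an assumption,
    not documented CropPAL behaviour.
    """
    top = max(votes.values())
    return sorted(loc for loc, n in votes.items() if n == top)[0]


print(winner_takes_all(votes))  # nucleus
```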
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|---|---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
Zm00001d022028_P005 | Maize | nucleus | 89.28 | 94.49 |
Os05t0274200-01 | Rice | cytosol | 87.47 | 87.47 |
TraesCS1D01G131600.3 | Wheat | nucleus | 85.03 | 85.03 |
TraesCS1A01G135100.1 | Wheat | nucleus | 85.03 | 83.52 |
CDY40145 | Canola | cytosol, extracellular, plastid | 7.43 | 83.33 |
TraesCS1B01G149700.3 | Wheat | nucleus | 84.82 | 82.88 |
HORVU1Hr1G030930.22 | Barley | plastid | 82.27 | 77.73 |
GSMUA_Achr6P13480_001 | Banana | cytosol | 20.7 | 75.88 |
Bra028327.1-P | Field mustard | extracellular, vacuole | 12.95 | 73.94 |
GSMUA_Achr6P13490_001 | Banana | cytosol | 50.11 | 71.62 |
VIT_17s0000g04170.t01 | Wine grape | cytosol | 71.44 | 71.22 |
KRH03809 | Soybean | nucleus | 69.53 | 69.53 |
Solyc06g069230.2.1 | Tomato | cytosol | 68.68 | 68.61 |
PGSC0003DMT400010422 | Potato | cytosol | 68.47 | 68.4 |
AT3G18524.1 | Thale cress | nucleus | 64.97 | 65.31 |
Bra001709.1-P | Field mustard | nucleus | 64.33 | 64.67 |
CDX82340 | Canola | nucleus | 64.33 | 64.67 |
CDY52899 | Canola | nucleus | 64.12 | 64.46 |
PGSC0003DMT400021697 | Potato | cytosol, extracellular, nucleus | 12.0 | 56.22 |
PGSC0003DMT400021699 | Potato | cytosol | 8.49 | 50.31 |
EER99382 | Sorghum | cytosol | 17.83 | 20.92 |
OQU78247 | Sorghum | cytosol | 16.99 | 20.38 |
EES11671 | Sorghum | plastid | 22.72 | 19.45 |
KXG35601 | Sorghum | mitochondrion, nucleus, plastid | 20.81 | 15.03 |
KXG31683 | Sorghum | mitochondrion, nucleus | 19.43 | 14.87 |
KXG26672 | Sorghum | mitochondrion, nucleus | 14.01 | 11.67 |
OQU91528 | Sorghum | plastid | 10.08 | 11.43 |
EES11666 | Sorghum | plastid | 10.19 | 10.53 |
EES13125 | Sorghum | cytosol | 0.0 | 0.0 |
Protein Annotations
Description
hypothetical protein
Coordinates
chr2:-:72341324..72351213
Molecular Weight (calculated)
105297.0 Da
IEP (calculated)
5.544
GRAVY (calculated)
-0.249
Length
942 amino acids
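A GRAVY score is conventionally the mean Kyte-Doolittle hydropathy value over all residues. The sketch below assumes that convention applies to the value reported here, which the page does not confirm. The helper names `clean_sequence` and `gravy` are hypothetical, and the short peptide is just the first ten residues of the sequence on this page.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def clean_sequence(lines):
    """Join numbered sequence lines such as '001: MEGDD FTPEG' into one string."""
    residues = []
    for line in lines:
        # Drop the leading position label ("001:") if present.
        _, _, body = line.rpartition(":")
        residues.append("".join(body.split()))
    return "".join(residues)


def gravy(sequence):
    """Mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in sequence) / len(sequence)


peptide = clean_sequence(["001: MEGDDFTPEG"])
print(round(gravy(peptide), 3))
```

Passing all nine numbered lines of the full sequence to `clean_sequence` and then to `gravy` would give a value to compare against the figure reported above.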
Sequence
(BLAST)
001: MEGDDFTPEG GKLPEFKLDA RQAQGFISFF KRLPQDPRAV RLFDRRDYYT AHGENATFIA RTYYHTMSAL RQLGSSSDGI SSVSVSKAMF ETIARNILLE
101: RTDCTLELYE GSGSNWRLTK SGTPGNIGSF EDLLFANNDM QDSPVIVALF PVCRESQLYV GLSFLDMTNR KLGLAEFPED SRFTNVESAL VALGCKECLL
201: SEDCEKSIDL NPLRDAISNC NVLLTVKKKA DFKSRDLAQD LGRIIRGSVE PVRDLLSQFD YALGPLGALL SYAELLADDT NYGNYTIEKY NLNCYMRLDS
301: AAVRALNISE RKTDVNKNFS LFGLMNRTCT VGMGKRLLNR WLKQPLLDVN EINNRLDMVQ AFVEDPELRQ GLRQQLKRIS DIDRLTHALR KKSATLQPVV
401: KLYQSCCRIS YIKGILEQYN GQFSTLIRSK FLEPLEEWMA EDRFGRFSSL VETTIDLGQL ENGEYRISPL YSSDLGVLKD ELSVVENHIN NLHVDTASDL
501: DLSVDKQLKL EKGPLGHVFR MSKKEEQKVR KKLTGSYLII ETRKDGVKFT SSKLKKLSDQ YQALFAEYTS CQKKVVGDVV RVSGSYSEVF ENFAAVLSEL
601: DVLQSFADLA TSCPVPYVRP DITVSDEGDI VLLGSRHPCL EAQDGVNFIP NDCTLVRGKS WFQIITGPNM GGKSTFIRQV GVNVLMAQVG SFVPCDQASV
701: SVRDCIFARV GAGDCQLHGV STFMQEMLET ASILKGASDK SLIIIDELGR GTSTYDGFGL AWAICEHLME VTRAPTLFAT HFHELTALAH KNDDEHQRVS
801: NIGIANYHVG AHIDPSSRKL TMLYKVEPGA CDQSFGIHVA EFANFPEAVV ALAKSKAAEL EDFSTTPTFS DDSKDEVGSK RKRVFSPDDV TRGAARARLF
901: LEDFAALPVD EMDRSKIVEM VTKMKSDLQK DAADNPWLQQ FF
001: MEGNFEEQNK LPELKLDAKQ AQGFLSFYKT LPNDTRAVRF FDRKDYYTAH GENSVFIAKT YYHTTTALRQ LGSGSNALSS VSISRNMFET IARDLLLERN
101: DHTVELYEGS GSNWRLVKTG SPGNIGSFED VLFANNEMQD TPVVVSIFPS FHDGRCVIGM AYVDLTRRVL GLAEFLDDSR FTNLESSLIA LGAKECIFPA
201: ESGKSNECKS LYDSLERCAV MITERKKHEF KGRDLDSDLK RLVKGNIEPV RDLVSGFDLA TPALGALLSF SELLSNEDNY GNFTIRRYDI GGFMRLDSAA
301: MRALNVMESK TDANKNFSLF GLMNRTCTAG MGKRLLHMWL KQPLVDLNEI KTRLDIVQCF VEEAGLRQDL RQHLKRISDV ERLLRSLERR RGGLQHIIKL
401: YQSTIRLPFI KTAMQQYTGE FASLISERYL KKLEALSDQD HLGKFIDLVE CSVDLDQLEN GEYMISSSYD TKLASLKDQK ELLEQQIHEL HKKTAIELDL
501: QVDKALKLDK AAQFGHVFRI TKKEEPKIRK KLTTQFIVLE TRKDGVKFTN TKLKKLGDQY QSVVDDYRSC QKELVDRVVE TVTSFSEVFE DLAGLLSEMD
601: VLLSFADLAA SCPTPYCRPE ITSSDAGDIV LEGSRHPCVE AQDWVNFIPN DCRLMRGKSW FQIVTGPNMG GKSTFIRQVG VIVLMAQVGS FVPCDKASIS
701: IRDCIFARVG AGDCQLRGVS TFMQEMLETA SILKGASDKS LIIIDELGRG TSTYDGFGLA WAICEHLVQV KRAPTLFATH FHELTALAQA NSEVSGNTVG
801: VANFHVSAHI DTESRKLTML YKVEPGACDQ SFGIHVAEFA NFPESVVALA REKAAELEDF SPSSMIINNE ESGKRKSRED DPDEVSRGAE RAHKFLKEFA
901: AIPLDKMELK DSLQRVREMK DELEKDAADC HWLRQFL
Arabidopsis Description
MSH2: DNA mismatch repair protein MSH2 [Source:UniProtKB/Swiss-Prot;Acc:O24617]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata), and Wine grape (Vitis vinifera), as well as precomputed subcellular localization predictions derived from protein sequences.