Subcellular Localization
min:
: max
Winner_takes_all: nucleus
Predictor Summary:
Predictor Summary:
- cytosol 1
- mitochondrion 1
- nucleus 2
Predictors | GFP | MS/MS | Papers | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
TraesCS1D01G131600.3 | Wheat | nucleus | 97.29 | 99.04 |
TraesCS1B01G149700.3 | Wheat | nucleus | 98.75 | 98.24 |
HORVU1Hr1G030930.22 | Barley | plastid | 93.22 | 89.67 |
Os05t0274200-01 | Rice | cytosol | 86.24 | 87.79 |
EER99643 | Sorghum | nucleus | 83.52 | 85.03 |
Zm00001d022028_P005 | Maize | nucleus | 78.1 | 84.16 |
CDY40145 | Canola | cytosol, extracellular, plastid | 7.19 | 82.14 |
GSMUA_Achr6P13480_001 | Banana | cytosol | 20.33 | 75.88 |
Bra028327.1-P | Field mustard | extracellular, vacuole | 12.51 | 72.73 |
GSMUA_Achr6P13490_001 | Banana | cytosol | 49.32 | 71.78 |
VIT_17s0000g04170.t01 | Wine grape | cytosol | 69.97 | 71.01 |
KRH03809 | Soybean | nucleus | 68.72 | 69.96 |
Solyc06g069230.2.1 | Tomato | cytosol | 67.47 | 68.61 |
PGSC0003DMT400010422 | Potato | cytosol | 67.15 | 68.29 |
AT3G18524.1 | Thale cress | nucleus | 63.82 | 65.31 |
Bra001709.1-P | Field mustard | nucleus | 62.25 | 63.71 |
CDY52899 | Canola | nucleus | 62.25 | 63.71 |
CDX82340 | Canola | nucleus | 62.25 | 63.71 |
PGSC0003DMT400021697 | Potato | cytosol, extracellular, nucleus | 12.1 | 57.71 |
PGSC0003DMT400021699 | Potato | cytosol | 8.55 | 51.57 |
TraesCS2A01G171900.1 | Wheat | cytosol | 18.04 | 21.36 |
TraesCS1A01G315900.1 | Wheat | cytosol | 16.48 | 19.32 |
TraesCS5A01G206300.1 | Wheat | mitochondrion | 20.96 | 15.24 |
TraesCS3A01G117500.1 | Wheat | mitochondrion, nucleus | 18.56 | 14.57 |
TraesCS1A01G179700.1 | Wheat | plastid | 10.22 | 11.78 |
TraesCS2A01G362100.1 | Wheat | mitochondrion, nucleus | 12.93 | 10.99 |
TraesCS2A01G588000.2 | Wheat | plastid | 10.43 | 10.92 |
Protein Annotations
Gene3D:1.10.1420.10 | MapMan:14.7.1.1 | Gene3D:3.30.420.110 | Gene3D:3.40.1170.10 | Gene3D:3.40.50.300 | ncoils:Coil |
InterPro:DNA_mismatch_repair_Msh2 | InterPro:DNA_mismatch_repair_MutS-lik_N | InterPro:DNA_mismatch_repair_MutS_C | InterPro:DNA_mismatch_repair_MutS_N | InterPro:DNA_mismatch_repair_MutS_clamp | InterPro:DNA_mismatch_repair_MutS_core |
InterPro:DNA_mismatch_repair_MutS_sf | InterPro:DNA_mmatch_repair_MutS_con_dom | GO:GO:0000166 | GO:GO:0003674 | GO:GO:0003676 | GO:GO:0003677 |
GO:GO:0005488 | GO:GO:0005524 | GO:GO:0005575 | GO:GO:0005622 | GO:GO:0005623 | GO:GO:0006139 |
GO:GO:0006259 | GO:GO:0006298 | GO:GO:0006950 | GO:GO:0008150 | GO:GO:0008152 | GO:GO:0009987 |
GO:GO:0030983 | GO:GO:0032300 | InterPro:IPR016151 | InterPro:IPR036678 | InterPro:Msh2 | InterPro:MutS_con_dom_sf |
InterPro:P-loop_NTPase | PFAM:PF00488 | PFAM:PF01624 | PFAM:PF05188 | PFAM:PF05190 | PFAM:PF05192 |
PIRSF:PIRSF005813 | ScanProsite:PS00486 | PANTHER:PTHR11361 | PANTHER:PTHR11361:SF35 | SMART:SM00533 | SMART:SM00534 |
SUPFAM:SSF48334 | SUPFAM:SSF52540 | EnsemblPlantsGene:TraesCS1A01G135100 | EnsemblPlants:TraesCS1A01G135100.1 | TIGR:cd03285 | : |
Description
No Description!
Coordinates
chr1A:-:206792880..206805344
Molecular Weight (calculated)
106933.0 Da
IEP (calculated)
5.620
GRAVY (calculated)
-0.233
Length
959 amino acids
Sequence
(BLAST)
(BLAST)
001: MDSEDCLPEG GKLPELKPDA RQAQGFISFF KKLPKDPRAI RFFDRRDYYT AHGENATFIA KTYYHTMTAL RQLGGNSDGI SSVSVSRAMF ETIARNLLLD
101: RTDHTLELYE GSGSSWRLTK SGTPGNIGSF EDILFANNDM QYSPVTVALF PVFREGQLYV ALSFVDMTNR KLGLAEFPED SRFTNVESAL VELGCKECLL
201: PADFEKSIDL QPLQDAISNC SILLTERKKA EFKSRDLVQD LGRIIRGSVE PVRDLLSQFD YALGTLGALV SYAELLADDT NYGNYTIEKF NLDRYMRLDS
301: AVVRALNIVE GKTDVNKNFS LFGLMNRTCT AGMGKRLLNK WLKQPLIDVN EINNRLDMVQ AFAEDPELRQ GLRQQLKRIS DIDRLTHALR KKSANLQPVV
401: KLYQSCSRIP YIKGVLQQYN GQFSTSVMTR FLSSLEEWLT ENRYGRFANL VETAIDLGQV DNGEYRISPL YSSDLAVLKD ELSEVENHIN NLHMHTATDL
501: DLSVDKQLKL EKGPFGHVFR ISKKDEQKVR KKLTTNYIII ETRKDGVKFT SAKLKKLGDQ YQSLLGDYTS CQKKIVDNVV QVSCTFSEVF ENFAAIISEL
601: DVLQSFADLA ISCPVPYVRP DITTSEEGDI ILQGSRHPCV EAQDGVNFIP NDCTLARGKS WFQIITGPNM GGKSTFIRQV GVNVLMAQVG SFVPCDQASI
701: SVRDCIFARV GAGDCQLRGV STFMQEMLET ASILKGASEK SLIIIDELGR GTSTYDGFGL AWAICEHLVE VTRAPTLFAT HFHELTALAN KNGDQHQHVP
801: DLGIANYHVG AHIDPSSRKL TMLYKVFIMI MSLFSSGSCV QKVEPGSCDQ SFGIHVAEFA NFPEAVIALA KSKAEELEDF TSAPNLSGEP SDEVGSKRKR
901: VFSPDDVTRG AARARLLLED FSALPLDEVD GSKATEMVAK LKSDFEKDAA SNPWLQQFL
101: RTDHTLELYE GSGSSWRLTK SGTPGNIGSF EDILFANNDM QYSPVTVALF PVFREGQLYV ALSFVDMTNR KLGLAEFPED SRFTNVESAL VELGCKECLL
201: PADFEKSIDL QPLQDAISNC SILLTERKKA EFKSRDLVQD LGRIIRGSVE PVRDLLSQFD YALGTLGALV SYAELLADDT NYGNYTIEKF NLDRYMRLDS
301: AVVRALNIVE GKTDVNKNFS LFGLMNRTCT AGMGKRLLNK WLKQPLIDVN EINNRLDMVQ AFAEDPELRQ GLRQQLKRIS DIDRLTHALR KKSANLQPVV
401: KLYQSCSRIP YIKGVLQQYN GQFSTSVMTR FLSSLEEWLT ENRYGRFANL VETAIDLGQV DNGEYRISPL YSSDLAVLKD ELSEVENHIN NLHMHTATDL
501: DLSVDKQLKL EKGPFGHVFR ISKKDEQKVR KKLTTNYIII ETRKDGVKFT SAKLKKLGDQ YQSLLGDYTS CQKKIVDNVV QVSCTFSEVF ENFAAIISEL
601: DVLQSFADLA ISCPVPYVRP DITTSEEGDI ILQGSRHPCV EAQDGVNFIP NDCTLARGKS WFQIITGPNM GGKSTFIRQV GVNVLMAQVG SFVPCDQASI
701: SVRDCIFARV GAGDCQLRGV STFMQEMLET ASILKGASEK SLIIIDELGR GTSTYDGFGL AWAICEHLVE VTRAPTLFAT HFHELTALAN KNGDQHQHVP
801: DLGIANYHVG AHIDPSSRKL TMLYKVFIMI MSLFSSGSCV QKVEPGSCDQ SFGIHVAEFA NFPEAVIALA KSKAEELEDF TSAPNLSGEP SDEVGSKRKR
901: VFSPDDVTRG AARARLLLED FSALPLDEVD GSKATEMVAK LKSDFEKDAA SNPWLQQFL
001: MEGNFEEQNK LPELKLDAKQ AQGFLSFYKT LPNDTRAVRF FDRKDYYTAH GENSVFIAKT YYHTTTALRQ LGSGSNALSS VSISRNMFET IARDLLLERN
101: DHTVELYEGS GSNWRLVKTG SPGNIGSFED VLFANNEMQD TPVVVSIFPS FHDGRCVIGM AYVDLTRRVL GLAEFLDDSR FTNLESSLIA LGAKECIFPA
201: ESGKSNECKS LYDSLERCAV MITERKKHEF KGRDLDSDLK RLVKGNIEPV RDLVSGFDLA TPALGALLSF SELLSNEDNY GNFTIRRYDI GGFMRLDSAA
301: MRALNVMESK TDANKNFSLF GLMNRTCTAG MGKRLLHMWL KQPLVDLNEI KTRLDIVQCF VEEAGLRQDL RQHLKRISDV ERLLRSLERR RGGLQHIIKL
401: YQSTIRLPFI KTAMQQYTGE FASLISERYL KKLEALSDQD HLGKFIDLVE CSVDLDQLEN GEYMISSSYD TKLASLKDQK ELLEQQIHEL HKKTAIELDL
501: QVDKALKLDK AAQFGHVFRI TKKEEPKIRK KLTTQFIVLE TRKDGVKFTN TKLKKLGDQY QSVVDDYRSC QKELVDRVVE TVTSFSEVFE DLAGLLSEMD
601: VLLSFADLAA SCPTPYCRPE ITSSDAGDIV LEGSRHPCVE AQDWVNFIPN DCRLMRGKSW FQIVTGPNMG GKSTFIRQVG VIVLMAQVGS FVPCDKASIS
701: IRDCIFARVG AGDCQLRGVS TFMQEMLETA SILKGASDKS LIIIDELGRG TSTYDGFGLA WAICEHLVQV KRAPTLFATH FHELTALAQA NSEVSGNTVG
801: VANFHVSAHI DTESRKLTML YKVEPGACDQ SFGIHVAEFA NFPESVVALA REKAAELEDF SPSSMIINNE ESGKRKSRED DPDEVSRGAE RAHKFLKEFA
901: AIPLDKMELK DSLQRVREMK DELEKDAADC HWLRQFL
101: DHTVELYEGS GSNWRLVKTG SPGNIGSFED VLFANNEMQD TPVVVSIFPS FHDGRCVIGM AYVDLTRRVL GLAEFLDDSR FTNLESSLIA LGAKECIFPA
201: ESGKSNECKS LYDSLERCAV MITERKKHEF KGRDLDSDLK RLVKGNIEPV RDLVSGFDLA TPALGALLSF SELLSNEDNY GNFTIRRYDI GGFMRLDSAA
301: MRALNVMESK TDANKNFSLF GLMNRTCTAG MGKRLLHMWL KQPLVDLNEI KTRLDIVQCF VEEAGLRQDL RQHLKRISDV ERLLRSLERR RGGLQHIIKL
401: YQSTIRLPFI KTAMQQYTGE FASLISERYL KKLEALSDQD HLGKFIDLVE CSVDLDQLEN GEYMISSSYD TKLASLKDQK ELLEQQIHEL HKKTAIELDL
501: QVDKALKLDK AAQFGHVFRI TKKEEPKIRK KLTTQFIVLE TRKDGVKFTN TKLKKLGDQY QSVVDDYRSC QKELVDRVVE TVTSFSEVFE DLAGLLSEMD
601: VLLSFADLAA SCPTPYCRPE ITSSDAGDIV LEGSRHPCVE AQDWVNFIPN DCRLMRGKSW FQIVTGPNMG GKSTFIRQVG VIVLMAQVGS FVPCDKASIS
701: IRDCIFARVG AGDCQLRGVS TFMQEMLETA SILKGASDKS LIIIDELGRG TSTYDGFGLA WAICEHLVQV KRAPTLFATH FHELTALAQA NSEVSGNTVG
801: VANFHVSAHI DTESRKLTML YKVEPGACDQ SFGIHVAEFA NFPESVVALA REKAAELEDF SPSSMIINNE ESGKRKSRED DPDEVSRGAE RAHKFLKEFA
901: AIPLDKMELK DSLQRVREMK DELEKDAADC HWLRQFL
Arabidopsis Description
MSH2DNA mismatch repair protein MSH2 [Source:UniProtKB/Swiss-Prot;Acc:O24617]
SUBAcon: [nucleus]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.