Subcellular Localization
Winner_takes_all: nucleus
Predictor Summary:
- nucleus 5
- plastid 2
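The winner-takes-all call can be read as a simple majority vote over the per-predictor calls. The following is a minimal sketch, assuming the summary counts (nucleus 5, plastid 2) are the only input. The function name `winner_takes_all` is hypothetical, and this page does not state how ties are broken, so the sketch returns every tied location.

```python
from collections import Counter


def winner_takes_all(votes):
    """Return the location(s) with the highest predictor count.

    Hypothetical helper, not CropPAL code. `votes` may be a mapping of
    location -> count or an iterable of location names. Ties are
    returned together, sorted by name.
    """
    counts = Counter(votes)
    if not counts:
        return []
    top = max(counts.values())
    return sorted(loc for loc, n in counts.items() if n == top)


# Counts taken from the Predictor Summary.
summary = {"nucleus": 5, "plastid": 2}
print(winner_takes_all(summary))
```

With the counts listed here, nucleus holds the single highest count, which agrees with the reported call.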
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
PPI
Inferred distinct locusB in Crop
locusB | locations |
---|---|
GSMUA_Achr4P32570_001 | |
GSMUA_Achr6P24690_001 | |
Inferred from Arabidopsis experimental PPI
Ath locusA | locusB | Ath locusB | Paper |
---|---|---|---|
AT5G56250.1 | GSMUA_Achr4P32570_001 | AT1G49850.1 | 21798944 |
AT5G56250.1 | GSMUA_Achr6P24690_001 | AT1G49850.1 | 21798944 |
Homology
Paralog
locus | Identity | Homology Identity |
---|---|---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
GSMUA_Achr10P... | Banana | nucleus | 31.84 | 37.11 |
GSMUA_Achr5P04160_001 | Banana | nucleus | 22.34 | 31.61 |
GSMUA_Achr4P01620_001 | Banana | nucleus | 29.71 | 30.86 |
GSMUA_Achr10P... | Banana | nucleus | 25.23 | 30.26 |
GSMUA_Achr9P12690_001 | Banana | nucleus | 28.72 | 29.91 |
GSMUA_Achr3P10940_001 | Banana | nucleus | 27.13 | 27.29 |
Os04t0471400-01 | Rice | nucleus | 24.7 | 26.23 |
Zm00001d003333_P005 | Maize | nucleus | 24.32 | 25.83 |
OQU81762 | Sorghum | nucleus | 24.24 | 25.81 |
Zm00001d025668_P005 | Maize | nucleus | 24.09 | 25.59 |
TraesCS2D01G332100.1 | Wheat | nucleus | 22.72 | 24.21 |
TraesCS2B01G351400.1 | Wheat | nucleus | 22.49 | 24.01 |
TraesCS2A01G330900.1 | Wheat | nucleus | 22.49 | 23.95 |
CDY36785 | Canola | nucleus | 10.18 | 23.1 |
CDY06514 | Canola | nucleus, plastid | 11.63 | 22.21 |
Bra035611.1-P | Field mustard | nucleus, plastid | 12.23 | 22.09 |
CDY36786 | Canola | nucleus, plastid | 12.54 | 21.91 |
Bra028941.1-P | Field mustard | nucleus, plastid | 12.54 | 21.91 |
CDY20740 | Canola | plastid | 12.23 | 21.79 |
AT5G56250.1 | Thale cress | nucleus | 13.37 | 21.7 |
CDY62101 | Canola | nucleus | 10.87 | 20.91 |
CDY11950 | Canola | nucleus | 11.78 | 20.39 |
CDY17406 | Canola | nucleus, plastid | 13.75 | 19.4 |
CDY06515 | Canola | nucleus, plastid | 12.92 | 18.81 |
Bra002832.1-P | Field mustard | nucleus | 13.83 | 18.78 |
CDY32802 | Canola | nucleus | 8.51 | 18.67 |
Bra002831.1-P | Field mustard | nucleus | 8.51 | 18.67 |
CDY32803 | Canola | nucleus | 13.45 | 18.63 |
CDY11951 | Canola | nucleus, plastid | 12.54 | 18.5 |
Bra028942.1-P | Field mustard | nucleus | 13.15 | 18.04 |
AT5G56240.3 | Thale cress | nucleus | 13.3 | 17.73 |
VIT_19s0015g01810.t01 | Wine grape | nucleus | 18.46 | 17.47 |
Bra035610.1-P | Field mustard | nucleus, plastid | 12.69 | 17.43 |
Solyc07g066200.2.1 | Tomato | nucleus | 17.63 | 17.31 |
PGSC0003DMT400056953 | Potato | nucleus | 17.93 | 16.92 |
CDY65749 | Canola | nucleus | 11.09 | 16.55 |
CDY36783 | Canola | cytosol | 1.22 | 13.45 |
Protein Annotations
EnsemblPlants:GSMUA_Achr7P20270_001 | EnsemblPlants:GSMUA_Achr7T20270_001 | EnsemblPlantsGene:GSMUA_Achr7G20270_001 | PANTHER:PTHR35767 | PANTHER:PTHR35767:SF1 | SEG:seg |
UniParc:UPI000295A5F5 | UniProt:M0TJ64 | MapMan:35.2 |
Description
Putative expressed protein [Source:GMGC_GENE;Acc:GSMUA_Achr7G20270_001]
Coordinates
chr7:-:23344248..23349962
Molecular Weight (calculated)
145416.0 Da
IEP (calculated)
8.305
GRAVY (calculated)
-0.649
Length
1316 amino acids
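GRAVY is conventionally the mean Kyte-Doolittle hydropathy value over all residues of a protein. The following is a minimal sketch, assuming that convention is what "(calculated)" refers to here; the page does not say which tool or scale produced the value. The function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy scale (Kyte & Doolittle, 1982).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Mean Kyte-Doolittle hydropathy of a protein sequence.

    Hypothetical helper. Only the 20 standard residues are supported;
    any other symbol (for example X) raises KeyError.
    """
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


# First ten residues of the sequence shown on this page.
print(round(gravy("MLSTEDPPDP"), 3))
```

A negative value, like the one reported for this protein, indicates an overall hydrophilic sequence.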
Sequence
0001: MLSTEDPPDP ACSSKLSAVR TNARSSYTSL PSQEEDPVVL EERTTPNFSI RDYVFTSRSK GIEASWPFAP QFLQLCLKHG VNDLLPPFEH PDIVRTQSIG
0101: EVEESVQSAA YAKSTFYEGF KANKLSHSDV GIALTARTSQ QAEVISSQID ILPCSISQSG NSVKTQFQPE FSDPSHNLEK FGEPLEKNRL VPQLGTILET
0201: SQADHLVNNP GMIMHPMASK VCPVCKTFSF TSITTLNAHI DQCLSMGSNC KEVVNHVAKY TVKPRKKRLM VDIYATAPCC TLEDLDKRNG TNWASELALL
0301: AAPINKVSTH SKRTEVSPTN STHDGNGTVH VDSCGGKLSV LSMFNEQTLS SENFELRNHA KESKESMGFL SSKKNNFAPE HLKSMNSEAQ KEQLISFDML
0401: PKQIQAAAEK DCRTESHQKN AKSPSHVSDS RDHDKSFASA TIKQWARSRR SDILQKWTRK GNCSKLDDMI PITRSTQITS IQPDPGRSTD MKTQPLKLPR
0501: LSENMTGSPK TNRVGFLHNA VCSMDERKRE SPELPSSSSR WPSKGAGSAN GLILKLSRSS GHFTCSSIMK SKETNTGTQE HFDNTSKRKM VISKSCSMLR
0601: DRRSPTLKKN VMVKRPFCLE ARKVRAIEKQ SIFKKFHKHR TILRTGQKGL PRSNNAGVCS PTDNTHLLRK KVKRTLRSHQ SYTPGSITKI GEGEVMNEVL
0701: PSRENIREYS SIMEQQVNNS LENTTAGAQS LDTEIETSGT QVAIVDSGDY VTKTCVEEAV CDPTAYDNVN SEKTEPRLST QSHSCSCEED VQPISESEAG
0801: AEQPKQICDD QQKFLGNGSS NEIGNQEIPM ADVRGTKDSC AIQPKECHTD SSSVQESSDC LTSHGDVDLE LAEKGSSINS VRIAASHIEN LDSKGEPSGS
0901: SVSTVSAVSL PSPTDSKSQN FETEPTSRAI SDQDNLHLAI PSAGRTEATR YLNMEKRKQE KRGSVPDKEP DRCLDDKCFF YSCKESLSRD SEICSENSIA
1001: RTTKLKCVPN LCIGPRNSSS FSIYENHTSN AVVNSKTGPP NQFASAKSSL DPTCNSTLPQ TQSVSSPTLR LMGKNLVVVN HKDLVQPQTT AFDCTQQVNL
1101: HRNGCTSTNN RLKQENFLYQ HGQLSSGSPS FGPALLMSDH HMSLNLHVTP VSGFAWTPLQ NGYPAKPDQQ TQQRNSHKKL KSSHSIMDKV IVIDDSPKHQ
1201: TDAEVSLSAS ASTLPLTPSE RPVTCYPLQQ QIRDYPRPLL PNVYSSANSS FMNQGIEKGP FLSSPTLFQF PIAQRGPSMN KKTGWSFTWL FSFVIMKKNI
1301: GILYNEKFYK QLKLLG
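The numbered block can be joined back into one contiguous sequence. The following is a minimal sketch, assuming each line has the form `offset: residues` with residues grouped by spaces, as displayed here. The function name `parse_numbered_block` is hypothetical.

```python
def parse_numbered_block(lines):
    """Join '0001: AAAA BBBB' style lines into one sequence string.

    Hypothetical helper. Each line's leading offset must equal the
    number of residues read so far plus one, otherwise ValueError is
    raised (this catches missing or repeated lines).
    """
    parts = []
    total = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        offset, _, residues = line.partition(":")
        chunk = "".join(residues.split())
        if int(offset) != total + 1:
            raise ValueError(
                f"offset {offset} does not follow {total} residues"
            )
        parts.append(chunk)
        total += len(chunk)
    return "".join(parts)


# Shortened, made-up input in the same layout as the block on this page.
example = ["0001: MLSTE DPPDP", "0011: ACS"]
print(parse_numbered_block(example))
```

In the block shown here, the final line starts at offset 1301 and carries 16 residues, which is consistent with the stated length of 1316 amino acids.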
001: MFLSTENPPN DPLSSSSSPF LQHLTSSSHE LGQSHLSNFS IRDYAYSNRK NNIKNNWPFS SKSLQLFSTH GVTNPLPPFQ KFSTVSSKFE TTASPSSGKQ
101: IVSSYVHQGR DLDLAKLGLN QTVAETSSKG VCSQSRIIEN GLFPSTSVSK SEVEILVATT SNKKDNHSRK CGRGMVKSKE DSCAGLVTTS ESIMASKTCP
201: ICKTFSSASN TTLNAHIDQC LSVDSALLPP VVFSKPNKPR SKPPRVKVKT MVDIYASAKQ GTLEDLDRRN GTKWVSILSY SNRVVADKSE VSKKRKVSPV
301: GVGPVYIDAK GQKLRILSGF SEKKSSTTPL REQHEDGSSD KKCLGQGSKG TNKSLRKIRR GKKPHKFVKL TNHKADGPEQ IRGVQRGFSG EGSHMGHHRR
401: IYNQRMLAKR GLVSKKLNEK GHELSEDDED TWSGGDPTVL RGTDLSATDS YPLKKQKLGS EVAGRKKTLF RSQSAQSRSF RVPQSEKEDE SLEGVNINRL
501: KKSVASFQED KYPPGKKFCS DASPRGTSMR KFSPPFVPNA WRRLSMPVEL KKARLDFSEE KDDEETGKWE SEMTHERELR DDDYVSGDDG ENNEVLLRSN
601: PSSSGYDDYN DDDEESSEEE GDNNKRAHVL DQTDYTGAEF YQSESDSPTS IEILPSERAM YYSEAGNMIY GQTSCKEDER FDSEVGQGSL FVEVDTIPIP
701: GPPGSFLPSP RDMGFDENLG NSSVITSQVQ SSMDQLDRNS SESPVSAVSN FAAGRLNFPA ELSSFRENFS PDIAMSYSTT PMSFCVPSHH GTITEAEPIT
801: IDKTISPSRF RNNDQESCCC QRKERISEGI TLNHQGSHLL QRRAASSSNT MNLTNSPTRL DPNHPFEQSP YKTQQALDLQ MSKFSSRKSL NAVVPPSPSN
901: PVLRLMGKDL MVMNQGEADE EASRSSLTPN PQFVDPPCGG TGLYFNTGLY LRNSFDSTHQ PQVQAQQQSQ AAAFRNNFDH VRYFSPS
Arabidopsis Description
BEST Arabidopsis thaliana protein match is: hapless 8 (TAIR:AT5G56250.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: /.../BLink). [Source:TAIR;Acc:AT5G56240]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata), and Wine grape (Vitis vinifera), as well as precomputed predictions of protein subcellular localization made from protein sequences.