Subcellular Localization
min:
: max
Winner_takes_all: nucleus
Predictor Summary:
Predictor Summary:
- nucleus 4
- mitochondrion 1
Predictors | GFP | MS/MS | Papers | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
PPI
Inferred from Arabidopsis experimental PPI
Ath locusA | locusB | Ath locusB | Paper |
---|---|---|---|
AT2G40030.1 | EER89414 | AT1G49590.1 | 23524848 |
AT2G40030.1 | EES00570 | AT2G27040.1 | 19410546 |
AT2G40030.1 | EES18816 | AT2G27040.1 | 19410546 |
AT2G40030.1 | OQU86696 | AT2G27040.1 | 19410546 |
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
Zm00001d053872_P021 | Maize | nucleus | 85.96 | 88.87 |
TraesCS6B01G094400.1 | Wheat | nucleus | 51.66 | 65.35 |
TraesCS6D01G113900.1 | Wheat | nucleus | 62.58 | 64.77 |
TraesCS6B01G151900.1 | Wheat | nucleus | 62.38 | 64.53 |
TraesCS6A01G123800.1 | Wheat | nucleus | 62.48 | 64.3 |
HORVU6Hr1G022890.1 | Barley | nucleus | 62.58 | 63.95 |
VIT_13s0067g03120.t01 | Wine grape | nucleus | 26.7 | 54.02 |
GSMUA_Achr3P05700_001 | Banana | nucleus | 35.63 | 50.1 |
Solyc01g096390.2.1 | Tomato | nucleus | 38.01 | 45.65 |
KXG34194 | Sorghum | nucleus | 33.2 | 42.42 |
CDY60335 | Canola | nucleus | 37.17 | 40.07 |
AT2G40030.1 | Thale cress | nucleus | 38.41 | 39.17 |
KRH20723 | Soybean | nucleus | 39.16 | 39.06 |
KRH13386 | Soybean | nucleus | 39.55 | 38.28 |
Bra000162.1-P | Field mustard | nucleus | 37.82 | 37.01 |
OQU82183 | Sorghum | endoplasmic reticulum, plasma membrane | 16.23 | 22.04 |
CDY07630 | Canola | nucleus | 25.06 | 22.03 |
EES11045 | Sorghum | nucleus | 12.11 | 17.54 |
EES13372 | Sorghum | nucleus | 15.34 | 16.64 |
EES08565 | Sorghum | nucleus | 14.94 | 16.03 |
KXG22455 | Sorghum | nucleus | 13.15 | 15.3 |
Protein Annotations
KEGG:00230+2.7.7.6 | KEGG:00240+2.7.7.6 | Gene3D:1.10.132.30 | Gene3D:1.10.274.100 | Gene3D:1.20.120.1280 | MapMan:15.1.5.1 |
Gene3D:2.40.40.20 | Gene3D:3.10.450.40 | Gene3D:3.30.1490.180 | UniProt:A0A1Z5RLY9 | ncoils:Coil | GO:GO:0003674 |
GO:GO:0003676 | GO:GO:0003677 | GO:GO:0003824 | GO:GO:0003899 | GO:GO:0005488 | GO:GO:0006139 |
GO:GO:0006351 | GO:GO:0008150 | GO:GO:0008152 | GO:GO:0009058 | GO:GO:0009987 | GO:GO:0016740 |
GO:GO:0016779 | InterPro:IPR038120 | EnsemblPlants:OQU84386 | ProteinID:OQU84386 | ProteinID:OQU84386.1 | ProteinID:OQU84387.1 |
PFAM:PF00623 | PFAM:PF04983 | PFAM:PF04997 | PFAM:PF04998 | PFAM:PF11523 | PANTHER:PTHR19376 |
PANTHER:PTHR19376:SF36 | InterPro:RNA_pol_N | InterPro:RNA_pol_Rpb1_1 | InterPro:RNA_pol_Rpb1_3 | InterPro:RNA_pol_Rpb1_5 | InterPro:RNA_pol_asu |
InterPro:Rpb1_funnel_sf | SMART:SM00663 | EnsemblPlantsGene:SORBI_3004G043750 | SUPFAM:SSF64484 | UniParc:UPI000B42367B | SEG:seg |
Description
hypothetical protein
Coordinates
chr4:-:3604376..3618601
Molecular Weight (calculated)
221750.0 Da
IEP (calculated)
5.442
GRAVY (calculated)
-0.526
Length
2015 amino acids
Sequence
(BLAST)
(BLAST)
0001: MEEDHSATLV SEGAIKSIKL SLSTGEEVCT YSVNECPVTH PSQLGNPFLG LPLEAGKCES CGASENDKCE GHFGYIELPV PIFHPCHVSE LRQLLSLICL
0101: KCLRIKKGKV KQSNGKGNLS ATLCSYCRDI PALSVKEVKT ADGAIRLELS APHKRHMTER SWNFLDKYGF HHGGCSQFRS LLPEEALNIL KKVPDDTRRK
0201: LAARGYIVQT GYVMKYLPVP PNCLYIPEFT DGQSIMSYDI SIALLKKVLQ KIEQIKRSRS GSPNFDSHDA ESCDLQLAIG QYIRLRGTTR GPQDNTKRFT
0301: VGSADSAALS TKQWLEKMRT LFISKGSGFS SRSVLTGDPY IGLGVVGLPS EVAKRMTFEE QVTDININRL QEVVDKGLCL TYRDGQATYA ITVGSKGHTT
0401: LKVGQTISRR IVDGDVVFLN RPPSTHKHSL QAFYAYVHDD HTVKINPLMC GPFSADFDGD CVHIYYPQSL AAKAEALELF SVERQLISSH SGKVNLQLGN
0501: DCLVAMKAMS DRTVLHKELA NQLAMFVPFS LLAPAVMKPI PSWTITQIVQ GALPAKLTCQ GDTHLVRDST IIKLDLDKES VQDSFPDLVS SILREKGPRE
0601: ALQFLNVLEP LLMEFLVLGG LSISLRDFNV PKALLEEAQK NIQNQSLVLE QSRCSTSQFV ELRVENNLKS VKQQISDYVG KFSGLGLLID PKKEASMAKV
0701: VQQVGFVGLQ LYREGKLYSR RLVEDCFSSF VNKHSAIGDE YSPEAFGLVQ SSYFHGLNPY EELVHAICTR ETMIRSSRGL SEPGTLFKNL MAILRDVVIC
0801: YDGTVRNICS NSIIQLKYGE DDEADSSSAV PPGEPVGVLA ATAISNPAYK AVLDSSQSNN ASWESMKEIL QTRTSYKNDA KDRKVVLFLS DCSCAKKFCK
0901: ERAALAVQSC LKRVTLGDCA TDICIEHQKQ INLDGTSEAA PTLVGHIHLD KGQLERINIS IQDILQKCQE VSGRYGKKKG HLCHLLKKIT FATCDCSFTQ
1001: MPISGKLHKV PCVQFSFSDE STVLSESVER AVNVIADSVC SVLLDTIIKG DPRIQAAKVI WVESDATAWV KNTRKVSKGE PALEIIVEKD HAVSNGDAWR
1101: TTIDACLPVL DLIDTRRSIP YGIQQVKELI GISCAFDQVV QRLSSTVKMV NKGVLKDHLI LVANSMTCTG SLIGFNIAGY KATFRSLKVQ VPFTESTLFT
1201: PMKCFEKAAE KCDSDSLGCV VSSSSWGKHA AVGTGSSFQI LWNENQLKSN KDYGDGLYDF LALVRTDQEK TGYMFLDDVD YLLEENAIDD MCLSPEPDGT
1301: VGKPTFEDNF EEQNIQKGSS WENGITMKSS WEQDASAAND SGDWGGWSSG GGASAKPADQ DNSWEVHAKV QDNSTDWGGW SSGVGAAAKP ADQDNSWEVH
1401: AKAQDNCTDW GGWSTDKPTG EATVSGQPAE MDTWADKGTK MESGAGDANW EKKSSTPEAS NKNDPWGKSE NTWDKRKGDG GDGGDGAWEK KSVDGHGNWD
1501: HPGNWNGQSL NVDQDTWGNA RGKKKADGNC QWEEQPSTYR RKKTNADHNS SYNNVMPSSD NAWNAGERFG RSNAKSNAGS SWGEKDKMES DEHPKVPKES
1601: DTWNTGKSNE SPWDNTDALQ DSWGVNSATH DNNTEDGSWD KVVAIKDPVS QQDSWSNVAI QKNDAQNDSW DNVAEKALNS ASQDSWGHLA ATPVSNSDAK
1701: QSDSWDGWNA VPAENSQGTA QWKERTDSGN KDWKSDGWGA KSGNWSSQRN NPGRPPRRPD ERGPPPPRQR FELTIEEKKI LLEVEPLIFR VRRIFREACD
1801: GVRLKPEDEK FIQEKILEHH PEKQSKVSSE IDHIMVNKHH TFEDTRCFFV VSTDGSQADF SYLKCLENFV RKNYTEDVDS FCMKYLRPRR RQAPPPDVGT
1901: APGTPAEVPP STAAETEQGT PAPPAEVPQE TLGSPAVALE GTHNPRTDPT DDTELLGKDS DLTPASPAVA PQEAPKPDPT DDTELLGNEK PDLTPSSPGE
2001: ALQATADPDS TLTDI
0101: KCLRIKKGKV KQSNGKGNLS ATLCSYCRDI PALSVKEVKT ADGAIRLELS APHKRHMTER SWNFLDKYGF HHGGCSQFRS LLPEEALNIL KKVPDDTRRK
0201: LAARGYIVQT GYVMKYLPVP PNCLYIPEFT DGQSIMSYDI SIALLKKVLQ KIEQIKRSRS GSPNFDSHDA ESCDLQLAIG QYIRLRGTTR GPQDNTKRFT
0301: VGSADSAALS TKQWLEKMRT LFISKGSGFS SRSVLTGDPY IGLGVVGLPS EVAKRMTFEE QVTDININRL QEVVDKGLCL TYRDGQATYA ITVGSKGHTT
0401: LKVGQTISRR IVDGDVVFLN RPPSTHKHSL QAFYAYVHDD HTVKINPLMC GPFSADFDGD CVHIYYPQSL AAKAEALELF SVERQLISSH SGKVNLQLGN
0501: DCLVAMKAMS DRTVLHKELA NQLAMFVPFS LLAPAVMKPI PSWTITQIVQ GALPAKLTCQ GDTHLVRDST IIKLDLDKES VQDSFPDLVS SILREKGPRE
0601: ALQFLNVLEP LLMEFLVLGG LSISLRDFNV PKALLEEAQK NIQNQSLVLE QSRCSTSQFV ELRVENNLKS VKQQISDYVG KFSGLGLLID PKKEASMAKV
0701: VQQVGFVGLQ LYREGKLYSR RLVEDCFSSF VNKHSAIGDE YSPEAFGLVQ SSYFHGLNPY EELVHAICTR ETMIRSSRGL SEPGTLFKNL MAILRDVVIC
0801: YDGTVRNICS NSIIQLKYGE DDEADSSSAV PPGEPVGVLA ATAISNPAYK AVLDSSQSNN ASWESMKEIL QTRTSYKNDA KDRKVVLFLS DCSCAKKFCK
0901: ERAALAVQSC LKRVTLGDCA TDICIEHQKQ INLDGTSEAA PTLVGHIHLD KGQLERINIS IQDILQKCQE VSGRYGKKKG HLCHLLKKIT FATCDCSFTQ
1001: MPISGKLHKV PCVQFSFSDE STVLSESVER AVNVIADSVC SVLLDTIIKG DPRIQAAKVI WVESDATAWV KNTRKVSKGE PALEIIVEKD HAVSNGDAWR
1101: TTIDACLPVL DLIDTRRSIP YGIQQVKELI GISCAFDQVV QRLSSTVKMV NKGVLKDHLI LVANSMTCTG SLIGFNIAGY KATFRSLKVQ VPFTESTLFT
1201: PMKCFEKAAE KCDSDSLGCV VSSSSWGKHA AVGTGSSFQI LWNENQLKSN KDYGDGLYDF LALVRTDQEK TGYMFLDDVD YLLEENAIDD MCLSPEPDGT
1301: VGKPTFEDNF EEQNIQKGSS WENGITMKSS WEQDASAAND SGDWGGWSSG GGASAKPADQ DNSWEVHAKV QDNSTDWGGW SSGVGAAAKP ADQDNSWEVH
1401: AKAQDNCTDW GGWSTDKPTG EATVSGQPAE MDTWADKGTK MESGAGDANW EKKSSTPEAS NKNDPWGKSE NTWDKRKGDG GDGGDGAWEK KSVDGHGNWD
1501: HPGNWNGQSL NVDQDTWGNA RGKKKADGNC QWEEQPSTYR RKKTNADHNS SYNNVMPSSD NAWNAGERFG RSNAKSNAGS SWGEKDKMES DEHPKVPKES
1601: DTWNTGKSNE SPWDNTDALQ DSWGVNSATH DNNTEDGSWD KVVAIKDPVS QQDSWSNVAI QKNDAQNDSW DNVAEKALNS ASQDSWGHLA ATPVSNSDAK
1701: QSDSWDGWNA VPAENSQGTA QWKERTDSGN KDWKSDGWGA KSGNWSSQRN NPGRPPRRPD ERGPPPPRQR FELTIEEKKI LLEVEPLIFR VRRIFREACD
1801: GVRLKPEDEK FIQEKILEHH PEKQSKVSSE IDHIMVNKHH TFEDTRCFFV VSTDGSQADF SYLKCLENFV RKNYTEDVDS FCMKYLRPRR RQAPPPDVGT
1901: APGTPAEVPP STAAETEQGT PAPPAEVPQE TLGSPAVALE GTHNPRTDPT DDTELLGKDS DLTPASPAVA PQEAPKPDPT DDTELLGNEK PDLTPSSPGE
2001: ALQATADPDS TLTDI
0001: MEEESTSEIL DGEIVGITFA LASHHEICIQ SISESAINHP SQLTNAFLGL PLEFGKCESC GATEPDKCEG HFGYIQLPVP IYHPAHVNEL KQMLSLLCLK
0101: CLKIKKAKGT SGGLADRLLG VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV KEILRRIPEE SRKKLTAKGH
0201: IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD
0301: SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV
0401: HRRVMDGDVV FINRPPTTHK HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL SSHTGQLILQ MGSDSLLSLR
0501: VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF
0601: FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG
0701: FLGLQLSDKK KFYTKTLVED MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK NLMAVLRDIV ITNDGTVRNT
0801: CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR
0901: NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG
1001: SDMPCLTFSY NATDPDLERT LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA VKQSGDAWRV VIDSCLSVLH
1101: LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK
1201: CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ
1301: NLHDEGKPSG ANWEKSSSWD NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI KTKDADADTT PNWETSPAPK
1401: DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS
1501: SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD
1601: AAAWGSSDKN NSETESDAAA WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW GNPAKKFPSS GGWSNGGGAD
1701: WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH
1801: TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ
1901: SPSQAQAQSP SQTQSQSQSQ SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT
0101: CLKIKKAKGT SGGLADRLLG VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV KEILRRIPEE SRKKLTAKGH
0201: IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD
0301: SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV
0401: HRRVMDGDVV FINRPPTTHK HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL SSHTGQLILQ MGSDSLLSLR
0501: VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF
0601: FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG
0701: FLGLQLSDKK KFYTKTLVED MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK NLMAVLRDIV ITNDGTVRNT
0801: CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR
0901: NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG
1001: SDMPCLTFSY NATDPDLERT LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA VKQSGDAWRV VIDSCLSVLH
1101: LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK
1201: CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ
1301: NLHDEGKPSG ANWEKSSSWD NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI KTKDADADTT PNWETSPAPK
1401: DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS
1501: SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD
1601: AAAWGSSDKN NSETESDAAA WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW GNPAKKFPSS GGWSNGGGAD
1701: WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH
1801: TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ
1901: SPSQAQAQSP SQTQSQSQSQ SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT
Arabidopsis Description
NRPE1DNA-directed RNA polymerase V subunit 1 [Source:UniProtKB/Swiss-Prot;Acc:Q5D869]
SUBAcon: [nucleus]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.