Soybean
Subcellular Localization
Winner_takes_all: nucleus

Predictor Summary:
  • nucleus 3
  • mitochondrion 1
  • cytosol 1
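
The "Winner_takes_all" call above can be read as a plurality vote over the individual predictor outputs. The sketch below illustrates that reading with the tallies shown in the Predictor Summary. The function name `winner_takes_all` and the tie-breaking rule (first location seen wins) are assumptions made for illustration; this page does not state how CropPAL resolves ties or whether it weights predictors.

```python
from collections import Counter

def winner_takes_all(predictions):
    """Return the most frequently predicted location and the full tally.

    Assumption: a plain plurality vote. On a tie, Counter.most_common
    keeps first-seen order, so the earliest location in the input wins.
    """
    if not predictions:
        raise ValueError("no predictions supplied")
    tally = Counter(predictions)
    location, _votes = tally.most_common(1)[0]
    return location, tally

# Tallies from the Predictor Summary: nucleus 3, mitochondrion 1, cytosol 1.
calls = ["nucleus"] * 3 + ["mitochondrion", "cytosol"]
winner, tally = winner_takes_all(calls)
print(winner, dict(tally))
```

With five calls and three of them for the nucleus, the plurality result matches the location reported on this page.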
Homology

Paralog

locus  Identity  Homology Identity

Ortholog

locus                  Homology Species  Location  Identity  Homology Identity
KRH20723               Soybean           nucleus   85.45     88.07
VIT_13s0067g03120.t01  Wine grape        nucleus   32.76     68.47
Solyc01g096390.2.1     Tomato            nucleus   44.86     55.66
CDY60335               Canola            nucleus   43.56     48.53
GSMUA_Achr3P05700_001  Banana            nucleus   32.47     47.17
AT2G40030.1            Thale cress       nucleus   44.67     47.06
Bra000162.1-P          Field mustard     nucleus   44.24     44.73
TraesCS6B01G094400.1   Wheat             nucleus   33.09     43.25
TraesCS6B01G151900.1   Wheat             nucleus   38.23     40.86
TraesCS6D01G113900.1   Wheat             nucleus   38.18     40.83
TraesCS6A01G123800.1   Wheat             nucleus   38.28     40.7
HORVU6Hr1G022890.1     Barley            nucleus   38.38     40.52
Zm00001d053872_P021    Maize             nucleus   37.75     40.33
OQU84386               Sorghum           nucleus   38.28     39.55
CDY07630               Canola            nucleus   30.07     27.31
KRH27976               Soybean           nucleus   16.14     23.11
KRH77477               Soybean           nucleus   15.95     22.79
KRG88695               Soybean           nucleus   11.72     17.54
KRH28855               Soybean           nucleus   14.79     16.86
KRH40807               Soybean           nucleus   14.31     16.82
KRH76591               Soybean           nucleus   14.75     16.8
KRH71957               Soybean           nucleus   10.13     16.38
KRH33046               Soybean           nucleus   12.44     15.69
Protein Annotations
KEGG:00230+2.7.7.6  KEGG:00240+2.7.7.6  Gene3D:1.20.120.1280  EntrezGene:100801412  MapMan:15.1.5.1  Gene3D:2.40.40.20
Gene3D:3.10.450.40  Gene3D:3.30.1490.180  UniProt:A0A0R0G598  EMBL:ACUP02009712  EnsemblPlantsGene:GLYMA_15G235900  GO:GO:0001056
GO:GO:0003674  GO:GO:0003676  GO:GO:0003677  GO:GO:0003824  GO:GO:0003899  GO:GO:0005488
GO:GO:0005575  GO:GO:0005622  GO:GO:0005623  GO:GO:0005634  GO:GO:0005654  GO:GO:0005666
GO:GO:0006139  GO:GO:0006351  GO:GO:0006383  GO:GO:0008150  GO:GO:0008152  GO:GO:0009058
GO:GO:0009987  GO:GO:0016740  GO:GO:0016779  EnsemblPlants:KRH13386  ProteinID:KRH13386  ProteinID:KRH13386.1
ProteinID:KRH13387.1  ProteinID:KRH13388.1  PFAM:PF00623  PFAM:PF04983  PFAM:PF04997  PFAM:PF04998
PFAM:PF11523  PANTHER:PTHR19376  PANTHER:PTHR19376:SF36  InterPro:RNA_pol_N  InterPro:RNA_pol_Rpb1_1  InterPro:RNA_pol_Rpb1_3
InterPro:RNA_pol_Rpb1_5  InterPro:RNA_pol_asu  SMART:SM00663  SUPFAM:SSF64484  UniParc:UPI0006EE11DA  SEG:seg
Description
hypothetical protein
Coordinates
chr15:-:44346879..44363540
Molecular Weight (calculated)
231368.0 Da
IEP (calculated)
5.984
GRAVY (calculated)
-0.585
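
GRAVY (grand average of hydropathy) is conventionally the mean Kyte-Doolittle hydropathy value over all residues in a sequence; a negative value, as here, indicates an overall hydrophilic protein. The sketch below shows that calculation. It assumes the standard Kyte-Doolittle scale and skips any character outside the 20 standard amino acids; the page does not document CropPAL's exact procedure, so the function `gravy` is an illustration and not the database's implementation.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def gravy(sequence):
    """Mean Kyte-Doolittle hydropathy of a protein sequence.

    Whitespace and non-standard residue codes are ignored (an assumption;
    other tools may raise an error on them instead).
    """
    residues = [aa for aa in sequence.upper() if aa in KYTE_DOOLITTLE]
    if not residues:
        raise ValueError("sequence contains no standard amino acids")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)

# Spaces between 10-residue blocks, as in the Sequence listing, are skipped.
example = gravy("MEDNPPSSVL DGTVVGIKFG")
print(round(example, 3))
```

Applying `gravy` to the full sequence in the Sequence section, with the position labels such as `0001:` removed, is the intended use.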
Length
2082 amino acids
Sequence
0001: MEDNPPSSVL DGTVVGIKFG MATRQEICTA SISDSSISHA SQLSNPFLGL PLEFGRCESC GTSEVGKCEG HFGYIELPIP IYHPSHISDL KRMLSMVCLN
0101: CLKLRKTKLP ASSSGLAQRL ISPCCQEDKA ALVSIREVKT SDGACYLALK VSKSKMQNGF WSFLEKYGYR YGGDHTRALL PCEAMEIIKR IPIETKKKLA
0201: GKGYFPQDGY VLKYLPVPPN CLSVPEVSDG VSVMSSDPSI TILRKLLRKV EIIKSSRSGE PNFESHHVEA NDLQSVVDQY FQIRGTSKPA RDIETHFGVN
0301: KELTASSTKA WLEKMRTLFI RKGSGFSSRN VITGDCYKRI NEVGIPVEVA QRITFEERVN IHNIRYLQKL VDEHLCLTYK EGGSTYSLRE GSKGHIYLKP
0401: GQIVHRRIMD GDIVFINRPP TTHKHSLQAL YVYIHEDHTV KINPLICGPL GADFDGDCVH LFYPQSLAAK AEVVELFSVE NQLLSSHSGN LNLQLSTDSL
0501: LSLKMLVKRC FFDRAAANQL AMFILLPLPR PALLKASSGD ACWTSIQILQ CALPLGFDCT GGRYLIRQSE ILEFEFSRDV LPATVNEIAA SVFFGKGPKE
0601: ALNFFDVLQP FLMESLFAEG FSVSLEEFSI SRAIKRIIRK SIGKVSSLLY QLRSLYNELV AQQLEKHIRD VELPIINFAL KSTKLGDLID SKSKSAIDKV
0701: VQQIGFLGQQ LFDRGRFYSK GLVDDVASHF HAKCCYDGDG YPSAEYGLLK GCFFNGLDPY EEMVHSISTR EIMVRSSRGL SEPGTLFKNL MAILRDVVIC
0801: YDGTVRNICS NSIIQFEYGI QAGDKSEHLF PAGEPVGVLA ATAMSNPAYK AVLDASPSSN SSWELMKEIL LCKVNFRNEL VDRRVILYLN DCDCGGSYCR
0901: ENAAYSVKDQ LRKVSLKDAA VEFIIEYQQQ RTQKENSETD VGLVGHIYLD EMMLEELKIS MAYVFDKCHE RLKSFSQKKK VNQSLKNIEL SFSESCSSSH
1001: PAAPCLTFWL KNYDSDLDNA VKVLAEKICP VLFKTIIQGD PRISSASIIW VSPDTNTWVR NPYKSSNGEL ALDIILEKEA VKQSGDAWRV VLDACLPVLH
1101: LIDTRRSIPY AIKQIQELLG ISCTFDQAIQ RVAASVKMVA KGVLREHLIL LASSMTCGGN LVGFNIGGYK ALSRQLNIQV PFTDATLFTP KKCFERAAEK
1201: CHTDSLSSIV ASCSWGKHVA VGTGSKFDVV WDANEIKSNE IEGMDVYSFL HMVKSFTNGE EETDACLGED IDDLLEEEYM DLGMSPQHNS GFEAVFEENP
1301: EVLNGSTSNG WDVSSNQGES KTNEWSGWAS SNKAEIKDGR SEIAPKNSWG KTVNQEDSSK SNPWSTSTIA DQTKTKSNEW SAWGSNKSEI PVGWASSNKT
1401: EIKDGRSETA QENSWGKTVN QEDSSKSNAW NTSTTVDHAN TKSNEWSAWG SNQSEIPAGG SKAVQEDSWG SSKWKADVAQ EDNSRLGAWD ANAADQTKSS
1501: EWSGWGKKKD VTQEDNSRLG AWDANAADQT KSRDWSGWGK KKDITQEDNS RLGAWDANAA DQTKSSEWSG WGKKKDVTQE DNSRLGAWDA NTADQTKSNE
1601: WSGWGKKKEV TQEDNSRLGA WDANTADQTK SNEWSGWGKK KEVTQEDNSR LGAWDANAAD QTKSNEWSGW GKKKDVTQED NSRLGAWDAN AADQTKSNEW
1701: SDWGKKKEVT QEDNVQDSWG SGKRKDKVTQ EDNSGSGGWG ANRTDLAKSK SSEWSSWGKN KSEIPAGGSE NVQNDSWGSG KLEDDTQKEN SGSAWVRNKA
1801: ETIDGGSEKP QEDAWNSGNW KAESKVGNAS WGKPKSSESQ AWDSHNQSNQ NSSSQGWESH IASANSESEK GFQWGKQGRD SFKKNRFEGS QGRGSNAGDW
1901: KNRNRPPRAP GQRLDIYSSG EQDVLKDIEP IMQSIRRIMQ QQGYNDGDPL AAEDQLFVLE NVFEHHPDKE TKMGTGIDYV MVNKHSSFQE SRCFYVVCKD
2001: GESKDFSYRK CLANYISKKY PDLAESFLGK YFRKPRARGD QTATPGRDEA ATPGEQTATP GRDEAATPAE QISTPTPMET NE
Best Arabidopsis Sequence Match (AT2G40030.1)
0001: MEEESTSEIL DGEIVGITFA LASHHEICIQ SISESAINHP SQLTNAFLGL PLEFGKCESC GATEPDKCEG HFGYIQLPVP IYHPAHVNEL KQMLSLLCLK
0101: CLKIKKAKGT SGGLADRLLG VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV KEILRRIPEE SRKKLTAKGH
0201: IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD
0301: SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV
0401: HRRVMDGDVV FINRPPTTHK HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL SSHTGQLILQ MGSDSLLSLR
0501: VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF
0601: FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG
0701: FLGLQLSDKK KFYTKTLVED MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK NLMAVLRDIV ITNDGTVRNT
0801: CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR
0901: NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG
1001: SDMPCLTFSY NATDPDLERT LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA VKQSGDAWRV VIDSCLSVLH
1101: LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK
1201: CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ
1301: NLHDEGKPSG ANWEKSSSWD NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI KTKDADADTT PNWETSPAPK
1401: DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS
1501: SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD
1601: AAAWGSSDKN NSETESDAAA WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW GNPAKKFPSS GGWSNGGGAD
1701: WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH
1801: TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ
1901: SPSQAQAQSP SQTQSQSQSQ SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT
Arabidopsis Description
NRPE1 DNA-directed RNA polymerase V subunit 1 [Source:UniProtKB/Swiss-Prot;Acc:Q5D869]
SUBAcon: [nucleus]
Hydropathy Plot

About CropPAL

The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera), as well as precomputed subcellular localization predictions derived from protein sequences.