Soybean
Subcellular Localization
Winner_takes_all: nucleus

Predictor Summary:
  • nucleus 4
  • plastid 3
  • cytosol 1
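The winner-takes-all call can be read as a plurality vote over the predictor counts listed above. The following is a minimal sketch under that assumption. The function name `winner_takes_all` is hypothetical, and CropPAL's actual procedure may differ (for example, it may weight experimental evidence differently from predictions).

```python
from collections import Counter

# Predictor votes copied from the Predictor Summary above.
votes = Counter({"nucleus": 4, "plastid": 3, "cytosol": 1})


def winner_takes_all(votes: Counter) -> str:
    """Return the location with the most predictor votes.

    Assumption: a simple plurality vote. On a tie, Counter.most_common
    keeps insertion order, so the first-listed location wins.
    """
    if not votes:
        raise ValueError("no predictor votes supplied")
    location, _count = votes.most_common(1)[0]
    return location


print(winner_takes_all(votes))  # nucleus
```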
PPI

Inferred distinct locusB in Crop

Homology

Paralog

Locus  Identity  Homology Identity

Ortholog

Locus                  Homology Species  Location          Identity  Homology Identity
KRH35569               Soybean           cytosol           88.79     90.24
KRH35574               Soybean           nucleus           20.49     47.64
VIT_10s0116g00720.t01  Wine grape        nucleus           45.14     43.93
Bra029526.1-P          Field mustard     nucleus, plastid  35.86     43.52
AT4G04885.1            Thale cress       nucleus           37.14     43.07
CDX94585               Canola            nucleus           34.9      43.03
CDY16640               Canola            nucleus           35.22     42.64
PGSC0003DMT400075619   Potato            nucleus           44.29     42.52
Solyc12g094490.1.1     Tomato            nucleus           43.54     41.85
GSMUA_Achr6P18930_001  Banana            nucleus           34.79     39.66
TraesCS7A01G294500.5   Wheat             nucleus           32.12     32.26
TraesCS7D01G290300.2   Wheat             nucleus           32.02     32.19
Os08t0187700-01        Rice              cytosol           32.44     31.31
OQU80002               Sorghum           nucleus           31.48     30.89
Zm00001d049442_P009    Maize             nucleus           31.7      30.71
HORVU7Hr1G069840.2     Barley            plastid           31.91     30.45
Zm00001d000023_P004    Maize             nucleus           31.27     30.39
TraesCS7B01G181900.3   Wheat             nucleus           31.8      29.86
KRH32653               Soybean           nucleus           24.01     23.08
KRG96135               Soybean           nucleus           23.69     22.27
KRH67849               Soybean           nucleus           23.59     21.37
KRG96139               Soybean           nucleus           23.37     21.26
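Because species names and locations in the homology rows can contain spaces ("Field mustard", "nucleus, plastid"), splitting a row on whitespace alone is ambiguous. The sketch below is one way to parse a row, assuming the locus is the first token, the two numeric columns are the last two tokens, and locations come from a small known vocabulary. The `LOCATIONS` set only covers the locations that appear in this table, and `parse_homology_row` is a hypothetical helper, not part of any CropPAL tooling.

```python
# Locations seen in the table above; extend this set for other records (assumption).
LOCATIONS = {"nucleus", "plastid", "cytosol"}


def parse_homology_row(line: str) -> dict:
    """Split one homology row into locus, species, locations and identities."""
    tokens = line.split()
    if len(tokens) < 5:
        raise ValueError(f"unexpected row: {line!r}")
    locus = tokens[0]
    identity = float(tokens[-2])
    homology_identity = float(tokens[-1])
    middle = tokens[1:-2]
    # Walk backwards over trailing location tokens; the rest is the species name.
    split_at = len(middle)
    while split_at > 0 and middle[split_at - 1].rstrip(",") in LOCATIONS:
        split_at -= 1
    return {
        "locus": locus,
        "species": " ".join(middle[:split_at]),
        "locations": [token.rstrip(",") for token in middle[split_at:]],
        "identity": identity,
        "homology_identity": homology_identity,
    }


row = parse_homology_row("Bra029526.1-P Field mustard nucleus, plastid 35.86 43.52")
print(row["species"], row["locations"])
```

Parsing from both ends keeps the sketch independent of column alignment, so it works on single-space and padded rows alike.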
Protein Annotations
Gene3D:1.25.40.90  EntrezGene:100794796  MapMan:16.2.1.5.2  EMBL:ACUP02012669  InterPro:CID_dom  InterPro:ENTH_VHS
EnsemblPlantsGene:GLYMA_20G142500  GO:GO:0000993  GO:GO:0003674  GO:GO:0003676  GO:GO:0003723  GO:GO:0003729
GO:GO:0005488  GO:GO:0005515  GO:GO:0005575  GO:GO:0005622  GO:GO:0005623  GO:GO:0005634
GO:GO:0005654  GO:GO:0005737  GO:GO:0005849  GO:GO:0006139  GO:GO:0006369  GO:GO:0006378
GO:GO:0006379  GO:GO:0008150  GO:GO:0008152  GO:GO:0009058  GO:GO:0009987  InterPro:IPR006569
InterPro:IPR008942  InterPro:IPR013087  UniProt:K7N3F0  EnsemblPlants:KRG91241  ProteinID:KRG91241  ProteinID:KRG91241.1
PFAM:PF04818  ScanProsite:PS00028  PFscan:PS51391  PANTHER:PTHR15921  InterPro:RNA_pol_II-bd  SMART:SM00582
SUPFAM:SSF48464  UniParc:UPI00023DBD4E  InterPro:Znf_C2H2_type  SEG:seg
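Each annotation pairs a source database with an identifier, joined by the first colon (GO entries carry their own `GO:` prefix inside the identifier). As a rough sketch, individual entries can be grouped by database as shown below. The helper name `group_annotations` is hypothetical, and the sample list is a small subset of the entries on this record.

```python
from collections import defaultdict


def group_annotations(entries):
    """Group 'Database:identifier' strings by database name.

    Only the first colon separates database from identifier, so
    'GO:GO:0005634' becomes database 'GO' and identifier 'GO:0005634'.
    """
    groups = defaultdict(list)
    for entry in entries:
        database, separator, identifier = entry.partition(":")
        if not separator or not identifier:
            continue  # skip empty or malformed entries
        groups[database].append(identifier)
    return dict(groups)


# A few entries copied from this record.
sample = [
    "Gene3D:1.25.40.90",
    "GO:GO:0005634",
    "GO:GO:0006378",
    "InterPro:IPR006569",
    "InterPro:CID_dom",
    "PFAM:PF04818",
    "UniProt:K7N3F0",
]
grouped = group_annotations(sample)
print(grouped["GO"])        # ['GO:0005634', 'GO:0006378']
print(grouped["InterPro"])  # ['IPR006569', 'CID_dom']
```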
Description
hypothetical protein
Coordinates
chr20:-:38101385..38107264
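The coordinate string packs chromosome, strand, and start and end positions into one field. A minimal parsing sketch follows. It assumes the layout `chromosome:strand:start..end` seen here and treats the positions as 1-based and inclusive, which is an assumption about CropPAL's convention rather than something stated on the page. `Locus` and `parse_coordinates` are hypothetical names.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    chromosome: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_coordinates(text: str) -> Locus:
    """Parse a 'chr:strand:start..end' string into a Locus."""
    match = re.fullmatch(r"(\w+):([+-]):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    chromosome, strand, start, end = match.groups()
    return Locus(chromosome, strand, int(start), int(end))


locus = parse_coordinates("chr20:-:38101385..38107264")
print(locus.chromosome, locus.strand, locus.span)  # chr20 - 5880
```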
Molecular Weight (calculated)
103830.0 Da
Isoelectric point (IEP, calculated)
6.797
GRAVY (calculated)
-0.599
Length
937 amino acids
Sequence
001: MFSQNVILPP ENPRPTAFAS KPMSNEIAKP LPSILVGRFK ALLKQRDDEL RVAAGDPVPP ASTDEIVQIY ELLLSELTCN LKPIITDLTI IAEQQREHAK
101: GIADAICARI LEVPVDQKLP SLYLLDSIVK NFGQEYIRYF SLRLPEVFCE AYRQIQPTLH SAMRHLFGTW SKVFPPSVLR KIETELQFSQ AVNTQSSTLN
201: PVRASESSRP SHAIHVNPKY LRQLERSTVD SASKTHQFLS SSSSLGISSS SPSRIGVDRP LSASMDEYAV DNSAVRLIER NSPHPAVDYG VAKALGRDVD
301: LTEWQQKQYP GDGRNRFPTS VTYSLSNGHQ RQSPRALIDA YGSDKSQETS SSKPLLVERL DRNGIDKVLS TSWQNTEEEE FDWENMSPTL TDHSRNNSLL
401: PSTFGFSRER PGVAANATLS EQDTRKGWSS GSQLPPVDDS SAIAEDAFAS STFRRTPPGQ VPGSQNQINH SLGSSQPHDA WKISHHPSNI FSNRGRARNL
501: MIPPMDNIRN TDNNPYWVRP SMSRMEARPS VLPAPFEMRP SVNVNVTRPP IINPINPLQK HVRSQFNAIN TSNPIANHVN KSSFMPKQSF DSVENKDASI
601: SKIHQLPNQL PGVISSNQQN HGQAPQLQFF PSQDPSTSQF CHGSSLQGHG ASISTAMSNP LPVIPFPLPF QSIANNPLHL QGGAHPSLPP GRPPAPSQMI
701: PHPNVGAYMS SQQPTVGYTN LISSLMSQGV ISLANQLPAQ DSVGTEFNPD ILKVRHESAV NALYGDLPRQ CTTCGLRFKC QEEHSSHMDW HVTKNRMSKT
801: RKQKPSRKWF VSDRMWLSGA EALGTESAPG FLPTETIEER KDDEELAVPA EEDQNTCALC GEPFDEFYSD EMEEWMYRGA VYLNAPTGTT AGMDRTQLGP
901: IIHAKCRSES NMATSEDLGP DEKGADEEGS QRKRMRS
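The sequence is displayed in rows of 100 residues, each prefixed with a position offset and split into groups of ten. The sketch below strips that layout and computes the length and a GRAVY score as the mean Kyte-Doolittle hydropathy per residue. Whether CropPAL computes its GRAVY value with exactly this scale is an assumption, and `parse_sequence_block` and `gravy` are hypothetical helper names.

```python
import re

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_sequence_block(text: str) -> str:
    """Join '001: XXXXXXXXXX XXXXXXXXXX ...' rows into one residue string."""
    residues = []
    for line in text.splitlines():
        # Drop the leading position offset (for example '901:').
        body = re.sub(r"^\s*\d+:\s*", "", line)
        # Remove the spaces between the ten-residue groups.
        residues.append("".join(body.split()))
    return "".join(residues)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy; raises KeyError on non-standard residues."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


# Last row of the sequence above, used as a short example.
tail = parse_sequence_block("901: IIHAKCRSES NMATSEDLGP DEKGADEEGS QRKRMRS")
print(len(tail))
print(round(gravy(tail), 3))
```

Running the same two functions over all ten rows gives values that can be compared with the Length and GRAVY fields reported for this record.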
Best Arabidopsis Sequence Match (AT4G04885.1)
001: MDSEKILNPR LVSINSTSRK GMSVELPQKP PPPPSLLDRF KALLNQREDE FGGGEEVLPP SMDEIVQLYE VVLGELTFNS KPIITDLTII AGEQREHGEG
101: IANAICTRIL EAPVEQKLPS LYLLDSIVKN IGRDYGRYFS SRLPEVFCLA YRQAHPSLHP SMRHLFGTWS SVFPPPVLRK IDMQLQLSSA ANQSSVGASE
201: PSQPTRGIHV NPKYLRRLEP SAAENNLRGI NSSARVYGQN SLGGYNDFED QLESPSSLSS TPDGFTRRSN DGANPSNQAF NYGMGRATSR DDEHMEWRRK
301: ENLGQGNDHE RPRALIDAYG VDTSKHVTIN KPIRDMNGMH SKMVTPWQNT EEEEFDWEDM SPTLDRSRAG EFLRSSVPAL GSVRARPRVG NTSDFHLDSD
401: IKNGVSHQLR ENWSLSQNYP HTSNRVDTRA GKDLKVLASS VGLVSSNSEF GAPPFDSIQD VNSRFGRALP DGTWPHLSAR GPNSLPVPSA HLHHLANPGN
501: AMSNRLQGKP LYRPENQVSQ SHLNDMTQQN QMLVNYLPSS SAMAPRPMQS LLTHVSHGYP PHGSTIRPSL SIQGGEAMHP LSSGVLSQIG ASNQPPGGAF
601: SGLIGSLMAQ GLISLNNQPA GQGPLGLEFD ADMLKIRNES AISALYGDLP RQCTTCGLRF KCQEEHSKHM DWHVTKNRMS KNHKQNPSRK WFVSASMWLS
701: GAEALGAEAV PGFLPTEPTT EKKDDEDMAV PADEDQTSCA LCGEPFEDFY SDETEEWMYK GAVYMNAPEE STTDMDKSQL GPIVHAKCRP ESNGGDMEEG
801: SQRKKMRS
Arabidopsis Description
PCFS4: Polyadenylation and cleavage factor homolog 4 [Source: UniProtKB/Swiss-Prot; Acc: Q0WPF2]
SUBAcon: [nucleus]
Hydropathy Plot

About CropPAL

The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata), and Wine grape (Vitis vinifera), as well as precomputed predictions of protein subcellular localization derived from protein sequences.