Soybean
Subcellular Localization
Winner_takes_all: plastid, nucleus

Predictor Summary:
  • nucleus 4
  • plastid 3
  • plasma membrane 1
PPI
No PPI Data
Homology

Paralog

locus  Identity  Homology Identity

Ortholog

locus                  Homology Species  Location                           Identity  Homology Identity
KRG96340               Soybean           nucleus, plastid                   90.27     90.27
VIT_08s0007g07150.t01  Wine grape        cytosol, nucleus, plasma membrane  65.1      66.25
Solyc10g080630.1.1     Tomato            nucleus                            59.26     59.01
CDY08477               Canola            nucleus                            57.39     58.53
Bra001431.1-P          Field mustard     nucleus, plastid                   56.52     58.46
CDX73841               Canola            nucleus, plastid                   56.52     58.46
CDY01074               Canola            nucleus                            57.17     58.35
AT3G11960.3            Thale cress       nucleus                            57.75     58.09
CDY00750               Canola            nucleus                            55.08     57.06
GSMUA_Achr9P11080_001  Banana            nucleus, plasma membrane, plastid  54.72     56.22
Bra034797.1-P          Field mustard     nucleus                            57.32     54.75
Os07t0203700-01        Rice              cytosol, nucleus                   24.08     50.15
KXG34639               Sorghum           plastid                            48.23     49.16
TraesCS2B01G258400.1   Wheat             cytosol, nucleus                   42.68     48.97
Zm00001d019115_P048    Maize             plastid                            47.58     48.67
TraesCS2A01G243300.3   Wheat             plastid                            46.86     47.76
TraesCS2D01G245600.6   Wheat             plastid                            46.86     47.76
HORVU2Hr1G055870.14    Barley            plastid                            46.65     47.57
Os07t0203800-01        Rice              cytosol, plastid                   6.99      34.52
KRH43086               Soybean           nucleus                            3.17      21.67
KRH59213               Soybean           cytosol, plasma membrane, plastid  7.57      21.0
KRH59212               Soybean           cytosol, nucleus                   7.14      18.57
KRH36214               Soybean           nucleus                            16.01     18.29
KRH25071               Soybean           nucleus                            7.86      18.26
KRG92964               Soybean           nucleus                            15.79     18.04
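The ortholog rows above pack five fields into one line, and both the species and the location fields can contain spaces. A minimal sketch of splitting such a row back into fields, assuming the species names are limited to those that appear on this page; the names `SPECIES`, `OrthologRow`, and `parse_ortholog_row` are hypothetical and not part of any CropPAL tooling:

```python
from dataclasses import dataclass
from typing import List

# Species labels as they appear on this page (assumption: extend for other records).
SPECIES = (
    "Soybean", "Wine grape", "Tomato", "Canola", "Field mustard",
    "Thale cress", "Banana", "Rice", "Sorghum", "Wheat", "Maize",
    "Barley", "Potato",
)


@dataclass
class OrthologRow:
    locus: str
    species: str
    locations: List[str]
    identity: float
    homology_identity: float


def parse_ortholog_row(line: str) -> OrthologRow:
    """Split one whitespace-separated ortholog row into its five fields."""
    tokens = line.split()
    if len(tokens) < 4:
        raise ValueError(f"too few fields: {line!r}")
    locus = tokens[0]
    # The last two tokens are always the numeric columns.
    identity = float(tokens[-2])
    homology_identity = float(tokens[-1])
    # Everything in between is "<species> <comma-separated locations>".
    middle = " ".join(tokens[1:-2])
    # Try longer names first so multi-word species match before shorter ones.
    for name in sorted(SPECIES, key=len, reverse=True):
        if middle == name or middle.startswith(name + " "):
            rest = middle[len(name):]
            locations = [part.strip() for part in rest.split(",") if part.strip()]
            return OrthologRow(locus, name, locations, identity, homology_identity)
    raise ValueError(f"unknown species in row: {line!r}")


if __name__ == "__main__":
    print(parse_ortholog_row("Os07t0203700-01 Rice cytosol, nucleus 24.08 50.15"))
```

Because the row is tokenized on any run of whitespace, the same function handles rows separated by single spaces or by aligned columns.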
Protein Annotations
EntrezGene:100806799  Gene3D:2.130.10.10  MapMan:35.1  EMBL:ACUP02002098  InterPro:Cleavage/polyA-sp_fac_asu_C  EnsemblPlantsGene:GLYMA_03G206600
GO:GO:0003674  GO:GO:0003676  GO:GO:0005488  GO:GO:0005515  GO:GO:0005575  GO:GO:0005622
GO:GO:0005623  GO:GO:0005634  UniProt:I1JQE1  InterPro:IPR015943  EnsemblPlants:KRH68075  ProteinID:KRH68075
ProteinID:KRH68075.1  PFAM:PF03178  PFAM:PF10433  PANTHER:PTHR10644  PANTHER:PTHR10644:SF6  UniParc:UPI00023C148D
InterPro:WD40/YVTN_repeat-like_dom_sf  SEG:seg
Description
hypothetical protein
Coordinates
chr3:+:41412272..41426946
Molecular Weight (calculated)
152547.0 Da
IEP (calculated)
5.419
GRAVY (calculated)
0.033
Length
1387 amino acids
Sequence
0001: MAVSEEECSS ANSGSGPSSS SSSASARYYL SKCVLRGSVV LQVLHAHIRS PSSNDVIFGK ETSIELVVID EDGNVQSVCD QPVFGTVKDL AILPWNEKFR
0101: VARDPQLWGK DLLVATSDSG KLSLLTFCNE MHRFIPVTHI QLSNPGNQIY LPGRKLAVDS SGCFIASSAY EDRLALFSLS MSSGDIIDER IVYPSENEGT
0201: ASTSRSIQRI GIRGTIWSIC FISQDSRQPS KEHNPVLAVI INRRGALLNE LLLLEWNVKA HKIFVISQYV EAGPLAHDIV EVPNSGGLAF LFRAGDVLLM
0301: DLRDHRNPSC VCKTNLNFLP NAMEEQTYVE ESCKLHDVDD ERFSVAACAL LELSDYDPMC IDSDNGGANS GYKYICSWSW EPENNRDPRM IFCVDTGEFF
0401: MIEVLFDSEG PKVNLSECLY KGLPCKALLW VESGYLAALV EMGDGMVLKL EDGRLCYINP IQNIAPILDM EVVDYHDEKQ DQMFACCGVA PEGSLRIIRN
0501: GINVENLHRT ASIYQGVTGT WTVRMRVTDS HHSFLVLSFV EETRILSVGL SFTDVTDSVG FQPNVCTLAC GLVTDGLLVQ IHKSTVKLCL PTKAAHSEGI
0601: PLSSPICTSW SPDNVSISLG AVGHNFIVVS TSNPCFLFIL GVRLLSAYQY EIYEMQHLVL QNELSCISIP GQEIEQKQSN SSISANNSSI SSFQIQSGVD
0701: INKTFVIGTH RPSVEIWYFA PGGGITVVAC GTISLTNTVG TAISGCVPQD VRLVFVGKYY VLAGLRNGML LRFEWPAEPC PSSPINIVDT ALSSINLVNS
0801: VTNAFDKRND FPSMLQLIAI RRIGITPVFL VPLGDTLDAD IITLSDRPWL LHSARHSLSY SSISFQPSTH VTPVCSVECP KGILFVAENS LHLVEMVHSK
0901: RLNMQKFHLE GTPRKVLYHD ESKMLLVMRT ELNCGTCLSD ICIMDPLSGS VLSSFRLELG ETGKSMELVR VGSEQVLVVG TSLSSGPHTM ATGEAESCKG
1001: RLLVLCLDHV QNSDSGSVTF CSKAGSSSQK TSPFREIVTY APEQLSSSSL GSSPDDNSSD GIKLDENEVW QFRLTFATKW PGVVLKICPY LDRYFLATAG
1101: NAFYVCGFPN DNPQRVRRYA MGRARFMITS LTAHFTRIAV GDCRDGILLY SYHEEAKKLE LLYNDPSLRL VADCILMDAD TAVVSDRKGS IAVLCSDHLE
1201: DNAGAQCNMA LSCAYFMAEI AMSIKKGSYS YRLPADDVLQ GGNGPKTNVD SLQNTIIATT LLGSIMIFIP LSREEYELLE AVQARLVVHH LTAPVLGNDH
1301: NEFRSRENRV GVPKILDGDM LTQFLELTSM QQKMILSLEL PDMVKPSLKP LLPSHVSVNQ NAEHAYAVLN NIVRQRLRVC FSKCCQI
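The Length and GRAVY values listed for this record can be rechecked from the numbered sequence block. The sketch below strips the position prefixes and spacing, then averages Kyte-Doolittle hydropathy values over all residues, which is the usual definition of GRAVY. Whether CropPAL computes its figure in exactly this way is an assumption, and the function names `parse_numbered_sequence` and `gravy` are hypothetical:

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_numbered_sequence(block: str) -> str:
    """Join lines such as '0001: MAVSEEECSS ANSGSGPSSS' into one residue string."""
    residues = []
    for line in block.splitlines():
        line = line.strip()
        if not line:
            continue
        # Drop the '0001:' position prefix when present.
        body = line.partition(":")[2] if ":" in line else line
        # Remove the spaces between 10-residue groups.
        residues.append("".join(body.split()))
    return "".join(residues)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy; raises KeyError on non-standard residues."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


if __name__ == "__main__":
    # First line of the sequence block above, used only as a short demonstration.
    demo = "0001: MAVSEEECSS ANSGSGPSSS SSSASARYYL SKCVLRGSVV LQVLHAHIRS"
    seq = parse_numbered_sequence(demo)
    print(len(seq), round(gravy(seq), 3))
```

Passing the full fourteen-line block to `parse_numbered_sequence` and taking `len()` of the result gives a quick consistency check against the stated length.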
Best Arabidopsis Sequence Match (AT3G11960.1)
0001: MAAPEDESSA QSQSSPATAA PTPPPSSSPS SAGDHYLAKC ILRPSVVLQV AYGYFRSPSS RDIVFGKETC IELVVIGEDG IVESVCEQYV FGTIKDLAVI
0101: PQSSKLYSNS LQMGKDLLAV LSDSGKLSFL SFSNEMHRFS PIQHVQLSTP GNSRIQLGRM LTIDSSGLFL AVSAYHDRFA LFSLSTSSMG DIIHQRISYP
0201: SEDGGNGSSI QAISGTIWSM CFISKDFNES KEYAPILAIV INRKGSLMNE LALFRWNVKE ESICLISEYV ETGALAHSIV EVPHSSGFAF LFRIGDVLLM
0301: DLRDPQNPCC LFRTSLDFVP ASLMEEHFVE ESCRVQDGDD EGCNVVVCAL LELRDHEVRD HDPMFIDTES DIGKLSSKNV SSWTWEPENN HNPRMIICLD
0401: NGDFFMFELI YEDDGVKVNL SECLYKGLPC KDILWIEGGF LATFAEMADG TVFKLGTEKL HWMSSIQNIA PILDFSVMDD QNEKRDQIFA CCGVTPEGSL
0501: RIIRSGINVE KLLKTAPVYQ GITGTWTVKM KLTDVYHSFL VLSFVEETRV LSVGLSFKDV TDSVGFQSDV CTFACGLVAD GLLVQIHQDA IRLCMPTMDA
0601: HSDGIPVSSP FFSSWFPENV SISLGAVGQN LIVVSTSNPC FLSILGVKSV SSQCCEIYEI QRVTLQYEVS CISVPQKHIG KKRSRDSSPD NFCKAAIPSA
0701: MEQGYTFLIG THKPSVEVLS FTEDGVGVRV LASGLVSLTN TMGTVISGCI PQDVRLVLVD QLYVLSGLRN GMLLRFEWAP FSNSSGLNCP DYFSHCKEEM
0801: DTVVGKKDNL PVNLLLIATR RIGITPVFLV PFSDSLDSDI IALSDRPWLL QTARQSLSYT SISFQPSTHA TPVCSFECPQ GILFVSENCL HLVEMVHSKR
0901: RNAQKFQLGG TPRKVIYHSE SKLLIVMRTD LYDTCTSDIC CVDPLSGSVL SSYKLKPGET GKSMELVRVG NEHVLVVGTS LSSGPAILPS GEAESTKGRV
1001: IILCLEHTQN SDSGSMTICS KACSSSQRTS PFHDVVGYTT ENLSSSSLCS SPDDYSYDGI KLDEAETWQL RLASSTTWPG MVLAICPYLD HYFLASAGNA
1101: FYVCGFPNDS PERMKRFAVG RTRFMITSLR TYFTRIVVGD CRDGVLFYSY HEESKKLHQI YCDPAQRLVA DCFLMDANSV AVSDRKGSIA ILSCKDHSDF
1201: GMKHLEYSSP ESNLNLNCAY YMGEIAMSIK KGCNIYKLPA DDVLRSYGLS KSIDTADDTI IAGTLLGSIF VFAPISSEEY ELLEGVQAKL GIHPLTAPVL
1301: GNDHNEFRGR ENPSQARKIL DGDMLAQFLE LTNRQQESVL STPQPSPSTS KASSKQRSFP PLMLHQVVQL LERVHYALH
Arabidopsis Description
Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Source:UniProtKB/TrEMBL;Acc:Q84R20]
SUBAcon: [nucleus]
Hydropathy Plot

About CropPAL

The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera). It also provides precomputed subcellular localization predictions derived from protein sequences.