Skip to main content
crop-pal logo
Soybean
Subcellular Localization
min:
: max

 
Winner_takes_all: nucleus

Predictor Summary:
  • nucleus 2
  • cytosol 2
  • mitochondrion 1
  • plasma membrane 2
  • extracellular 1
  • endoplasmic reticulum 1
  • golgi 1
  • vacuole 1
Predictors GFP MS/MS Papers
Winner Takes All:nucleus
Any Predictor:cytosol, mitochondrion, nucleus, secretory
BaCelLo:nucleus
MultiLoc:cytosol
Plant-mPloc:nucleus
Predotar:secretory
PProwler:mitochondrion
WoLF PSORT:plasma membrane
YLoc:cytosol
nucleus: 21132161
msms PMID: 21132161 doi
B Cooper, KB Campbell, J Feng, WM Garrett, R Frederick
Soybean Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD 20705, USA. bret.cooper@ars.usda.gov
PPI
No PPI Data
Homology

Paralog

locusIdentityHomology Identity

Ortholog

locusHomology SpeciesLocationIdentityHomology Identity
KRH29119 Soybean mitochondrion 19.48 87.24
PGSC0003DMT400056702 Potato endoplasmic reticulum, extracellular, plastid 2.34 68.66
VIT_03s0038g01920.t01 Wine grape plasma membrane 40.69 60.06
VIT_03s0038g01880.t01 Wine grape cytosol 13.84 58.49
Solyc11g039880.1.1 Tomato nucleus 21.57 57.14
VIT_03s0038g01870.t01 Wine grape cytosol 3.0 54.13
AT4G38760.1 Thale cress cytosol 48.98 49.01
CDX72740 Canola cytosol 49.03 48.96
GSMUA_Achr11P... Banana cytosol 14.19 48.95
Bra033575.1-P Field mustard cytosol 49.19 48.57
CDY22413 Canola cytosol 42.83 47.9
Solyc11g039870.1.1 Tomato nucleus 25.74 45.92
OQU82713 Sorghum plastid 37.44 37.53
TraesCS7B01G055100.1 Wheat cytosol, plasma membrane, plastid 37.18 37.22
HORVU7Hr1G031100.1 Barley cytosol 37.03 37.09
Zm00001d030342_P001 Maize plastid 2.7 37.06
TraesCS7D01G153000.1 Wheat cytosol, plasma membrane, plastid 36.93 37.0
TraesCS7A01G151100.1 Wheat plastid 36.88 36.91
Zm00001d030340_P037 Maize plasma membrane 33.52 36.67
GSMUA_Achr11P... Banana plasma membrane 23.55 34.86
Os10t0154600-01 Rice peroxisome, plasma membrane, plastid 11.24 34.21
Os10t0154566-00 Rice plastid 2.64 32.7
Protein Annotations
EntrezGene:100796560MapMan:23.5.1.1.2.2EMBL:ACUP02007171ncoils:CoilEnsemblPlantsGene:GLYMA_12G024600GO:GO:0003674
GO:GO:0005198GO:GO:0005575GO:GO:0005622GO:GO:0005623GO:GO:0005634GO:GO:0005635
GO:GO:0006405GO:GO:0006606GO:GO:0006810GO:GO:0008150GO:GO:0017056GO:GO:0044611
UniProt:K7LSN5EnsemblPlants:KRH24149ProteinID:KRH24149ProteinID:KRH24149.1InterPro:Nucleoporin_Nup188PFAM:PF10487
PANTHER:PTHR31431UniParc:UPI000296B699SEG:seg:::
Description
hypothetical protein
Coordinates
chr12:+:1781192..1803922
Molecular Weight (calculated)
220250.0 Da
IEP (calculated)
5.344
GRAVY (calculated)
0.155
Length
1966 amino acids
Sequence
(BLAST)
0001: MADTSSVDAS LWWDSFTVLF SELENSSLTS DLPPNLAKKL KDNHAWFVDT LTRFKPPNQS SKEALSSKTL KIGSHQLTIQ PQLKDTALQI SSCLLLDEVQ
0101: SYILVERSIK HNNAAVADSM APEFLYMMLV QYYKERQCLL KCIRWILMHA IHNGYVAEDN TMKEEARKLF HDGLENKLIL FFSNLLSCSF PEQMDVDLFT
0201: LWAEETLIED NLVLDILFLA YYDSFCTCSS EMWKKFISLY KGILAGDYNL GKLSITTETQ QLSYHAKVQL LLILIETLNL ENVLQMVHDE VPYRKGVSTF
0301: SMTDVQEMDA LVSTFNAFEM KEAGPLVLAW AVFLYLLLTL VEKDENNELM EIDHISYVRQ AFEAGSLRYC LEILECDILK EYDGPVSGYR GVLRTFISAF
0401: VASYEINLQP EDSNPTLMLD ILCKIYRGEE SLCIQFWDKE SFIDGPIRSL LCNLESEFPF RTLELVQLLS SLCEGTWPAE CVYNFLNRSV GISSLFEISS
0501: DLEVVEAQQA VQVPGVEGFF IPAGTRGSVL RVVGENTALV RWEYSPSGMF VLLLHLAQEM YLNSKDGVVY TLDLLSRLVS FNTGVCFAVM DISNSLLFHD
0601: VGLMDEQVEK RVWVVDIICN LVKNLTLNSC GAALMSMGVK ILGIMLICSP ANVAATTLNA NLFDITLQTP TFNVGSNGLS SGSWLLSCKL ARMLLIDCEQ
0701: NSNDCPLAIS VLDFTIQLVE TGVEHDALLA LIIFSLQYVL VNHEYWKYKM KHIRWKITLK VLELMKKCIS SMPYYGKLGE IINNVLFSDS SIHNTLFQIV
0801: CTNAHALEKL HVSRLFDPME IEGLQLAIGS VLDILSVMLT KLSKDTSSNF PVFLQALFSC TTKPVPVVTS VMSLISYSQD PAIQFGAVRF ISMLFAIADC
0901: IQPFSYGITC FIPDNEIMDL RHSVNYILLE QSESNEDLFV ATVNLFTSAA HYQPSFIVAI FALEENTEGH LSIGDAKLQK KETSPTTVVS KRSSLVDALM
1001: HYIERADDLI KSNPRILLCV LNFMIALWQG APHYANLLDA LRRHGKFWEH LANAISNIAS SEIPLLRSLE EKDAFNLAYC FHCQSSIHGI MAYELFLHKK
1101: LFHAESLVKD VAESKDKEQN ASKTEKSKAP DLQDLKGIWS SWFNDSILEK LIKSYTSCGY NNDIYGGAKV ATSLFSVHVM MKLAVCDSGS ISVLLLQKIH
1201: EILTKLSIHP AFSELVSQYS QRGYSEGKEL KKLILSDLFY HLQGELEGRK IDIGPFKELS QYLVESNFLG TYQHLFNEDS FTKNMFTKNV YLFDLAHLRE
1301: DLRLDLWDCS NWKTSKEIAE TMLRFLQDAN SVMLLSSSKL SALKGLIAVL AVNHYDSQGR ATTGGRISDE LIFAFMDNIC QSFLATIETL SSVLDASEDI
1401: LNFLACQAEL LLQLTRTVCK SLSLHVSLLV LKCASSGLKL LSALKPLPSE ANLIMKLLLT LLLSVLQSDS LNAHSDGATD ESSGEDFSKV SNATLGLLPI
1501: LCNCIATSEH CMLSLSVMDL ILRRFLTPRT WLPVLQNHLQ LPIVMLKLHD KNSASIPIIM KFFLTLARVR GGAEMLYCSG FLSSLRVLFA ESGEDFLRIG
1601: SENLGSSCEK FVIPQDIWGL GLAVVTAMVK SLGDNSSGTA IVDSMIPYFF SEKARLIFNS LNAPDFPSDD HDKKRPRAQR AWISLATLKE TEHTLMLMCE
1701: LAKHWNSWIK AIRNVDRQLR EKCIHLLAFI SRGSQRLSEL SSRNAPLLCP PTVKEEFEIC LKPSYVNSKN GWFALSPLGC VPKPKISSFS TALSTYGQAT
1801: ESRNPASKTG FSDTVALQVY RIAFLLLKFL CLQTEGAAKR AEEVGFVDLA HFPELPMPEI LHGLQDQAIA ITTELCEANK LKVSPETQDV CNLLLQILEM
1901: ALHLELCVLQ ICGIRPVLGR VEDFSKEAKS LFSALEGHAF LKASCNSLKQ MISCVYPGLL QGENFI
Best Arabidopsis Sequence Match ( AT4G38760.1 )
(BLAST)
0001: MANPNSVDSS LWWDPFDSLL TDLENASLSD DLPQPIAKKL EENHAWFVGT LSMFKPPSEK SKEALNSDLV KIKEHQLVIK PQLKDKALRI SSHLNLDEIQ
0101: SYILVERSME QEYGTTDSVA QELTQEFIDM ILLQYYIQRQ CLLKCTKRIL IHALYAPREE SSIKEEAVKL ISDGLERRQS SVLEDLLSSC FPKNMDVNLF
0201: TLWAEETLIE DNLILDILFL IYNESYCSCN GERWRKLCSF YKGILSGSYN FSKLAVSVEA QHSACRVQIQ LLMILIETLD MENLLQMVHD GVPFRSGTCV
0301: FSIVDVQEMD ATISSLNTSE VNEAGPLVLA WAVFLCLISS LPGKEESPFL MDIDHVSYVH QAFEAASLSY FLEILQSNLL NDFDGPISGH RSVVRTFISA
0401: FIASYEINLQ LEDGTLELIL DILSKVYQGE ESLCCQFWDR KSFVDGPIRC LLFDLESEFP FRSAEFIRLL SSLSEGSWPA ECVYNFLDKS VGVSTLFDIT
0501: SDSPADDASQ LVETSRPLHI PGLEGLVIPS NTRGRILRVI SENTVLVRWE YSLSGIIVLI IRLANKLYIG NNREAFVTLE LLRRMVTFNK AVCFSLLNIS
0601: HFFYVQESYV NGKMESDVRV VDIICNSVRS LTFDSGGAAV MAMAIDILAK LLRCSPSSVA PMVLKSNIFD MTSCSDVPDS GYNISLSGSW SLSGKLAKMI
0701: LIDCEKNDTS CPLVISVLEF TMQLVEGGLE NDVVFALVVF SLQYILASHE YWKYNHGNMR WKVTLKVIEL MKTCLRFSKF STKLRDVLLD ILLNDASVHS
0801: ALFRIICTTT QNLENLCSSR FIEPAEIEGW QLAIVSVLDV LNVILSQFSE STHSGLPVFH QAMLSSTTKP ISVVAAITSL ISYFRNPTIQ VCAAQVLSKL
0901: FALAESSQLY IISNAGFGLD NKQITDLRNS VTQIVLDLSG QNEHLVVATL KLLTVAARFQ PALLVAIFDS DEDSDSSNVK QSRKDASSIP DWACKSRLLH
1001: TILQYVERAT DFVDRHTDIL LGLLDFLKTL WQEAGQYANM LEPFKASKKL WQEFSDIISQ ASKIKDSTVG SLGKEEISKL LVKYQCQASV LEIMACNMFL
1101: YKKLLFAESL KKPCVETKKT ASNGVSPPKL TWTADSDPKD IFSKWCDISV LDGIIQSVSS LDGESEINFQ AKVAAVLLIV HLIVKLETSG AGALSMVLVE
1201: KIKLISETLC AQPAFSELLA QYSKLGYSGG KELMPMIFSD LYCHLQGKLE GRDIPTGPFK ELFQFLVETS FWEKYKQKTN KDVNMALGDC LFDTQQIQTE
1301: LGIDIWDFSE WKTSKTTAEE MLNYMQRANS MVLLSTSQLS VLHALISVLI LYEDNSLEES AAAERKIPSR VTLLSIDKVC RKFCTTVDSL ASLWDAPKIV
1401: FDILTAQADL LSRLLKSAKK NLSLSVCALV LRNVGPGLKI LGSLRHSNAI LKKTINLLLE VLLLVVGFGS DNSNSSGMGH MVLAKDFAEI SDATIGLLPL
1501: LCNFMGNPEY LTLCLTTVDL ILRNFLTPET WFPIIQSQLR LQHVILQLQD KKSTTSVSAI LKFFLTIAQV HGGAQMLLNS GFFSTLRALL MEFPDGMSTL
1601: VSDNEKGSLL EKTEKTQHIW GIGLAVVTAM VHSLGSVSAG ADIVESVISY FFLEKGYMIS YYLAAPDFPS DDRDKVRLRS QRTWTSLAYL RVTEHTLLLL
1701: CALASHWRSW VKIMKDMDSP LREMTIHLLA FISKGAQRLR ESQSHISHLL CPPVAKEEFD SCKRPSFINT KHGWFSLAPL VCVGKPKITA VSISTALVVR
1801: GDTTEHPGSV PQSQFSDSVA IQIYRVASLL LKFLCLQAEG VVTRAEEVGY VDIAHFPELP EPEILHGLQD QATAIVAELC DNYKSKEIPD EVKKLCLMLI
1901: QTTEMSLYLE LCVVQVCRIH PVFGRVDNFS KDLKKLVKAA EVHTYLEPSI DSLKKIAAFL YPGSL
Arabidopsis Description
Protein of unknown function (DUF3414) [Source:TAIR;Acc:AT4G38760]
SUBAcon: [cytosol]
Hydropathy Plot

About CropPAL

The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.