Skip to main content
crop-pal logo
Soybean
Subcellular Localization
min:
: max

 
Winner_takes_all: nucleus

Predictor Summary:
  • nucleus 6
  • mitochondrion 1
Predictors GFP MS/MS Papers
Winner Takes All:nucleus
Any Predictor:mitochondrion, nucleus
BaCelLo:nucleus
EpiLoc:nucleus
MultiLoc:nucleus
Plant-mPloc:nucleus
PProwler:mitochondrion
WoLF PSORT:nucleus
YLoc:nucleus
nucleus: 21132161
msms PMID: 21132161 doi
B Cooper, KB Campbell, J Feng, WM Garrett, R Frederick
Soybean Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD 20705, USA. bret.cooper@ars.usda.gov
Homology

Paralog

locusIdentityHomology Identity

Ortholog

locusHomology SpeciesLocationIdentityHomology Identity
KRH39697 Soybean nucleus 92.43 92.38
Os06t0645700-01 Rice cytosol, nucleus 2.38 63.38
VIT_12s0028g03940.t01 Wine grape nucleus 62.36 61.58
Solyc07g006820.2.1 Tomato nucleus 54.74 56.72
GSMUA_Achr2P11610_001 Banana nucleus 50.66 52.21
CDY44660 Canola nucleus 49.76 51.0
AT1G32750.1 Thale cress nucleus 51.61 50.81
Bra035499.1-P Field mustard nucleus 50.66 50.55
CDY18566 Canola nucleus 45.9 50.55
CDX95471 Canola nucleus 45.74 50.41
Os06t0645800-01 Rice nucleus 23.4 50.23
Bra037541.1-P Field mustard nucleus 46.06 49.38
EER90111 Sorghum nucleus 47.01 49.22
CDX84083 Canola nucleus 6.19 49.16
TraesCS7D01G505200.1 Wheat nucleus 46.69 49.11
TraesCS7A01G514800.1 Wheat nucleus 46.69 49.11
Zm00001d046579_P019 Maize nucleus 46.85 48.98
TraesCS7B01G431500.1 Wheat nucleus 46.37 48.78
TraesCS7D01G505400.1 Wheat nucleus 44.52 47.73
HORVU7Hr1G115120.1 Barley plastid 39.81 47.69
TraesCS7B01G431700.2 Wheat nucleus 44.41 47.62
AT3G19040.2 Thale cress nucleus 44.2 46.6
TraesCS7A01G515000.1 Wheat plastid 43.25 45.85
Os06t0645901-00 Rice cytosol, nucleus 6.62 36.87
Protein Annotations
Gene3D:1.20.920.10EntrezGene:100783866MapMan:15.3.4.4.4MapMan:15.3.5.3.2MapMan:18.4.7.3Gene3D:3.10.20.90
EMBL:ACUP02002334InterPro:BromodomainInterPro:Bromodomain-like_sfInterPro:Bromodomain_CSncoils:CoilEnsemblPlantsGene:GLYMA_04G123500
GO:GO:0001075GO:GO:0001129GO:GO:0003674GO:GO:0003676GO:GO:0003677GO:GO:0003700
GO:GO:0003824GO:GO:0004402GO:GO:0005488GO:GO:0005515GO:GO:0005575GO:GO:0005622
GO:GO:0005623GO:GO:0005634GO:GO:0005654GO:GO:0005669GO:GO:0006139GO:GO:0006464
GO:GO:0008150GO:GO:0008152GO:GO:0009058GO:GO:0009987GO:GO:0016043GO:GO:0016573
GO:GO:0016740GO:GO:0017025GO:GO:0019538GO:GO:0043565GO:GO:0045944GO:GO:0051123
InterPro:IPR000626InterPro:IPR001487InterPro:IPR036427UniProt:K7KJP5EnsemblPlants:KRH62676ProteinID:KRH62676
ProteinID:KRH62676.1PFAM:PF00240PFAM:PF00439PFAM:PF09247PFAM:PF12157PRINTS:PR00503
ScanProsite:PS00633PFscan:PS50014PFscan:PS50053SMART:SM00213SMART:SM00297SUPFAM:SSF47055
SUPFAM:SSF47370SUPFAM:SSF54236InterPro:TAFII-230_TBP-bd_sfInterPro:TAF_II_230-bdInterPro:TFIID_sub1_DUF3591UniParc:UPI000296053E
InterPro:Ubiquitin-like_domsfInterPro:Ubiquitin_domSEG:seg:::
Description
hypothetical protein
Coordinates
chr4:+:15945709..15987722
Molecular Weight (calculated)
214293.0 Da
IEP (calculated)
5.979
GRAVY (calculated)
-0.856
Length
1889 amino acids
Sequence
(BLAST)
0001: MGYDSDSPSQ DGRDEDDEEE YEESGKGNRF LGFMFGNVDN SGDLDVDYLD EDAKEHLSAL ADKLGPSLTD IDLSGKSPQT PPDVVEQGCD VKAEDAVDYE
0101: DIDEEYDGPE TEAANEEDYL LPKKEFFSAE ASVCLESKAS VFDDENYDED SEKEQDFVND DCKVDNIPLA GEQKESFVDA SKEESSLEHE LHVDSPQTEE
0201: LDADVQKLEE ESPEVPKRSM AMPLPVLCVE DGVTILRFSE IFGIHEPLRK GEKREHRHSI PRDRYKSLDL IDDFIEEDEE EFLKGFSQSL SLTKQVCVVH
0301: NDVSESNDVD LEFPKFGFLL ADASVARKDD HQSKDSCHSA EPMKGDFAED HSRKDHPFML ANFYPLDQQD WEDEILWGNS PVPSNNNVES CEISGPELGA
0401: SGGSEIEIES GIQSIQMEPQ KKLEDKDHNV LMCSSPVKVE PFGSWDSFGA KTNLISRSLF HPQLLRLESR SEVDSSSLAD GREAEISEHN QSGQVKRFTK
0501: VISQNRDMME GSWLDKIIWE ELDQPMVKPK LIFDLQDDQM HFEVLDSKDG THLRLHAGAM ILTRSLQSIS GDSSELPGHG SQYGWRHVAN DKHYSNRKTS
0601: QQLKSNSKKR SAHGVKVFHS QPALKLQTMK LKLSNKDIAN FHRPKALWYP HDNEVAVKEQ GKLPTQGPMK IIIKSLGGKG SKLHVDAEET LSSVKAKASK
0701: KLDFKVSETV KIFYLGRELE DHKSLAAQNV QPNSLLHLVR TKIHLWPKAQ RVPGENKSLR PPGAFKKKSD LSVKDGHVFL MEHCEERPLL LSNVGMGARL
0801: CTYYQKCSPD DQSGSLLRNT DNSLGHIISL DPADKSPFLG DLKPGCTQSS LETNMYRAPV FPHKVPLTDY LLVRSSKGKL SLRRIDKINV VGQQEPLMEV
0901: LSPGSKNLQN YMINRLLVHM CREFQAAEKR HMPPYIRVDE FLSQFPYQSE ASFRKKIKEY ANLQRGTNGQ SILVKKRNFR IWSEDELRKM VTPELVCAYE
1001: SMQAGLYRLK HLGITETHPT NISSAMSRLP DEAIALAAAS HIERELQITP WNLSSNFVAC TSQGKENIER MEITGVGDPS GRGMGFSYAR APPKAPVSSA
1101: MVKKKAAANR GGSTVTGTDA DLRRLSMDAA REVLLKFNVP DEVIAKQTRW HRIAMIRKLS SEQATSGVKV DPTTISKYAR GQRMSFLQLQ QQTREKCQEI
1201: WDRQVQSLSA VNGDENESDL EGNSDLDSFA GDLENLLDAE ECEEGEESTN DLKRDKGDGV KGLKMRRHPT LAQAEEEIED DAAEAAELCR LLMDDDEADK
1301: KKKKKAKVIV GEARLVPKMQ SKFSFDNAEQ VKQITNTLQL DGTNHWKEDA ITDLREEENF PTKKSKSLKV NKVKKNDITP ISIPNKKIKL NMGEGIKNQV
1401: FKEKKPSRET FVCGACGKAG HMRTNKNCPK YGEDLETQLE STDMEKSSGK SSFVDPSSLS QHKAPSKKSM SKGTTKIAPV DNSSKIPLKF KCSSTEKSSD
1501: KPAIESLQSS DKPVTSDSET AKSAKVNKII IPKKVKPDDT QAESGKHAIV IRPPTDSGRG QVDSHKFPIK IRPPTEIDRE QNHKKIVIKR TKEVIDLELD
1601: SPGGNTGLQH RKTKRIVELS NFEKQKKQET VYGTEGFKKW NSKEDRRWQE EQEKWRNDAR LREEDRARRH RKEEIRMLKE QERLDEIKRF EEDIRREREE
1701: EEQQKAKKKK KKKPELRDEY LDDLRARRHD KRMPERDRSG KRRSITELGK IGADYMPPTK RRRGGGGEVG LANILESVVD TIVKDRYDLS YLFLKPVSKK
1801: EAPDYLDIIE RPMDLSRIRE RVRNMEYKSR EDFRHDMWQI TFNAHKYNDG RNPGIPPLAD MLLEYCDYLL NENDDSLTEA ETGIEIRDF
Best Arabidopsis Sequence Match ( AT1G32750.1 )
(BLAST)
0001: MAESNGKGSH NETSSDDDDE YEDNSRGFNL GFIFGNVDNS GDLDADYLDE DAKEHLSALA DKLGSSLPDI NLLAKSERTA SDPAEQDYDR KAEDAVDYED
0101: IDEEYDGPEV QVVSEEDHLL PKKEYFSTAV ALGSLKSRAS VFDDEDYDEE EEQEEEQAPV EKSLETEKRE PVVLKEDKAL EYEEEASILD KEDHMDTEDV
0201: QEEEVDELLE GTLDDKGATP LPTLYVEDGM VILQFSEIFA IHEPPQKRDR RENRYVTCRD KYKSMDISEL VEDDEEVLLK SHGRIDTHVE QADLIQLDVP
0301: FPIREGLQLV KASTIGGITP ESREFTKLGR DSCIMGELLK QDFIDDNSSL CQSQLSMQVF PLDQHEWERR IIWEHSPEIS GNSGEIFEPG LEPEGMLVKG
0401: TNSETEQESL NVVNSRVQVQ ADNNMFVPFS ANLLESFGSR GSQSTNESTN KSRHHPQLLR LESQWDENHL SGNDEAGVKK IKRLEKDALG RFSRLVLRER
0501: DLGDEAWLDS IIWDSEKELS RSKLIFDLQD EQMVFEIFDN EESKNLQLHA GAMIVSRSSK SKDETFQEGC ESNSGWQFNL SNDKFYMNGK SSQQLQANTN
0601: KSSVHSLRVF HSVPAIKLQT MKSKLSNKDI ANFHRPKALW YPHDNELAIK QQGKLPTRGS MKIIVKSLGG KGSKLHVGIE ESVSSLRAKA SRKLDFKETE
0701: AVKMFYKGKE LDDEKSLAAQ NVQPNSLVHL IRTKVHLWPW AQKLPGENKS LRPPGAFKKK SDLSTKDGHV FLMEYCEERP LMLSNAGMGA NLCTYYQKSS
0801: PEDQRGNLLR NQSDTLGNVM ILEPGDKSPF LGEIHAGCSQ SSVETNMYKA PIFPQRLQST DYLLVRSPKG KLSLRRIDKI VVVGQQEPRM EVMSPGSKNL
0901: QTYLVNRMLV YVYREFFKRG GGEHPIAADE LSFLFSNLTD AIIKKNMKII ACWKRDKNGQ SYWTKKDSLL EPPESELKKL VAPEHVCSYE SMLAGLYRLK
1001: HLGITRFTLP ASISNALAQL PDEAIALAAA SHIERELQIT PWNLSSNFVA CTNQDRANIE RLEITGVGDP SGRGLGFSYV RAAPKAPAAA GHMKKKAAAG
1101: RGAPTVTGTD ADLRRLSMEA AREVLIKFNV PDEIIAKQTR WHRIAMIRKL SSEQAASGVK VDPTTIGKYA RGQRMSFLQM QQQAREKCQE IWDRQLLSLS
1201: AFDGDENESE NEANSDLDSF AGDLENLLDA EEGGEGEESN ISKNDKLDGV KGLKMRRRPS QVETDEEIED EATEYAELCR LLMQDEDQKK KKKKMKGVGE
1301: GMGSYPPPRP NIALQSGEPV RKANAMDKKP IAIQPDASFL VNESTIKDNR NVDSIIKTPK GKQVKENSNS LGQLKKVKIL NENLKVFKEK KSARENFVCG
1401: ACGQHGHMRT NKHCPRYREN TESQPEGIDM DKSAGKPSSS EPSGLPKLKP IKNSKAAPKS AMKTSVDEAL KGDKLSSKTG GLPLKFRYGI PAGDLSDKPV
1501: SEAPGSSEQA VVSDIDTGIK STSKISKLKI SSKAKPKESK GESERRSHSL MPTFSRERGE SESHKPSVSG QPLSSTERNQ AASSRHTISI PRPSLSMDTD
1601: QAESRRPHLV IRPPTEREQP QKKLVIKRSK EMNDHDMSSL EESPRFESRK TKRMAELAGF QRQQSFRLSE NSLERRPKED RVWWEEEEIS TGRHREVRAR
1701: RDYDDMSVSE EPNEIAEIRR YEEVIRSERE EEERQKAKKK KKKKKLQPEI VEGYLEDYPP RKNDRRLSER GRNVRSRYVS DFERDGAEYA PQPKRRKKGE
1801: VGLANILERI VDTLRLKEEV SRLFLKPVSK KEAPDYLDIV ENPMDLSTIR DKVRKIEYRN REQFRHDVWQ IKYNAHLYND GRNPGIPPLA DQLLEICDYL
1901: LDDYEDQLKE AEKGIDPND
Arabidopsis Description
TAF1Transcription initiation factor TFIID subunit 1 [Source:UniProtKB/Swiss-Prot;Acc:Q8LRK9]
SUBAcon: [nucleus]
Hydropathy Plot

About CropPAL

The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.