Subcellular Localization
Winner_takes_all: cytosol
Predictor Summary:
- nucleus 3
- plastid 2
- cytosol 2
- mitochondrion 1
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
PPI
Inferred from Arabidopsis experimental PPI
Ath locusA | locusB | Ath locusB | Paper |
---|---|---|---|
AT4G04885.1 | KRH11760 | AT1G30460.1 | 18479511 |
AT4G04885.1 | KRH36762 | AT1G30460.1 | 18479511 |
AT4G04885.1 | KRG91241 | AT4G04885.1 | 18479511 |
AT4G04885.1 | KRH35569 | AT4G04885.1 | 18479511 |
AT4G04885.1 | KRH35574 | AT4G04885.1 | 18479511 |
Homology
Paralog
locus | Identity | Homology Identity |
---|---|---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
KRG91241 | Soybean | nucleus | 90.24 | 88.79 |
KRH35574 | Soybean | nucleus | 21.04 | 48.14 |
CDY16640 | Canola | nucleus | 36.98 | 44.06 |
Bra029526.1-P | Field mustard | nucleus, plastid | 36.55 | 43.65 |
CDX94585 | Canola | nucleus | 35.57 | 43.16 |
VIT_10s0116g00720.t01 | Wine grape | nucleus | 44.9 | 42.99 |
AT4G04885.1 | Thale cress | nucleus | 37.53 | 42.82 |
PGSC0003DMT400075619 | Potato | nucleus | 43.06 | 40.68 |
Solyc12g094490.1.1 | Tomato | nucleus | 42.41 | 40.1 |
GSMUA_Achr6P18930_001 | Banana | nucleus | 34.6 | 38.81 |
TraesCS7A01G294500.5 | Wheat | nucleus | 32.65 | 32.26 |
TraesCS7D01G290300.2 | Wheat | nucleus | 32.54 | 32.19 |
OQU80002 | Sorghum | nucleus | 32.43 | 31.31 |
Os08t0187700-01 | Rice | cytosol | 32.75 | 31.1 |
Zm00001d000023_P004 | Maize | nucleus | 32.43 | 31.02 |
Zm00001d049442_P009 | Maize | nucleus | 32.21 | 30.71 |
HORVU7Hr1G069840.2 | Barley | plastid | 32.54 | 30.55 |
TraesCS7B01G181900.3 | Wheat | nucleus | 32.32 | 29.86 |
KRH32653 | Soybean | nucleus | 24.4 | 23.08 |
KRG96135 | Soybean | nucleus | 24.3 | 22.47 |
KRH67849 | Soybean | nucleus | 24.62 | 21.95 |
KRG96139 | Soybean | nucleus | 24.3 | 21.75 |
Protein Annotations
Gene3D:1.25.40.90 | EntrezGene:100787354 | MapMan:16.2.1.5.2 | EMBL:ACUP02006692 | InterPro:CID_dom | InterPro:ENTH_VHS |
EnsemblPlantsGene:GLYMA_10G251100 | GO:GO:0000993 | GO:GO:0003674 | GO:GO:0003676 | GO:GO:0003723 | GO:GO:0003729 |
GO:GO:0005488 | GO:GO:0005515 | GO:GO:0005575 | GO:GO:0005622 | GO:GO:0005623 | GO:GO:0005634 |
GO:GO:0005654 | GO:GO:0005737 | GO:GO:0005849 | GO:GO:0006139 | GO:GO:0006369 | GO:GO:0006378 |
GO:GO:0006379 | GO:GO:0008150 | GO:GO:0008152 | GO:GO:0009058 | GO:GO:0009987 | InterPro:IPR006569 |
InterPro:IPR008942 | InterPro:IPR013087 | UniProt:K7LLB6 | EnsemblPlants:KRH35569 | ProteinID:KRH35569 | ProteinID:KRH35569.1 |
PFAM:PF04818 | ScanProsite:PS00028 | PFscan:PS51391 | PANTHER:PTHR15921 | InterPro:RNA_pol_II-bd | SMART:SM00582 |
SUPFAM:SSF48464 | UniParc:UPI0002949737 | InterPro:Znf_C2H2_type | SEG:seg |
Description
hypothetical protein
Coordinates
chr10:+:47920678..47927383
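The coordinate string packs the chromosome, strand, and a `start..end` range into one colon-separated field. A minimal Python sketch for splitting it apart is given here. The names `Locus` and `parse_coordinates` are illustrative and not part of CropPAL, and treating both endpoints as inclusive is an assumption, since the page does not state its convention.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Locus:
    """Hypothetical container for one coordinate field."""
    chromosome: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: start and end are both inclusive positions.
        return self.end - self.start + 1


# chromosome : strand : start..end
_COORD = re.compile(
    r"^(?P<chrom>[^:]+):(?P<strand>[+-]):(?P<start>\d+)\.\.(?P<end>\d+)$"
)


def parse_coordinates(text: str) -> Locus:
    """Split a 'chr:strand:start..end' string into its parts."""
    match = _COORD.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return Locus(
        chromosome=match["chrom"],
        strand=match["strand"],
        start=int(match["start"]),
        end=int(match["end"]),
    )


locus = parse_coordinates("chr10:+:47920678..47927383")
print(locus.chromosome, locus.strand, locus.span)
```

Under the inclusive-range assumption, the locus spans 6706 bases on the plus strand of chr10.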
Molecular Weight (calculated)
102039.0 Da
IEP (calculated)
6.598
GRAVY (calculated)
-0.555
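GRAVY (grand average of hydropathy) is conventionally the mean Kyte-Doolittle hydropathy value over all residues; a negative value indicates a protein that is hydrophilic on average. The page does not say which scale or tool produced the figure above, so the Python sketch below is an assumption about the method rather than a reproduction of it, and `gravy` is an illustrative name.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean hydropathy per residue; whitespace in the input is ignored."""
    residues = [aa for aa in sequence.upper() if not aa.isspace()]
    if not residues:
        raise ValueError("empty sequence")
    unknown = set(residues) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# First ten residues of the sequence listed on this page.
print(round(gravy("MFSQNMILPP"), 3))
```

Passing the full 922-residue sequence instead of the ten-residue prefix gives a value to compare with the figure reported above; any difference would suggest a different scale or rounding in the original calculation.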
Length
922 amino acids
Sequence
001: MFSQNMILPP ENPRPAGFAS KPMGNEIAKP PPSILVGRFK ALLKQRDDEL RATSVPVPPP STDEIVQIYE LLLSELTCNL KPIITDLTII AEQQREHAKG
101: IADAICARIL EVPVDQKLPS LYLLDSIVKN FGQEYIRYFS LRLPEVFCEA YRQVQPSLHS AMRHLFGTWS KVFPPSVLHK IEAELQFSQA VNTQSSTPNP
201: VRASESSRPS HGIHVNPKYL RQLERSTVDS ASKTHQFLSS SSRLGISSSS PLRIGVDRPL SASIDEYAVD NPGVDYGVAK ALGRDVDLTE WQRKLYSGDG
301: RNRFPTSFTY SLSNGHQRQS SRALIDAYGS DKSQETSSSK SLLVERLDRN GIDKVLSTSW QNTEEEEFDW ENMSPTLIDH SRNNSLLPST FGFSRERPGV
401: AANATLSEQD TRKGWSSGSQ LPPVDDSSAI AEDAFASSTF CRAPPGQVPG SQNQINHSLG SSQPHDAWKI SHHPSNIFSN RGRARNLMIP PIDNIRNTDN
501: NPYWVRPAVS RMEAHPSVLP APFEMRPSVN VNVTRPPIIN PLQKHVRSQF DAMNTSNPIA NHVVNKSSFM PEQSFDSVEN KDASILKIHQ LPNQLSGVIS
601: SNQQNHGQAP QLQFFPSQDP STSQFSHGSS SQGHGVSIST AMSNPLPVLP FPLPFQSISN NPLHLQGGAH PPLPPGRPPA PSQMIPHPNA GAFMPSQQPT
701: VGYTNLISSL MSQGVISLAN QLPAQDSVGT EFNPDILKIR HESAVNALYG DLPRQCTTCA LRFKCQEEHS SHMDWHVTKN RMSKSRKQKP SRKWFVSDRM
801: WLSGAEALGT ESAPGFLPTE TIEEMKDHEE LAVPAEEDQN TCALCGEPFD EFYSDEMEEW MYRGAVYLNA PLGITAGMDR SQLGPIIHAK CRSESNMATS
901: EDLGLDEKGA DEEGSQRKRM RS
001: MDSEKILNPR LVSINSTSRK GMSVELPQKP PPPPSLLDRF KALLNQREDE FGGGEEVLPP SMDEIVQLYE VVLGELTFNS KPIITDLTII AGEQREHGEG
101: IANAICTRIL EAPVEQKLPS LYLLDSIVKN IGRDYGRYFS SRLPEVFCLA YRQAHPSLHP SMRHLFGTWS SVFPPPVLRK IDMQLQLSSA ANQSSVGASE
201: PSQPTRGIHV NPKYLRRLEP SAAENNLRGI NSSARVYGQN SLGGYNDFED QLESPSSLSS TPDGFTRRSN DGANPSNQAF NYGMGRATSR DDEHMEWRRK
301: ENLGQGNDHE RPRALIDAYG VDTSKHVTIN KPIRDMNGMH SKMVTPWQNT EEEEFDWEDM SPTLDRSRAG EFLRSSVPAL GSVRARPRVG NTSDFHLDSD
401: IKNGVSHQLR ENWSLSQNYP HTSNRVDTRA GKDLKVLASS VGLVSSNSEF GAPPFDSIQD VNSRFGRALP DGTWPHLSAR GPNSLPVPSA HLHHLANPGN
501: AMSNRLQGKP LYRPENQVSQ SHLNDMTQQN QMLVNYLPSS SAMAPRPMQS LLTHVSHGYP PHGSTIRPSL SIQGGEAMHP LSSGVLSQIG ASNQPPGGAF
601: SGLIGSLMAQ GLISLNNQPA GQGPLGLEFD ADMLKIRNES AISALYGDLP RQCTTCGLRF KCQEEHSKHM DWHVTKNRMS KNHKQNPSRK WFVSASMWLS
701: GAEALGAEAV PGFLPTEPTT EKKDDEDMAV PADEDQTSCA LCGEPFEDFY SDETEEWMYK GAVYMNAPEE STTDMDKSQL GPIVHAKCRP ESNGGDMEEG
801: SQRKKMRS
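The sequence listings above use a numbered layout: each line starts with the 1-based position of its first residue, followed by residues in space-separated groups of ten. A minimal Python sketch for turning that layout back into a plain sequence, and for checking that the position labels are contiguous, could look like this. The function names are illustrative, and the two-line toy input at the end is made up for the example.

```python
import re

# "<position>: <residue groups separated by spaces>"
_LINE = re.compile(r"^\s*(\d+):\s*(.+?)\s*$")


def parse_numbered_line(line: str) -> tuple[int, str]:
    """Return (start position, residues without spaces) for one line."""
    match = _LINE.match(line)
    if match is None:
        raise ValueError(f"not a numbered sequence line: {line!r}")
    start = int(match.group(1))
    residues = "".join(match.group(2).split())
    return start, residues


def join_numbered_block(lines: list[str]) -> str:
    """Concatenate lines, verifying each label follows the previous line."""
    parts = []
    expected = 1
    for line in lines:
        start, residues = parse_numbered_line(line)
        if start != expected:
            raise ValueError(f"expected position {expected}, found {start}")
        parts.append(residues)
        expected += len(residues)
    return "".join(parts)


# Last line of the first listing: its final residue should sit at the
# position given under Length.
start, residues = parse_numbered_line("901: EDLGLDEKGA DEEGSQRKRM RS")
last_position = start + len(residues) - 1
print(last_position)

# Toy input (not from the page) showing the contiguity check.
print(join_numbered_block(["001: AAAA", "005: CC"]))
```

The last line of the first listing ends at position 922, which agrees with the stated length of 922 amino acids; the same check on the second listing ends at position 808.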
Arabidopsis Description
PCFS4 Polyadenylation and cleavage factor homolog 4 [Source:UniProtKB/Swiss-Prot;Acc:Q0WPF2]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera), as well as precomputed predictions of protein subcellular localization derived from protein sequences.