Subcellular Localization
min:
: max
Winner_takes_all: nucleus
Predictor Summary:
Predictor Summary:
- nucleus 5
- extracellular 1
- endoplasmic reticulum 1
- vacuole 1
- plasma membrane 1
- golgi 1
Predictors | GFP | MS/MS | Papers | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
Zm00001d017420_P001 | Maize | nucleus | 58.73 | 38.54 |
HORVU6Hr1G061300.6 | Barley | nucleus | 55.1 | 38.39 |
TraesCS6B01G284400.3 | Wheat | nucleus, plastid | 56.46 | 38.31 |
Zm00001d051316_P002 | Maize | nucleus | 57.82 | 38.17 |
EES05815 | Sorghum | nucleus | 60.77 | 37.22 |
TraesCS6A01G240200.1 | Wheat | nucleus | 55.78 | 36.88 |
TraesCS6D01G222500.1 | Wheat | nucleus | 55.33 | 36.36 |
CDY44964 | Canola | nucleus | 31.07 | 35.58 |
Os04t0541100-01 | Rice | nucleus | 47.85 | 34.88 |
CDX85980 | Canola | nucleus | 16.1 | 33.49 |
CDY62151 | Canola | nucleus | 31.29 | 25.32 |
CDY71745 | Canola | nucleus | 32.43 | 25.0 |
Os10t0516500-01 | Rice | cytosol, nucleus | 13.61 | 24.59 |
CDX93785 | Canola | nucleus | 32.65 | 24.45 |
Bra040010.1-P | Field mustard | nucleus | 33.33 | 23.63 |
Os02t0104500-01 | Rice | nucleus | 25.62 | 23.54 |
Bra036731.1-P | Field mustard | nucleus | 33.56 | 23.49 |
AT1G33240.1 | Thale cress | nucleus | 33.79 | 22.27 |
CDX85981 | Canola | nucleus | 14.74 | 17.81 |
Protein Annotations
EnsemblPlants:Os02t0648300-01 | EnsemblPlantsGene:Os02g0648300 | Gene3D:1.10.10.60 | GO:GO:0003674 | GO:GO:0003676 | GO:GO:0003677 |
GO:GO:0005488 | InterPro:Homeobox-like_sf | InterPro:IPR017877 | InterPro:Myb-like_dom | InterPro:SANT/Myb | ncoils:Coil |
PANTHER:PTHR21654 | PANTHER:PTHR21654:SF21 | PFAM:PF13837 | PFscan:PS50090 | ProteinID:BAS80038.1 | SEG:seg |
SMART:SM00717 | SUPFAM:SSF46689 | UniParc:UPI000393B6DD | UniProt:A0A0P0VMM0 | MapMan:15.5.20 | : |
Description
SANT domain, DNA binding domain containing protein. (Os02t0648300-01)
Coordinates
chr2:-:26099326..26101023
Molecular Weight (calculated)
47501.8 Da
IEP (calculated)
4.432
GRAVY (calculated)
-0.791
Length
441 amino acids
Sequence
(BLAST)
(BLAST)
001: MPVEDPQPLA MAWMMLPGAA DLGFLSMSSE SESDDESDEE EEEEEAVAPG GGGREGLGDD GDGDGEGGSS TRKLMAMFEG MMRQVTEKQD AMQRVFLETL
101: EKWEAERTER EEAWRRKEVA RINREREQLS KERAAAASRD AALIAFLQRV GGAGGEPVRL SPSSAGATRH DAAAAGLQLV PVPAPRAKAE DAWAAAGGDG
201: SGTTAPSRWP KEEVQALIDL RMEKEEQYND MGPKGPLWEE IAAGMQRIGY NRSAKRCKEK WENINKYFKK VKESNKRRPE DSKTCPYFHQ LDAIYRKKHF
301: AGRGGGGGGV TIAASHSSLA IVTVSEQDNP SQRELEGKSS NDVGNVQLAV PLLVHNAPDK KVEGSEGEPN VTAAAEETDS DEMCGEYTDD GDDDDKMQYK
401: IEFQKPTAGG GGDGNDAPVP ATTAAATSSA PTSNTSFLAV Q
101: EKWEAERTER EEAWRRKEVA RINREREQLS KERAAAASRD AALIAFLQRV GGAGGEPVRL SPSSAGATRH DAAAAGLQLV PVPAPRAKAE DAWAAAGGDG
201: SGTTAPSRWP KEEVQALIDL RMEKEEQYND MGPKGPLWEE IAAGMQRIGY NRSAKRCKEK WENINKYFKK VKESNKRRPE DSKTCPYFHQ LDAIYRKKHF
301: AGRGGGGGGV TIAASHSSLA IVTVSEQDNP SQRELEGKSS NDVGNVQLAV PLLVHNAPDK KVEGSEGEPN VTAAAEETDS DEMCGEYTDD GDDDDKMQYK
401: IEFQKPTAGG GGDGNDAPVP ATTAAATSSA PTSNTSFLAV Q
001: MMQLGGGTPT TTAAATTVTT ATAPPPQSNN NDSAATEAAA AAVGAFEVSE EMHDRGFGGN RWPRQETLAL LKIRSDMGIA FRDASVKGPL WEEVSRKMAE
101: HGYIRNAKKC KEKFENVYKY HKRTKEGRTG KSEGKTYRFF DQLEALESQS TTSLHHHQQQ TPLRPQQNNN NNNNNNNNSS IFSTPPPVTT VMPTLPSSSI
201: PPYTQQINVP SFPNISGDFL SDNSTSSSSS YSTSSDMEMG GGTATTRKKR KRKWKVFFER LMKQVVDKQE ELQRKFLEAV EKREHERLVR EESWRVQEIA
301: RINREHEILA QERSMSAAKD AAVMAFLQKL SEKQPNQPQP QPQPQQVRPS MQLNNNNQQQ PPQRSPPPQP PAPLPQPIQA VVSTLDTTKT DNGGDQNMTP
401: AASASSSRWP KVEIEALIKL RTNLDSKYQE NGPKGPLWEE ISAGMRRLGF NRNSKRCKEK WENINKYFKK VKESNKKRPE DSKTCPYFHQ LDALYRERNK
501: FHSNNNIAAS SSSSGLVKPD NSVPLMVQPE QQWPPAVTTA TTTPAAAQPD QQSQPSEQNF DDEEGTDEEY DDEDEEEENE EEEGGEFELV PSNNNNNKTT
601: NNL
101: HGYIRNAKKC KEKFENVYKY HKRTKEGRTG KSEGKTYRFF DQLEALESQS TTSLHHHQQQ TPLRPQQNNN NNNNNNNNSS IFSTPPPVTT VMPTLPSSSI
201: PPYTQQINVP SFPNISGDFL SDNSTSSSSS YSTSSDMEMG GGTATTRKKR KRKWKVFFER LMKQVVDKQE ELQRKFLEAV EKREHERLVR EESWRVQEIA
301: RINREHEILA QERSMSAAKD AAVMAFLQKL SEKQPNQPQP QPQPQQVRPS MQLNNNNQQQ PPQRSPPPQP PAPLPQPIQA VVSTLDTTKT DNGGDQNMTP
401: AASASSSRWP KVEIEALIKL RTNLDSKYQE NGPKGPLWEE ISAGMRRLGF NRNSKRCKEK WENINKYFKK VKESNKKRPE DSKTCPYFHQ LDALYRERNK
501: FHSNNNIAAS SSSSGLVKPD NSVPLMVQPE QQWPPAVTTA TTTPAAAQPD QQSQPSEQNF DDEEGTDEEY DDEDEEEENE EEEGGEFELV PSNNNNNKTT
601: NNL
Arabidopsis Description
Duplicated homeodomain-like superfamily protein [Source:UniProtKB/TrEMBL;Acc:Q9C6K3]
SUBAcon: [nucleus]
SUBAcon: [nucleus]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera) as well as precomputed predictions for protein subcellular localizations using protein sequences.