Subcellular Localization
Winner_takes_all: nucleus
Predictor Summary:
- nucleus 3
- mitochondrion 2
- plastid 1
- cytosol 1
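The winner-takes-all call can be read straight off the predictor tally above. The sketch below is a minimal illustration, not CropPAL's own code: the function name `winner_takes_all` and the alphabetical tie-break are assumptions made for the example.

```python
# Predictor votes as listed in the Predictor Summary.
votes = {"nucleus": 3, "mitochondrion": 2, "plastid": 1, "cytosol": 1}


def winner_takes_all(tally):
    """Return the location with the most predictor votes.

    Ties are broken alphabetically; this tie-break is an assumption,
    the page does not say how ties are resolved.
    """
    best = max(tally.values())
    return sorted(loc for loc, n in tally.items() if n == best)[0]


print(winner_takes_all(votes))  # nucleus
```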
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
nucleus:
21132161
|
PPI
Inferred distinct locusB in Crop
Inferred from Arabidopsis experimental PPI
Homology
Paralog
locus | Identity | Homology Identity |
---|---|---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
KRH38714 | Soybean | nucleus | 97.03 | 96.89 |
VIT_16s0022g01860.t01 | Wine grape | cytosol, nucleus, plastid | 77.33 | 79.42 |
GSMUA_Achr11P... | Banana | cytosol, endoplasmic reticulum, extracellular, golgi, mitochondrion, nucleus, plasma membrane, plastid, vacuole | 32.55 | 77.34 |
Zm00001d004153_P002 | Maize | nucleus | 13.68 | 77.04 |
AT5G51660.1 | Thale cress | cytosol | 73.32 | 73.58 |
Solyc03g007100.2.1 | Tomato | nucleus | 71.94 | 73.05 |
Bra022557.1-P | Field mustard | cytosol | 72.63 | 72.58 |
CDY46934 | Canola | cytosol | 72.63 | 72.58 |
CDX91600 | Canola | cytosol | 72.08 | 72.03 |
GSMUA_Achr11P... | Banana | cytosol, plasma membrane, plastid | 25.43 | 68.66 |
TraesCS2B01G074500.1 | Wheat | cytosol | 66.48 | 67.65 |
HORVU2Hr1G011580.28 | Barley | nucleus | 66.55 | 67.48 |
TraesCS2D01G060100.3 | Wheat | cytosol, nucleus, plastid | 66.55 | 67.25 |
KXG25932 | Sorghum | cytosol | 66.9 | 67.22 |
TraesCS2A01G061100.3 | Wheat | nucleus | 66.41 | 66.88 |
GSMUA_Achr11P... | Banana | cytosol | 9.68 | 62.22 |
Zm00001d004154_P002 | Maize | plasma membrane | 51.35 | 61.15 |
Os04t0252200-01 | Rice | cytosol, nucleus, plasma membrane | 9.4 | 56.43 |
KRH04684 | Soybean | extracellular | 3.8 | 52.38 |
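To compare locations across orthologs, the pipe-delimited rows can be parsed into records and tallied. This is a rough sketch using five rows copied from the table above; the helper names `parse_rows` and `location_counts` and the record field names are hypothetical, chosen only for illustration.

```python
from collections import Counter

# Five rows copied verbatim from the ortholog table:
# locus | species | location | identity | homology identity
ROWS = """\
KRH38714 | Soybean | nucleus | 97.03 | 96.89 |
VIT_16s0022g01860.t01 | Wine grape | cytosol, nucleus, plastid | 77.33 | 79.42 |
AT5G51660.1 | Thale cress | cytosol | 73.32 | 73.58 |
Solyc03g007100.2.1 | Tomato | nucleus | 71.94 | 73.05 |
Bra022557.1-P | Field mustard | cytosol | 72.63 | 72.58 |
"""


def parse_rows(text):
    """Turn pipe-delimited table rows into a list of dicts."""
    records = []
    for line in text.strip().splitlines():
        # Drop the trailing pipe, then split into the five cells.
        cells = [c.strip() for c in line.strip().rstrip("|").split("|")]
        locus, species, location, identity, homology_identity = cells
        records.append({
            "locus": locus,
            "species": species,
            "locations": [loc.strip() for loc in location.split(",")],
            "identity": float(identity),
            "homology_identity": float(homology_identity),
        })
    return records


def location_counts(records):
    """Count how many orthologs list each location."""
    return Counter(loc for rec in records for loc in rec["locations"])


records = parse_rows(ROWS)
print(location_counts(records))
```

For these five rows, nucleus and cytosol are each listed by three orthologs and plastid by one.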
Protein Annotations
EntrezGene:100785543 | MapMan:16.2.1.2.1 | Gene3D:2.130.10.10 | EMBL:ACUP02010275 | InterPro:Cleavage/polyA-sp_fac_asu_C | EnsemblPlantsGene:GLYMA_16G203900 |
GO:GO:0003674 | GO:GO:0003676 | GO:GO:0003723 | GO:GO:0003729 | GO:GO:0005488 | GO:GO:0005515 |
GO:GO:0005575 | GO:GO:0005622 | GO:GO:0005623 | GO:GO:0005634 | GO:GO:0006139 | GO:GO:0006378 |
GO:GO:0006379 | GO:GO:0008150 | GO:GO:0008152 | GO:GO:0009987 | UniProt:I1MQ84 | InterPro:IPR015943 |
EnsemblPlants:KRH09220 | ProteinID:KRH09220 | ProteinID:KRH09220.1 | PFAM:PF03178 | PFAM:PF10433 | PANTHER:PTHR10644 |
PANTHER:PTHR10644:SF2 | UniParc:UPI000233B069 | InterPro:WD40/YVTN_repeat-like_dom_sf | SEG:seg |
Description
hypothetical protein
Coordinates
chr16:-:36485467..36506395
Molecular Weight (calculated)
158375.0 Da
IEP (calculated)
6.134
GRAVY (calculated)
-0.084
Length
1447 amino acids
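Length and GRAVY can be recomputed from the sequence listing. The sketch below assumes GRAVY is the conventional mean Kyte-Doolittle hydropathy per residue; the page does not state which scale or rounding CropPAL uses, so results may differ slightly from the value shown. The helper names `clean_sequence` and `gravy` are hypothetical.

```python
import re

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def clean_sequence(block):
    """Strip position labels such as '0001:' and all whitespace."""
    parts = []
    for line in block.splitlines():
        line = re.sub(r"^\s*\d+:\s*", "", line)  # remove the leading label
        parts.append("".join(line.split()))       # remove spaces between 10-mers
    return "".join(parts)


def gravy(seq):
    """Mean hydropathy per residue (assumes only standard residues)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First two 10-residue groups of the sequence, in the listing's format.
block = "0001: MSFAAYKMMQ CPTGIDNCAA"
seq = clean_sequence(block)
print(len(seq), round(gravy(seq), 3))
```

Passing the full listing through `clean_sequence` should give a string whose length matches the Length field.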
Sequence
0001: MSFAAYKMMQ CPTGIDNCAA GFLTHSRSDF VPLQPDDLDA EWPSRPRHHV GSLPNLVVTA ANVLEVYAVR LQEDQPPKAA ADSRRGALLD GIAGASLELV
0101: CHYRLHGNVE TMAVLSIGGG DVSRRRDSIM LTFADAKISV LEYDDSIHGL RTSSLHCFEG PEWLHLKRGR EQFARGPVVK VDPQGRCGGV LIYDLQMIIL
0201: KATQAGSGLV GEDDALGSSG AVAARIESSY MINLRDLDMR HVKDFTFVHG YIEPVMVILH ERELTWAGRV SWKHHTCMIS ALSISTTLKQ HPLIWSAVNL
0301: PHDAYKLLAV PSPIGGVLVI SANTIHYHSQ SASCALALNS YAVTLDSSQE IPRSSFNVEL DAANATWLLS DVALLSTKTG ELLLLTLVYD GRVVQRLDLS
0401: KSKASVLSSG ITTIGNSLFF LASRLGDSML VQFSCGSGVS MLSSNLKEEV GDIEADAPSK RLRRSPSDAL QDMVSGEELS LYGSAPNRTE SAQKSFSFAV
0501: RDSLINVGPL KDFSYGLRIN ADANATGIAK QSNYELVCCS GHGKNGSLCV LRQSIRPEVI TEVELPGCKG IWTVYHKSTR SHNADSSKMA DDDDEYHAYL
0601: IISLEARTMV LETADLLSEV TESVDYYVQG KTLAAGNLFG RCRVIQVYER GARILDGSFM TQDVSFGASN LESGSASDSA IALSVSIADP FVLLRMSDGS
0701: IRLLIGDPST CTISVTSPAS FESSKGSVSS CTLYHDKGPE PWLRKTSTDA WLSTGVGETI DGTDGAAQDH GDIYCVVCFD NGNLEIFDVP NFNCVFSVEN
0801: FMSGKSHLVD ALMKEVLKDS KQGDRDGVIN QGRKENIPDM KVVELAMQRW SGQHSRPFLF GILSDGTILC YHAYLYESPD STSKVEDSAS AGGSIGLSST
0901: NVSRLRNLRF VRVPLDAYAR EDTSNGPPCQ QITIFKNIGS YEGFFLSGSR PAWVMVLRER LRVHPQLCDG SIVAFTVLHN VNCNQGLIYV TSQGVLKICQ
1001: LPSGSNYDSY WPVQKIPLKA TPHQVTYFAE KNLYPLIVSF PVLKPLNQVI SLVDQDINHQ NESQNMNPDE QNRFYPIDEF EVRIMEPEKS GGPWQTKATI
1101: PMQSSENALT VRMVTLVNTT SKENETLLAI GTAYVQGEDV AARGRILLFS LGKNTDNPQT LVSEVYSKEL KGAISALASL QGHLLIASGP KIILHKWNGT
1201: ELNGIAFFDA PPLHVVSLNI VKNFILIGDI HKSIYFLSWK EQGAQLSLLA KDFGSLDCFA TEFLIDGSTL SLMVSDDNRN IQIFYYAPKM SESWKGQKLL
1301: SRAEFHVGAH VTKFLRLQML STSDRAGAVP GSDKTNRFAL LFGTLDGSIG CIAPLDEITF RRLQSLQRKL VDAVPHVAGL NPRAFRLFRS NGKAHRPGPD
1401: SIVDCELLCH YEMLPLEEQL EIAHQVGTTR SQILSNLSDL SLGTSFL
0001: MSFAAYKMMH WPTGVENCAS GYITHSLSDS TLQIPIVSVH DDIEAEWPNP KRGIGPLPNV VITAANILEV YIVRAQEEGN TQELRNPKLA KRGGVMDGVY
0101: GVSLELVCHY RLHGNVESIA VLPMGGGNSS KGRDSIILTF RDAKISVLEF DDSIHSLRMT SMHCFEGPDW LHLKRGRESF PRGPLVKVDP QGRCGGVLVY
0201: GLQMIILKTS QVGSGLVGDD DAFSSGGTVS ARVESSYIIN LRDLEMKHVK DFVFLHGYIE PVIVILQEEE HTWAGRVSWK HHTCVLSALS INSTLKQHPV
0301: IWSAINLPHD AYKLLAVPSP IGGVLVLCAN TIHYHSQSAS CALALNNYAS SADSSQELPA SNFSVELDAA HGTWISNDVA LLSTKSGELL LLTLIYDGRA
0401: VQRLDLSKSK ASVLASDITS VGNSLFFLGS RLGDSLLVQF SCRSGPAASL PGLRDEDEDI EGEGHQAKRL RMTSDTFQDT IGNEELSLFG STPNNSDSAQ
0501: KSFSFAVRDS LVNVGPVKDF AYGLRINADA NATGVSKQSN YELVCCSGHG KNGALCVLRQ SIRPEMITEV ELPGCKGIWT VYHKSSRGHN ADSSKMAADE
0601: DEYHAYLIIS LEARTMVLET ADLLTEVTES VDYYVQGRTI AAGNLFGRRR VIQVFEHGAR ILDGSFMNQE LSFGASNSES NSGSESSTVS SVSIADPYVL
0701: LRMTDDSIRL LVGDPSTCTV SISSPSVLEG SKRKISACTL YHDKGPEPWL RKASTDAWLS SGVGEAVDSV DGGPQDQGDI YCVVCYESGA LEIFDVPSFN
0801: CVFSVDKFAS GRRHLSDMPI HELEYELNKN SEDNTSSKEI KNTRVVELAM QRWSGHHTRP FLFAVLADGT ILCYHAYLFD GVDSTKAENS LSSENPAALN
0901: SSGSSKLRNL KFLRIPLDTS TREGTSDGVA SQRITMFKNI SGHQGFFLSG SRPGWCMLFR ERLRFHSQLC DGSIAAFTVL HNVNCNHGFI YVTAQGVLKI
1001: CQLPSASIYD NYWPVQKIPL KATPHQVTYY AEKNLYPLIV SYPVSKPLNQ VLSSLVDQEA GQQLDNHNMS SDDLQRTYTV EEFEIQILEP ERSGGPWETK
1101: AKIPMQTSEH ALTVRVVTLL NASTGENETL LAVGTAYVQG EDVAARGRVL LFSFGKNGDN SQNVVTEVYS RELKGAISAV ASIQGHLLIS SGPKIILHKW
1201: NGTELNGVAF FDAPPLYVVS MNVVKSFILL GDVHKSIYFL SWKEQGSQLS LLAKDFESLD CFATEFLIDG STLSLAVSDE QKNIQVFYYA PKMIESWKGL
1301: KLLSRAEFHV GAHVSKFLRL QMVSSGADKI NRFALLFGTL DGSFGCIAPL DEVTFRRLQS LQKKLVDAVP HVAGLNPLAF RQFRSSGKAR RSGPDSIVDC
1401: ELLCHYEMLP LEEQLELAHQ IGTTRYSILK DLVDLSVGTS FL
Arabidopsis Description
CPSF160: Cleavage and polyadenylation specificity factor subunit 1 [Source:UniProtKB/Swiss-Prot;Acc:Q9FGR0]
SUBAcon: [cytosol]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata) and Wine grape (Vitis vinifera), as well as precomputed predictions of protein subcellular localization based on protein sequences.