Subcellular Localization
Winner_takes_all: nucleus
Predictor Summary:
- cytosol 5
- mitochondrion 2
- extracellular 1
- endoplasmic reticulum 1
- vacuole 1
- plasma membrane 1
- golgi 1
- plastid 1
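The predictor summary above is a vote tally per compartment. A minimal Python sketch of how such a tally could be held and ranked follows; the names `predictor_votes` and `top_predicted` are illustrative and not part of CropPAL. The sketch only ranks predictor votes and makes no claim about how the Winner_takes_all value on this page is derived.

```python
from collections import Counter

# Predictor votes copied from the summary list above.
predictor_votes = Counter({
    "cytosol": 5,
    "mitochondrion": 2,
    "extracellular": 1,
    "endoplasmic reticulum": 1,
    "vacuole": 1,
    "plasma membrane": 1,
    "golgi": 1,
    "plastid": 1,
})


def top_predicted(votes):
    """Return the compartment(s) holding the highest vote count, sorted by name."""
    best = max(votes.values())
    return sorted(name for name, count in votes.items() if count == best)


print(top_predicted(predictor_votes))   # ['cytosol']
print(sum(predictor_votes.values()))    # 13 votes in total
```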
Predictors | GFP | MS/MS | Papers |
---|---|---|---|
 | | nucleus: 21132161; endoplasmic reticulum: 27224218; nucleus: 27291164 | msms PMID: 21132161 (Soybean Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD 20705, USA. bret.cooper@ars.usda.gov) |
PPI
No PPI Data
Homology
Paralog
locus | Identity | Homology Identity |
---|---|---|
Ortholog
locus | Homology Species | Location | Identity | Homology Identity |
---|---|---|---|---|
KRH36263 | Soybean | cytosol, endoplasmic reticulum, nucleus | 97.64 | 97.64 |
VIT_06s0004g01160.t01 | Wine grape | cytosol | 90.09 | 89.95 |
Solyc01g006280.2.1 | Tomato | extracellular, nucleus | 88.21 | 88.49 |
PGSC0003DMT400081881 | Potato | cytosol | 88.05 | 88.33 |
GSMUA_Achr10P... | Banana | cytosol, mitochondrion, plastid | 81.76 | 87.69 |
CDY37133 | Canola | cytosol | 86.79 | 87.2 |
Bra018851.1-P | Field mustard | cytosol | 86.79 | 87.2 |
CDY22798 | Canola | cytosol | 86.48 | 86.89 |
OQU89595 | Sorghum | cytosol | 86.48 | 86.61 |
Bra030488.1-P | Field mustard | cytosol | 85.85 | 86.39 |
CDY29587 | Canola | cytosol | 85.53 | 86.08 |
CDY62106 | Canola | cytosol | 54.09 | 85.57 |
KRH33286 | Soybean | nucleus | 48.11 | 79.27 |
Os09t0446800-01 | Rice | plastid | 86.01 | 77.37 |
TraesCS5A01G232800.1 | Wheat | nucleus | 86.32 | 77.32 |
TraesCS5B01G231100.1 | Wheat | plastid | 86.32 | 76.68 |
TraesCS5D01G235400.2 | Wheat | plastid | 85.53 | 76.08 |
Zm00001d020669_P001 | Maize | extracellular | 86.16 | 74.76 |
HORVU5Hr1G058370.1 | Barley | cytosol | 44.03 | 74.47 |
CDY62107 | Canola | plastid | 27.67 | 72.43 |
AT2G12280.1 | Thale cress | plastid | 9.91 | 70.0 |
AT2G12200.1 | Thale cress | mitochondrion, plastid | 6.6 | 66.67 |
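Rows of the ortholog table above can be read into records for filtering or sorting. The following is a minimal Python sketch, assuming each row keeps the five pipe-separated columns shown in the header; `parse_ortholog_row` and the 80.0 identity cut-off are hypothetical choices for illustration, not CropPAL conventions.

```python
def parse_ortholog_row(line):
    """Split one pipe-delimited ortholog row into a typed record."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    locus, species, location, identity, homology_identity = cells
    return {
        "locus": locus,
        "species": species,
        # A row may list several compartments, separated by commas.
        "locations": [part.strip() for part in location.split(",")],
        "identity": float(identity),
        "homology_identity": float(homology_identity),
    }


# Two rows copied from the table above.
rows = [
    "KRH36263 | Soybean | cytosol, endoplasmic reticulum, nucleus | 97.64 | 97.64 |",
    "AT2G12280.1 | Thale cress | plastid | 9.91 | 70.0 |",
]
parsed = [parse_ortholog_row(row) for row in rows]

# Arbitrary illustrative threshold on the Identity column.
high_identity = [rec["locus"] for rec in parsed if rec["identity"] >= 80.0]
print(high_identity)
```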
Protein Annotations
KEGG:00670+6.3.4.3 | KEGG:00720+6.3.4.3 | Gene3D:1.10.8.770 | EntrezGene:100806844 | Gene3D:3.10.410.10 | MapMan:7.5.7.1 |
EMBL:ACUP02012778 | InterPro:Formate_THF_ligase | InterPro:Formate_THF_ligase_CS | EnsemblPlantsGene:GLYMA_20G243700 | GO:GO:0000166 | GO:GO:0003674 |
GO:GO:0003824 | GO:GO:0004329 | GO:GO:0004477 | GO:GO:0004488 | GO:GO:0005488 | GO:GO:0005524 |
GO:GO:0005575 | GO:GO:0005622 | GO:GO:0005623 | GO:GO:0005737 | GO:GO:0006139 | GO:GO:0008150 |
GO:GO:0008152 | GO:GO:0009058 | GO:GO:0009113 | GO:GO:0009257 | GO:GO:0009987 | GO:GO:0016787 |
GO:GO:0055114 | UniProt:I1NJ85 | EnsemblPlants:KRG93013 | ProteinID:KRG93013 | ProteinID:KRG93013.1 | HAMAP:MF_01543 |
InterPro:P-loop_NTPase | PFAM:PF01268 | ScanProsite:PS00721 | ScanProsite:PS00722 | PANTHER:PTHR43274 | MetaCyc:PWY-1722 |
MetaCyc:PWY-2161 | MetaCyc:PWY-2201 | MetaCyc:PWY-3841 | SUPFAM:SSF52540 | UniParc:UPI000233DE2D | SEG:seg |
Description
hypothetical protein
Coordinates
chr20:-:47396292..47400237
Molecular Weight (calculated)
67441.4 Da
IEP (calculated)
6.975
GRAVY (calculated)
0.033
Length
636 amino acids
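The Coordinates value above packs chromosome, strand, and start/end positions into one string. A minimal Python sketch for splitting it is given below; the `Locus` type and both function names are illustrative, and the span length assumes 1-based inclusive positions, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    chrom: str
    strand: str
    start: int
    end: int


# Pattern for strings shaped like "chr20:-:47396292..47400237".
_COORD = re.compile(
    r"^(?P<chrom>[^:]+):(?P<strand>[+-]):(?P<start>\d+)\.\.(?P<end>\d+)$"
)


def parse_coordinates(text):
    """Parse 'chrom:strand:start..end' into a Locus, or raise ValueError."""
    match = _COORD.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return Locus(match["chrom"], match["strand"],
                 int(match["start"]), int(match["end"]))


def span_length(locus):
    """Length of the genomic span, assuming 1-based inclusive positions."""
    return locus.end - locus.start + 1


locus = parse_coordinates("chr20:-:47396292..47400237")
print(locus.chrom, locus.strand, span_length(locus))  # chr20 - 3946
```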
Sequence
001: MSSSTTVRKL QVVSPVPADI DIANSVEPVH ISQIAKDLNL SPNHYDLYGK YKAKVLLSVL DELQGSEDGY YVVVGGITPT PLGEGKSTTT VGLCQALGAF
101: LDKKVVTCLR QPSQGPTFGI KGGAAGGGYS QVIPMDEFNL HLTGDIHAIT AANNLLAAAI DTRIFHESTQ SDKALFNRLC PPNKEGKRSF SDVMFRRLTK
201: LGISKTNPDD LTPEEVNKFA RLDIDPNSIT WRRVMDINDR FLRKIAIGQG PDEKGMVRET GFDISVASEI MAVLALTTSL ADMRERLGKM VIGNSKSGDP
301: VTADDLGVGG ALTVLMKDAI HPTLMQTLEG TPVLVHAGPF ANIAHGNSSI VADKIALKLV GPGGFVVTEA GFGADIGAEK FMNIKCRYSG LTPQCAIIVA
401: TIRALKMHGG GPAVVAGRPL DHAYLTENVA LVEAGCVNMA RHISNTKSYG VNVVVAINKF STDTEAELNA VRSAALAAGA YDAVICTHHA NGGKGAVDLG
501: IAVQKACENV TQPLKFLYPV DLSIKEKIEA IAKSYGASGV EYSEQAEKQI EMYSKQGFSG LPICMAKTQY SFSDNAAAKG APSGFVLPIR DVRASIGAGF
601: IYPLVGTMST MPGLPTRPCF YDIDLDTTTG KVIGLS
001: MSSSTRKLEV VSPVPADIDI ANSVEPLHIS EIAKDLNINP LHYDLYGKYK AKVLLSAFDE LQGQEDGYYV VVGGITPTPL GEGKSTTTVG LCQALGAYLD
101: KKVVTCLRQP SQGPTFGIKG GAAGGGYSQV IPMDEFNLHL TGDIHAITAS NNLLAAAIDT RIFHETSQSD KALFNRLCPP NKEGKRSFSD IMFRRLTKLG
201: ISKTSPEELT PEEIKKFARL DIDPASITWR RVMDVNDRFL RKITIGQGPE EKGMTRETGF DISVASEIMA VLALTTSLGD MRERLGKMVI GNSKAGDPIT
301: ADDLGVGGAL TVLMKDAINP TLMQTLEGTP VLVHAGPFAN IAHGNSSIVA DKIALKLVGP GGFVVTEAGF GSDIGTEKFM NIKCRYSGLT PQCAIVVATV
401: RALKMHGGGP DVVAGRPLDR AYVSENVSLV EAGCVNLAKH ISNTKAYGVN VIVAVNMFAT DTEAELNAVR KFSMDAGAFD AVVCSHHAHS GKGAVDLGIA
501: VEKACQNITQ PLRFLYPLDI GIKDKIEAIA KSYGASGVEY SDQAEKQIEM YTQQGFSNLP ICMSKTQYSF SHDASKKGAP SGFVLPIRDV RGSIGAGFIY
601: PLVGTMSTMP GLPTRPCFYE IDIDTETGKV RGLS
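The numbered sequence blocks above can be joined into a plain residue string, from which values such as Length and GRAVY can be recomputed. The sketch below is a minimal Python example; the function names are illustrative, and the use of the Kyte-Doolittle hydropathy scale is an assumption, since this page does not say which scale its GRAVY value is based on.

```python
# Kyte-Doolittle hydropathy values per residue (assumed scale, see lead-in).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def join_numbered_blocks(lines):
    """Drop the 'NNN:' offset and the spaces from each line, then concatenate."""
    parts = []
    for line in lines:
        offset, sep, body = line.partition(":")
        if not sep:          # line carries no offset prefix
            body = offset
        parts.append(body.replace(" ", "").strip())
    return "".join(parts)


def gravy(sequence):
    """Mean hydropathy over all residues; raises KeyError on non-standard letters."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Final line of the first sequence above, used as a short example.
tail = join_numbered_blocks(["601: IYPLVGTMST MPGLPTRPCF YDIDLDTTTG KVIGLS"])
print(len(tail))     # 36 residues on this line
print(gravy(tail))   # mean hydropathy of this fragment only
```

Passing all lines of one sequence to `join_numbered_blocks` and taking `len` of the result gives a way to check it against the Length field.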
Arabidopsis Description
THFS: Formate--tetrahydrofolate ligase [Source:UniProtKB/Swiss-Prot;Acc:Q9SPK5]
SUBAcon: [cytosol]
Hydropathy Plot
About CropPAL
The Protein Annotated Locations Database (CropPAL) houses large-scale proteomic and GFP localization data from published experimental studies in Soybean (Glycine max), Maize (Zea mays), Wheat (Triticum aestivum), Barley (Hordeum vulgare), Rice (Oryza sativa), Field mustard (Brassica rapa), Canola (Brassica napus), Sorghum (Sorghum bicolor), Potato (Solanum tuberosum), Tomato (Solanum lycopersicum), Banana (Musa acuminata), and Wine grape (Vitis vinifera). It also provides precomputed predictions of protein subcellular localization derived from protein sequences.