Endoglucanase H

Details

Name
Endoglucanase H
Kind
protein
Synonyms
  • 3.2.1.4
  • Cellulase H
  • EgH
  • Endo-1,4-beta-glucanase H
Gene Name
celH
UniProtKB Entry
P16218Swiss-Prot
Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
NCBI Taxonomy ID
203119
Amino acid sequence
>lcl|BSEQ0017435|Endoglucanase H
MKKRLLVSFLVLSIIVGLLSFQSLGNYNSGLKIGAWVGTQPSESAIKSFQELQGRKLDIV
HQFINWSTDFSWVRPYADAVYNNGSILMITWEPWEYNTVDIKNGKADAYITRMAQDMKAY
GKEIWLRPLHEANGDWYPWAIGYSSRVNTNETYIAAFRHIVDIFRANGATNVKWVFNVNC
DNVGNGTSYLGHYPGDNYVDYTSIDGYNWGTTQSWGSQWQSFDQVFSRAYQALASINKPI
IIAEFASAEIGGNKARWITEAYNSIRTSYNKVIAAVWFHENKETDWRINSSPEALAAYRE
AIGAGSSNPTPTPTWTSTPPSSSPKAVDPFEMVRKMGMGTNLGNTLEAPYEGSWSKSAME
YYFDDFKAAGYKNVRIPVRWDNHTMRTYPYTIDKAFLDRVEQVVDWSLSRGFVTIINSHH
DDWIKEDYNGNIERFEKIWEQIAERFKNKSENLLFEIMNEPFGNITDEQIDDMNSRILKI
IRKTNPTRIVIIGGGYWNSYNTLVNIKIPDDPYLIGTFHYYDPYEFTHKWRGTWGTQEDM
DTVVRVFDFVKSWSDRNNIPVYFGEFAVMAYADRTSRVKWYDFISDAALERGFACSVWDN
GVFGSLDNDMAIYNRDTRTFDTEILNALFNPGTYPSYSPKPSPTPRPTKPPVTPAVGEKM
LDDFEGVLNWGSYSGEGAKVSTKIVSGKTGNGMEVSYTGTTDGYWGTVYSLPDGDWSKWL
KISFDIKSVDGSANEIRFMIAEKSINGVGDGEHWVYSITPDSSWKTIEIPFSSFRRRLDY
QPPGQDMSGTLDLDNIDSIHFMYANNKSGKFVVDNIKLIGATSDPTPSIKHGDLNFDNAV
NSTDLLMLKRYILKSLELGTSEQEEKFKKAADLNRDNKVDSTDLTILKRYLLKAISEIPI
Number of residues
900
Molecular Weight
102415.005
Theoretical pI
5.26
GO Classification
Functions
cellulase activity
Processes
cellulose catabolic process
General Function
This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.
Specific Function
beta-glucosidase activity
Pfam Domain Function
Signal Regions
1-44
Transmembrane Regions
Not Available
Cellular Location
Cytoplasmic
Gene sequence
>lcl|BSEQ0017436|Endoglucanase H (celH)
ATGAAAAAAAGGCTTTTAGTTTCTTTTTTGGTGTTAAGCATAATTGTAGGATTACTTTCT
TTTCAGTCGCTTGGTAATTACAACAGTGGTTTAAAAATCGGTGCTTGGGTGGGAACCCAG
CCGTCAGAATCAGCAATTAAGAGTTTTCAGGAACTTCAGGGTAGAAAGCTTGATATTGTC
CACCAGTTTATTAACTGGTCAACTGATTTTTCCTGGGTAAGACCTTATGCCGACGCTGTT
TATAATAACGGCTCAATATTAATGATTACCTGGGAACCTTGGGAATACAACACTGTAGAT
ATCAAAAACGGTAAAGCGGATGCTTACATAACCAGAATGGCGCAAGATATGAAAGCCTAT
GGCAAGGAAATTTGGTTAAGACCTCTTCATGAAGCCAACGGAGACTGGTATCCATGGGCC
ATAGGATATTCTTCAAGAGTAAACACAAACGAAACTTACATAGCCGCTTTCAGACATATT
GTCGATATTTTCCGTGCCAACGGAGCCACCAACGTCAAATGGGTGTTTAATGTAAACTGC
GACAATGTAGGTAACGGCACAAGTTATCTGGGTCATTATCCCGGAGATAATTATGTAGAC
TACACCTCAATTGACGGATACAACTGGGGTACCACTCAAAGCTGGGGAAGCCAATGGCAA
AGCTTTGATCAGGTTTTCTCCAGAGCCTACCAAGCTTTGGCATCAATAAACAAACCCATC
ATTATAGCAGAGTTTGCATCAGCTGAAATAGGCGGAAACAAGGCAAGATGGATTACAGAA
GCATATAACTCTATAAGAACATCCTACAACAAGGTAATTGCTGCAGTATGGTTTCACGAG
AACAAAGAAACCGACTGGAGAATCAACTCAAGTCCTGAAGCCCTTGCAGCATACAGGGAG
GCAATAGGAGCCGGTTCATCAAATCCTACCCCTACTCCAACTTGGACCTCTACTCCACCA
TCAAGCTCACCAAAGGCTGTCGACCCCTTTGAAATGGTTAGAAAAATGGGTATGGGAACA
AACCTCGGAAACACTCTCGAAGCTCCCTATGAAGGCTCCTGGTCCAAGTCTGCCATGGAA
TATTATTTTGATGATTTTAAAGCTGCAGGATATAAAAACGTAAGAATCCCTGTAAGATGG
GACAACCATACAATGAGGACATACCCGTATACCATTGACAAAGCCTTTTTGGACAGGGTT
GAGCAAGTGGTTGACTGGTCACTTTCAAGAGGTTTTGTTACAATTATAAATTCTCACCAT
GATGACTGGATCAAGGAAGACTATAACGGAAACATAGAACGGTTTGAAAAGATATGGGAA
CAGATTGCGGAAAGGTTTAAAAACAAATCCGAAAATCTTCTGTTTGAAATCATGAATGAG
CCTTTCGGTAACATTACAGACGAACAAATAGACGACATGAACAGCAGAATATTAAAAATA
ATCAGAAAGACCAATCCAACCCGTATTGTTATAATAGGCGGAGGTTATTGGAACAGTTAT
AATACGCTTGTAAACATTAAAATTCCTGATGACCCATACTTAATCGGAACTTTCCATTAC
TATGACCCATATGAATTTACTCACAAGTGGAGAGGTACATGGGGTACTCAGGAAGACATG
GATACTGTAGTAAGAGTATTTGATTTTGTTAAGAGTTGGTCTGACAGAAACAATATCCCG
GTATATTTTGGAGAATTTGCCGTAATGGCTTATGCCGACAGAACTTCCCGTGTAAAATGG
TATGATTTTATAAGTGATGCGGCCCTGGAGCGCGGTTTTGCATGTTCCGTATGGGATAAC
GGCGTTTTTGGTTCATTGGATAATGACATGGCTATTTACAACAGAGATACCCGTACCTTT
GACACTGAAATCCTCAATGCACTATTTAATCCCGGAACATATCCGTCTTATTCTCCGAAA
CCTTCACCAACTCCAAGACCGACCAAACCGCCCGTAACACCGGCTGTCGGTGAAAAAATG
CTGGATGATTTTGAGGGTGTGTTAAATTGGGGTTCATACTCCGGTGAAGGTGCAAAAGTT
TCAACAAAAATTGTGTCCGGAAAAACAGGAAACGGCATGGAAGTCAGCTACACCGGGACA
ACGGACGGCTACTGGGGAACAGTATACAGTTTACCGGACGGCGATTGGTCAAAATGGCTT
AAAATCTCTTTTGACATTAAGTCCGTTGACGGTTCTGCCAATGAAATCAGATTTATGATT
GCTGAAAAAAGCATAAACGGTGTGGGAGACGGAGAACACTGGGTTTACTCAATAACTCCC
GACAGTTCGTGGAAAACTATAGAAATACCGTTCTCCAGCTTTAGAAGAAGACTTGATTAT
CAGCCGCCTGGACAGGATATGAGCGGTACTTTGGATCTTGACAATATAGATTCAATTCAC
TTCATGTATGCCAACAACAAGTCGGGAAAATTTGTCGTAGACAATATCAAGCTGATTGGT
GCTACTTCCGATCCGACTCCTTCAATAAAACACGGAGATTTGAACTTCGATAATGCAGTG
AATTCTACAGACTTGTTAATGCTTAAAAGGTATATCCTCAAATCTTTGGAACTCGGTACA
TCTGAGCAGGAGGAAAAATTCAAAAAAGCGGCAGATTTAAACAGGGACAACAAGGTCGAC
TCCACTGACTTGACAATTTTGAAAAGATACTTGCTGAAAGCCATCAGTGAAATACCCATA
TAA
Chromosome Location
Not Available
Locus
Not Available
External Identifiers
ResourceLink
UniProtKB IDP16218
UniProtKB Entry NameGUNH_CLOTH
GenBank Protein ID144774
GenBank Gene IDM31903
PDB ID(s)1V0A, 2BV9, 2BVD, 2CIP, 2CIT, 2LRO, 2LRP, 2V3G, 2VI0, 4U3A, 4U5I, 4U5K
KEGG IDcth:Cthe_1472
General References
  1. Yague E, Beguin P, Aubert JP: Nucleotide sequence and deletion analysis of the cellulase-encoding gene celH of Clostridium thermocellum. Gene. 1990 Apr 30;89(1):61-7. [Article]

Associated Data

Drug Relations
DrugDrug groupPharmacological action?TypeActionsDetails
4-MethylcoumarinexperimentalunknowntargetDetails