Genome polyprotein
Details
- Name
- Genome polyprotein
- Synonyms
- 3.6.1.15
- P2A
- Gene Name
- Not Available
- Organism
- HHAV (human hepatitis A virus)
- Amino acid sequence
>lcl|BSEQ0019410|Genome polyprotein MNMSRQGIFQTVGSGLDHILSLADIEEEQMIQSVDRTAVTGASYFTSVDQSSVHTAEVGS HQVEPLRTSVDKPGSKKTQGEKFFLIHSADWLTTHALFHEVAKLDVVKLLYNEQFAVQGL LRYHTYARFGIEIQVQINPTPFQQGGLICAMVPGDQSYGSIASLTVYPHGLLNCNINNVV RIKVPFIYTRGAYHFKDPQYPVWELTIRVWSELNIGTGTSAYTSLNVLARFTDLELHGLT PLSTQMMRNEFRVSTTENVVNLSNYEDARAKMSFALDQEDWKSDPSQGGGIKITHFTTWT SIPTLAAQFPFNASDSVGQQIKVIPVDPYFFQMTNTNPDQKCITALASICQMFCFWRGDL VFDFQVFPTKYHSGRLLFCFVPGNELIDVSGITLKQATTAPCAVMDITGVQSTLRFRVPW ISDTPYRVNRYTKSAHQKGEYTAIGKLIVYCYNRLTSPSNVASHVRVNVYLSAINLECFA PLYHAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMKDLKGKANRGKMDVSGVQAPV GAITTIEDPVLAKKVPETFPELKPGESRHTSDHMSIYKFMGRSHFLCTFTFNSNNKEYTF PITLSSTSNPPHGLPSTLRWFFNLFQLYRGPLDLTIIIIGATDVDGMAWFTPVGLAVDTP WVEKESALSIDYKTALGAVRFNTRRTGNIQIRLPWYSYLYAVSGALDGLGDKTDSTFGLV SIQIANYNHSDEYLSFSCYLSVTEQSEFYFPRAPLNSNAMLSTESMMSRIAAGDLESSVD DPRSEEDKRFESHIECRKPYKELRLEVGKQRLKYAQEELSNEVLPPPRKKKGLFSQAKIS LFYTEEHEIMKFSWRGVTADTRALRRFGFSLAAGRSVWTLEMDAGVLTGRLIRLNDEKWT EMKDDKIVSLIEKFTSNKYWSKVNFPHGMLDLEEIAANSKDFPNMSETDLCFLLHWLNPK KINLADRMLGLSGVQEIKEQGVGLIAECRTFLDSIAGTLKSMMFGFHHSVTVEIINTVLC FVKSGILLYVMQQLNQDEHSHIIGLLRVMNYVDIGCSVISCGKVFSKMLETVFNWQMDSR MMELRTQSFSNWLRDICSGITIFKNFKDAIYWLYTKLNDFYEVNYGKKKDILNILKDNQQ KIEKAIEEADKFSILQIQDVEKFEQYQKGVDLIQKLRTVHSMAQVDPNLMVHLSPLRDCI ARVHQKLKNLGSINQAMVTRCEPVVCYLYGKRGGGKSLTSIALATKICKHYGVEPEKNIY TKPVASDYWDGYSGQLVCIIDDIGQNTTDEDWSDFCQLVSGCPLRLNMASLEEKGRHFSS PFIIATSNWSNPSPKTVYVKEAIDRRLHFKVEVNPASFSKNPHNDMLNVNLAKTNDAIKD MSCVDLIMDGHNVSLMDLLSSLVMTVEIRKQNMTAFMELWSQGISDDDNDSAMAEFFQSF PSGEPSNSKLSGFFQSVTNHKWVAVGAAVGILGVLVGGWFVYKHFSRKEEEPIPAEGVYH GVTKPKQVIKLDADPVESQSTLEIAGLVRKNLVQFGVGEKNGCVRWVMNALGVKDDWLLV PSHAYKFEKDYEMMEFYFNRGGTYYSISAGNVVIQSLDVGFQDVVLMKVPTIPKFRDITQ HFIKKGDVPRALNRLATLVTTVNGTPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAW RGKGEGLPGMCGGALVSSNQSIQNAILGIHVAGGNSILVAKLVTQEMFQNIDKKIESQRI MKVEFTQCSMNVVSKTLFRKSPIHHHIDKTMINFPAAMPFSKAEIDPMAMMLSKYSLPIV EEPEDYKEASIFYQNKIVGKTQLVDDFLDLDMAITGAPGIDAINMDSSPGFPYVQERLTK RDLIWLDENGLLLGVHPRLAQRILFNTVMMENCSDLDVVFTTCPKDELRPLEKVLESKTR 
AIDACPLDYTILCRMYWGPAISYFHLNPGFHTGVAIGIDPDCQWDELFKTMIRFGDVGLD LDFSAFDASLSPFMIREAGRIMSELSGTPSHFGTALMNTIIYSKHLLYNCCYHVCGSMPS GSPCTALLNSIINNVNLYYVFSKIFGKSPVFFCQALKILCYGDDVLIVFSRDVQIDNLDL IGQKIVDEFKKLGMTATSADKNVPQLKPVSELTFLKRSFNLVEDRIRPAISEKTIWSLIA WQRSNAEFEQNLENAQWFAFMHGYEFYQKFYYFVQSCLEKEMIEYRLKSYDWWRMRFYDQ CFICDLS
- Number of residues
- 2227
- Molecular Weight
- 251425.38
- Theoretical pI
- 6.53
- GO Classification
- Functions: ATP binding / cysteine-type endopeptidase activity / ion channel activity / RNA binding / RNA helicase activity / RNA-directed RNA polymerase activity / structural molecule activity
- Processes: pore formation by virus in membrane of host cell / protein oligomerization / RNA-protein covalent cross-linking / suppression by virus of host gene expression / suppression by virus of host MAVS activity / suppression by virus of host MAVS activity by MAVS proteolysis / transcription, DNA-templated / viral entry into host cell / viral RNA genome replication / virion attachment to host cell
- Components: host cell cytoplasmic vesicle membrane / host cell mitochondrial outer membrane / integral to membrane of host cell / membrane / viral capsid
- General Function
- Structural molecule activity
- Specific Function
- Capsid proteins VP1, VP2, and VP3 form a closed capsid enclosing the viral positive-strand RNA genome. All of these proteins contain a beta-sheet structure called a beta-barrel jelly roll. Together they form an icosahedral capsid (T=3) composed of 60 copies each of VP1, VP2, and VP3, with a diameter of approximately 300 Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3 are located at the quasi-sixfold axes. The capsid interacts with HAVCR1 to provide virion attachment to the target cell (By similarity).
- Protein VP0: The VP0 precursor is a component of immature procapsids. The N-terminal domain of VP0, protein VP4, is needed for the assembly of 12 pentamers into the icosahedral structure. Unlike in other picornaviruses, HAV VP4 does not seem to be myristoylated and has not been detected in mature virions, presumably owing to its small size (By similarity).
- VP1-2A precursor: A component of immature procapsids that corresponds to an extended form of the structural protein VP1. The C-terminal domain of VP1-2A, protein 2A, acts as an assembly signal that allows multimerization of VP1-2A and formation of pentamers of VP1-VP2-VP3 trimers. It is proteolytically removed from the precursor by a host protease and does not seem to be found in mature particles (By similarity).
- Protein 2B and 2BC precursor: Affect membrane integrity and cause an increase in membrane permeability.
- Protein 2C: Associates with and induces structural rearrangements of intracellular membranes. It displays RNA-binding, nucleotide-binding, and NTPase activities (By similarity).
- Protein 3A: Via its hydrophobic domain, serves as a membrane anchor for the 3AB and 3ABC precursors.
- 3AB precursor: Interacts with the 3CD precursor and with RNA structures found at both the 5'- and 3'-termini of the viral genome. Since the 3AB precursor contains the hydrophobic domain 3A, it probably anchors the whole viral replicase complex to the intracellular membranes on which viral RNA synthesis occurs (By similarity).
- 3ABC precursor: Targeted to the mitochondrial membrane, where protease 3C activity cleaves and inhibits the host antiviral protein MAVS, thereby disrupting activation of IRF3 through the IFIH1/MDA5 pathway. In vivo, the protease activity of the 3ABC precursor is more efficient at cleaving the 2BC precursor than that of protein 3C. The 3ABC precursor may therefore play a role in the proteolytic processing of the polyprotein (By similarity).
- Protein 3B: Covalently linked to the 5'-end of both the positive-strand and negative-strand genomic RNAs. It acts as a genome-linked replication primer (By similarity).
- Protease 3C: Cysteine protease that generates mature viral proteins from the precursor polyprotein. In addition to its proteolytic activity, it binds to viral RNA and thus influences viral genome replication. RNA and substrate bind cooperatively to the protease. It also cleaves host proteins such as PCBP2 (By similarity).
- RNA-directed RNA polymerase 3D-POL: Replicates genomic and antigenomic RNA by recognizing replication-specific signals.
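The capsid figures quoted above (12 pentamers, 60 copies each of VP1, VP2, and VP3, T=3) can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that each pentamer consists of five VP1-VP2-VP3 protomers and that an icosahedral shell with triangulation number T contains 60 × T subunits. The function and constant names are hypothetical and not part of this record.

```python
# Figures taken from the capsid description in this record.
PENTAMERS = 12
CAPSID_PROTEINS = ("VP1", "VP2", "VP3")

# Assumption: a pentamer is built from five protomers,
# each protomer being one VP1-VP2-VP3 trimer.
PROTOMERS_PER_PENTAMER = 5


def capsid_counts(pentamers=PENTAMERS,
                  per_pentamer=PROTOMERS_PER_PENTAMER,
                  proteins=CAPSID_PROTEINS):
    """Return protomer count, copies per protein, total chains, and implied T."""
    protomers = pentamers * per_pentamer
    copies = {name: protomers for name in proteins}
    total_chains = sum(copies.values())
    # Assumption: an icosahedral shell holds 60 * T subunits.
    t_number = total_chains // 60
    return {
        "protomers": protomers,
        "copies": copies,
        "total_chains": total_chains,
        "t_number": t_number,
    }


if __name__ == "__main__":
    print(capsid_counts())
```

Under these assumptions the 12 pentamers give 60 protomers, hence 60 copies of each capsid protein and 180 chains in total, which agrees with the T=3 value stated in the description.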
- Pfam Domain Function
- Transmembrane Regions
- Not Available
- Cellular Location
- Virion
- Gene sequence
>lcl|BSEQ0006522|6684 bp ATGAATATGTCTAGACAGGGTATTTTTCAGACTGTTGGGAGTGGTCTTGACCACATCCTG TCTCTGGCAGATATTGAGGAAGAGCAAATGATTCAGTCAGTTGATAGGACTGCAGTGACT GGTGCTTCTTATTTTACTTCTGTGGATCAATCTTCAGTTCATACTGCTGAGGTTGGATCA CATCAGGTTGAACCTTTGAGAACCTCTGTTGATAAACCTGGATCAAAGAAGACTCAAGGA GAGAAATTTTTTTTGATTCATTCTGCAGATTGGCTTACTACACATGCTCTTTTCCATGAA GTTGCAAAATTGGATGTGGTGAAATTATTGTATAATGAACAGTTTGCTGTTCAAGGTTTG TTGAGATACCATACATATGCAAGATTTGGCATTGAAATTCAAGTTCAGATAAACCCTACA CCTTTTCAACAAGGGGGATTGATCTGTGCTATGGTTCCTGGTGATCAGAGCTATGGTTCT ATAGCATCATTGACTGTTTATCCTCATGGTTTGCTGAATTGCAATATCAACAATGTGGTG AGAATAAAGGTTCCATTTATTTACACAAGAGGTGCTTACCATTTTAAAGATCCACAATAC CCAGTCTGGGAATTGACAATTAGAGTTTGGTCAGAATTAAATATTGGAACAGGAACTTCA GCTTATACTTCACTCAATGTTTTAGCTAGATTTACAGATTTGGAGTTGCATGGATTAACT CCTCTTTCCACACAAATGATGAGGAATGAATTTAGGGTCAGTACCACTGAAAATGTGGTG AATTTGTCAAATTATGAAGATGCAAGAGCAAAGATGTCTTTTGCTTTAGATCAGGAAGAT TGGAAATCTGATCCATCCCAAGGTGGCGGAATTAAAATTACTCATTTTACTACTTGGACA TCTATCCCAACTTTGGCTGCTCAGTTTCCATTTAATGCTTCTGATTCAGTTGGTCAGCAA ATTAAAGTTATTCCAGTTGATCCATATTTTTTCCAAATGACAAACACAAACCCTGACCAA AAATGTATAACTGCTTTGGCTTCTATTTGTCAGATGTTTTGTTTTTGGAGAGGAGATCTT GTCTTTGATTTTCAGGTTTTTCCGACCAAATATCATTCAGGTAGATTATTATTTTGTTTT GTTCCTGGTAATGAGCTAATAGATGTTTCTGGAATCACGTTAAAACAGGCAACTACTGCT CCTTGTGCAGTGATGGACATTACAGGAGTGCAGTCAACTTTGAGATTTCGTGTTCCTTGG ATTTCTGATACACCCTATCGGGTAAACAGATATACAAAGTCAGCACATCAGAAAGGTGAG TACACTGCCATTGGGAAGCTTATTGTGTACTGTTATAACAGATTGACTTCTCCTTCTAAC GTTGCTTCCCATGTTAGAGTGAATGTTTATCTTTCAGCAATTAATTTGGAATGCTTTGCT CCTCTTTATCATGCCATGGATGTTACCACACAGGTTGGAGATGATTCTGGGGGTTTTTCA ACAACAGTTTCTACAGAGCAGAATGTTCCAGATCCCCAAGTTGGTATAACAACCATGAAG GATTTAAAAGGCAAAGCTAATAGGGGAAAAATGGATGTTTCAGGAGTGCAAGCACCTGTG GGAGCTATCACAACAATTGAGGATCCTGTTTTAGCAAAGAAAGTACCTGAGACATTCCCT GAATTGAAACCTGGAGAGTCCAGGCATACATCAGATCATATGTCTATTTACAAGTTTATG GGAAGGTCTCATTTTTTGTGCACTTTTACTTTTAATTCAAACAATAAAGAGTACACATTT CCTATAACCTTGTCTTCAACCTCCAATCCTCCTCATGGTTTACCATCAACATTAAGGTGG TTTTTCAATTTGTTTCAATTGTATAGAGGACCTTTAGATCTGACAATTATCATCATCGGA 
GCAACTGATGTAGATGGCATGGCTTGGTTTACACCAGTAGGTCTTGCCGTTGATACTCCC TGGGTAGAAAAGGAGTCAGCCCTGTCTATTGACTACAAAACTGCTCTTGGAGCTGTCAGA TTCAATACAAGGAGAACAGGGAACATTCAGATTAGGTTACCATGGTATTCTTATTTATAT GCTGTGTCTGGAGCACTGGATGGTTTGGGAGATAAGACAGATTCTACATTCGGATTGGTT TCTATTCAGATTGCAAATTATAATCATTCTGATGAATATTTGTCTTTTAGTTGCTATTTG TCTGTCACAGAACAATCAGAATTTTATTTTCCCAGAGCTCCATTGAATTCAAATGCTATG TTATCCACTGAATCAATGATGAGCAGAATTGCAGCTGGAGACTTGGAGTCATCAGTGGAT GATCCTAGATCAGAGGAGGATAAAAGATTTGAGAGTCATATAGAATGTAGGAAACCCTAT AAAGAATTGAGATTGGAAGTTGGGAAACAAAGACTCAAGTATGCTCAGGAAGAATTGTCA AATGAAGTGCTCCCACCCCCTAGGAAAAAGAAGGGGTTGTTTTCACAAGCTAAAATTTCT CTTTTTTACACTGAGGAGCATGAAATAATGAAATTTTCTTGGAGAGGAGTGACTGCAGAT ACTAGAGCTTTAAGGAGGTTTGGATTTTCTTTGGCTGCTGGCAGAAGTGTGTGGACTCTT GAAATGGATGCTGGGGTTCTAACTGGGAGATTGATTAGACTGAATGATGAAAAATGGACA GAAATGAAGGATGACAAAATTGTTTCATTGATTGAAAAGTTTACAAGCAACAAATATTGG TCCAAAGTGAATTTCCCACATGGGATGTTGGATCTTGAAGAAATTGCTGCCAATTCCAAA GATTTTCCTAACATGTCTGAGACTGATCTGTGTTTCTTGCTGCATTGGTTAAATCCAAAG AAAATAAATTTAGCAGATAGAATGCTTGGATTGTCTGGAGTTCAGGAAATTAAAGAACAA GGTGTTGGATTAATAGCAGAATGTAGAACTTTCTTAGATTCTATTGCTGGAACTTTAAAA TCTATGATGTTTGGATTTCATCATTCTGTGACTGTTGAAATTATAAATACTGTGCTTTGT TTTGTTAAGAGTGGAATTTTGCTTTATGTTATGCAACAATTGAATCAGGATGAACACTCT CACATAATTGGTTTGTTGAGAGTTATGAATTATGTAGATATTGGTTGTTCAGTTATTTCA TGTGGCAAAGTTTTCTCCAAAATGCTGGAAACAGTCTTTAATTGGCAGATGGACTCTAGA ATGATGGAGTTGAGAACACAGAGTTTTTCCAACTGGTTAAGAGATATCTGTTCAGGGATC ACTATTTTTAAAAACTTCAAGGATGCAATTTATTGGCTCTATACAAAATTGAACGATTTT TATGAAGTGAATTATGGTAAGAAGAAGGATATTTTAAATATCCTTAAAGACAACCAACAA AAAATAGAGAAAGCCATTGAGGAAGCAGATAAATTTTCGATTTTGCAAATCCAAGATGTG GAAAAATTTGAACAGTATCAGAAAGGGGTTGACTTGATACAAAAATTGAGGACTGTTCAT TCAATGGCTCAGGTTGATCCTAATTTGATGGTTCATTTGTCACCTTTGAGAGACTGCATA GCAAGAGTTCATCAGAAACTCAAAAACCTTGGATCTATAAATCAGGCAATGGTAACAAGA TGTGAGCCAGTTGTTTGCTATTTGTATGGCAAAAGAGGAGGAGGAAAGAGTTTGACATCA ATTGCATTGGCAACTAAAATTTGTAAACATTATGGTGTTGAACCTGAGAAGAACATCTAT ACTAAACCTGTGGCTTCAGATTATTGGGATGGGTATAGTGGACAGTTAGTTTGCATCATT 
GATGATATCGGTCAAAATACAACAGATGAGGATTGGTCAGACTTTTGTCAGTTGGTGTCA GGATGCCCACTGAGATTAAATATGGCCTCTCTTGAAGAGAAGGGTAGGCATTTTTCTTCT CCTTTCATAATAGCAACTTCAAATTGGTCAAATCCAAGTCCAAAAACAGTTTATGTTAAG GAAGCCATTGATCGTAGACTCCATTTTAAGGTTGAAGTTAATCCTGCTTCATTTTCTAAA AATCCTCACAATGATATGTTGAATGTCAATTTAGCTAAAACAAATGATGCAATCAAGGAT ATGTCTTGTGTTGATCTGATAATGGATGGACATAATGTTTCATTGATGGACTTGCTCAGT TCCCTAGTTATGACAGTTGAAATCAGGAAACAAAACATGACTGCATTTATGGAGTTGTGG TCTCAGGGAATTTCAGATGATGATAATGATAGTGCAATGGCAGAGTTTTTTCAGTCTTTT CCATCTGGTGAACCATCGAATTCCAAATTATCTGGCTTTTTCCAATCTGTTACTAATCAC AAGTGGGTTGCTGTGGGAGCTGCAGTTGGTATTCTTGGAGTGCTCGTTGGGGGATGGTTT GTGTATAAGCATTTCTCCCGAAAAGAGGAAGAGCCAATTCCAGCTGAAGGGGTATATCAT GGTGTAACTAAGCCTAAGCAAGTGATTAAATTAGATGCAGATCCAGTAGAATCTCAGTCA ACTTTGGAAATAGCAGGACTGGTTAGGAAGAATTTGGTTCAGTTTGGAGTTGGAGAGAAG AATGGATGTGTGAGATGGGTTATGAATGCCTTAGGGGTGAAAGATGATTGGTTGCTTGTA CCTTCCCATGCTTACAAATTTGAGAAAGATTATGAAATGATGGAGTTTTATTTTAATAGA GGTGGAACTTACTATTCAATTTCAGCTGGTAATGTTGTCATTCAATCTTTGGATGTGGGA TTTCAGGATGTTGTTCTGATGAAGGTTCCTACAATTCCTAAGTTTAGAGATATTACCCAA CATTTTATTAAGAAGGGAGATGTGCCTAGAGCTTTGAATCGTCTGGCAACATTAGTGACA ACTGTGAATGGAACTCCTATGTTAATTTCTGAGGGGCCATTAAAGATGGAAGAGAAAGCT ACTTATGTTCATAAGAAAAATGATGGTACAACAGTTGATTTAACTGTGGACCAGGCATGG AGAGGAAAAGGCGAGGGTCTTCCTGGAATGTGTGGTGGGGCCTTGGTTTCATCAAATCAG TCTATACAGAATGCAATTTTGGGTATTCATGTTGCTGGAGGAAATTCAATTCTTGTTGCA AAATTGGTTACTCAAGAAATGTTCCAAAATATTGATAAGAAAATTGAAAGTCAGAGAATC ATGAAAGTGGAATTTACTCAGTGTTCAATGAATGTAGTCTCCAAAACGCTTTTTAGAAAG AGTCCCATTCATCATCACATTGATAAAACCATGATCAATTTTCCTGCAGCTATGCCTTTT TCTAAAGCCGAAATTGATCCAATGGCTATGATGTTATCTAAGTATTCATTACCCATTGTA GAAGAGCCAGAGGATTATAAAGAAGCTTCAATTTTTTATCAAAATAAAATAGTAGGCAAG ACTCAGCTAGTTGATGATTTTCTAGATCTTGATATGGCCATTACAGGGGCCCCAGGAATT GATGCTATTAATATGGATTCATCTCCTGGATTTCCTTATGTTCAAGAGAGGTTGACCAAA AGAGATTTAATTTGGTTGGATGAGAATGGTTTATTGCTGGGAGTTCATCCAAGATTGGCT CAGAGAATTTTGTTCAATACTGTCATGATGGAAAATTGTTCTGATTTGGATGTTGTTTTT ACTACTTGCCCAAAAGATGAATTGAGACCATTGGAGAAGGTGTTGGAATCAAAAACAAGA 
GCTATTGATGCTTGTCCTCTGGATTACACAATTTTGTGTCGAATGTACTGGGGTCCAGCT ATTAGTTATTTTCATTTGAATCCAGGGTTCCATACAGGTGTTGCTATTGGCATAGATCCT GACTGTCAGTGGGATGAATTATTTAAAACAATGATAAGATTTGGAGATGTCGGTCTTGAT TTAGATTTTTCTGCTTTTGATGCTAGTCTTAGTCCATTTATGATTAGGGAAGCAGGTAGA ATTATGAGTGAATTGTCTGGAACTCCATCCCATTTTGGAACAGCTCTCATGAATACTATC ATTTATTCTAAGCATTTGCTGTACAACTGTTGTTATCATGTTTGTGGTTCAATGCCTTCT GGGTCTCCTTGCACAGCTTTGCTGAATTCAATTATCAATAATGTCAATTTGTATTATGTG TTTTCCAAGATATTTGGAAAGTCTCCAGTTTTCTTTTGTCAGGCTTTGAAGATTCTCTGT TATGGAGATGATGTTCTTATAGTTTTTTCCCGAGATGTTCAGATTGATAATCTTGATTTG ATTGGACAAAAAATTGTAGATGAGTTTAAGAAACTTGGCATGACAGCTACTTCTGCTGAC AAAAATGTACCTCAGCTGAAGCCAGTTTCAGAATTGACTTTTCTCAAGAGATCTTTCAAT TTGGTAGAAGATAGGATCAGACCTGCAATTTCGGAAAAAACCATTTGGTCTTTGATAGCA TGGCAGAGAAGTAACGCTGAGTTTGAGCAGAATTTGGAAAATGCTCAGTGGTTTGCTTTT ATGCATGGCTATGAGTTTTATCAGAAATTTTATTATTTCGTTCAGTCTTGTTTGGAGAAA GAGATGATAGAATACAGACTAAAATCATATGATTGGTGGAGAATGAGATTCTATGACCAG TGCTTCATTTGTGACCTTTCATAA
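The gene sequence length (6684 bp) and the residue count (2227) reported in this record can be checked against each other: a coding sequence needs three nucleotides per residue plus one stop codon. The following is a minimal sketch, assuming the gene sequence is a single uninterrupted open reading frame that begins with ATG and ends with a standard stop codon. The helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, plus an optional stop codon."""
    return 3 * (n_residues + (1 if has_stop else 0))


def check_cds(cds: str, n_residues: int) -> bool:
    """True if cds starts with ATG, ends with a stop codon, and has the expected length."""
    cds = "".join(cds.split()).upper()  # drop the spaces and line breaks used in the record
    return (
        cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
        and len(cds) == expected_cds_length(n_residues)
    )


if __name__ == "__main__":
    # Values reported in this record: 2227 residues, 6684 bp.
    print(expected_cds_length(2227) == 6684)
```

To check the full record, pass the gene sequence text (without its `>lcl|...` header) to `check_cds` together with the residue count.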
- Chromosome Location
- Not Available
- Locus
- Not Available
- External Identifiers
- UniProtKB ID: P13901
- UniProtKB Entry Name: POLG_HAVMB
- GenBank Gene ID: M20273
- General References
- Paul AV, Tada H, von der Helm K, Wissel T, Kiehn R, Wimmer E, Deinhardt F: The entire nucleotide sequence of the genome of human hepatitis A virus (isolate MBB). Virus Res. 1987 Aug;8(2):153-71.
- Yin J, Bergmann EM, Cherney MM, Lall MS, Jain RP, Vederas JC, James MN: Dual modes of modification of hepatitis A virus 3C protease by a serine-derived beta-lactone: selective crystallization and formation of a functional catalytic triad in the active site. J Mol Biol. 2005 Dec 9;354(4):854-71. Epub 2005 Oct 14.