Genome polyprotein
Details
- Name
- Genome polyprotein
- Kind
- protein
- Synonyms
- 3.6.1.15
- P2A
- Gene Name
- Not Available
- UniProtKB Entry
- P06441Swiss-Prot
- Organism
- HHAV
- NCBI Taxonomy ID
- 12099
- Amino acid sequence
>lcl|BSEQ0012706|Genome polyprotein MNMSKQGIFQTVGSGLDHILSLADIEEEQMIQSVDRTAVTGASYFTSVDQSSVHTAEVGS HQIEPLKTSVDKPGSKKTQGEKFFLIHSADWLTTHALFHEVAKLDVVKLLYNEQFAVQGL LRYHTYARFGIEIQVQINPTPFQQGGLICAMVPGDQSYGSIASLTVYPHGLLNCNINNVV RIKVPFIYTRGAYHFKDPQYPVWELTIRVWSELNIGTGTSAYTSLNVLARFTDLELHGLT PLSTQMMRNEFRVSTTENVVNLSNYEDARAKMSFALDQEDWKSDPSQGGGIKITHFTTWT SIPTLAAQFPFNASDSVGQQIKVIPVDPYFFQMTNTNPDQKCITALASICQMFCFWRGDL VFDFQVFPTKYHSGRLLFCFVPGNELIDVTGITLKQATTAPCAVMDITGVQSTLRFRVPW ISDTPYRVNRYTKSAHQKGEYTAIGKLIVYCYNRLTSPSNVASHVRVNVYLSAINLECFA PLYHAMDVTTQVGDDSGGFSTTVSTEQNVPDPQVGITTMRDLKGKANRGKMDVSGVQAPR GSYQQQLNDPVLAKKVPETFPELKPGESRHTSDHMSIYKFMGRSHFLCTFTFNSNNKEYT FPITLSSTSNPPHGLPSTLRWFFNLFQLYRGPLDLTIIITGATDVDGMAWFTPVGLAVDP WVEKESALSIDYKTALGAVRFNTRRTGNIQIRLPWYSYLYAVSGALDGLGDKTDSTFGLF LFEIANYNHSDEYLSFSCYLSVTEQSEFYFPRAPLNSNAMLSTESMMSRIAAGDLESSVD DPRSEEDRRFESHIECRKPYKELRLEVGKQRLKYAQEELSNEVLPPPRKMKGLFSQAKIS LFYTEEHEIMKFSWRGVTADTRALRRFGFSLAAGRSVWTLEMDAGVLTGRLIRLNDEKWT EMKDDKIVSLIEKFTSNKYWSKVNFPHGMLDLEEIAANSKDFPNMSETDLCFLLHWLNPK KINLADRMLGLSGVQEIKEQGVGLIAECRTFLDSIAGTLKSMMFGFHHSVTVEIINTVLC FVKSGILLYVIQQLNQDEHSHIIGLLRVMNYADIGCSVISCGKVFSKMLETVFNWQMDSR MMELRTQSFSNWLRDICSGITIFKSFKDAIYWLYTKLKDFYEVNYGKKKDILNILKDNQQ KIEKAIEEADNFCILQIQDVEKFDQYQKGVDLIQKLRTVHSMAQVDPNLGVHLSPLRDCI ARVHQKLKNLGSINQAMVTRCEPVVCYLYGKRGGGKSLTSIALATKICKHYGVEPEKNIY TKPVASDYWDGYSGQLVCIIDDIGQNTTDEDWSDFCQLVSGCPMRLNMASLEEKGRHFSS PFIIATSNWSNPSPKTVYVKEAIDRRLHFKVEVKPASFFKNPHNDMLNVNLAKTNDAIKD MSCVDLIMDGHNISLMDLLSSLVMTVEIRKQNMSEFMELWSQGISDDDNDSAVAEFFQSF PSGEPSNWKLSSFFQSVTNHKWVAVGAAVGILGVLVGGWFVYKHFSRKEEEPIPAEGVYH GVTKPKQVIKLDADPVESQSTLEIAGLVRKNLVQFGVGEKNGCVRWVMNALGVKDDWLLV PSHAYKFEKDYEMMEFYFNRGGTYYSISAGNVVIQSLDVGFQDVVLMKVPTIPKFRDITQ HFIKKGDVPRALNRLATLVTTVNGTPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAW RGKGEGLPGMCGGALVSSNQSIQNAILGIHVAGGNSILVAKLVTQEMFQNIDKKIESQRI MKVEFTQCSMNVVSKTLFRKSPIHHHIDKTMINFPAAMPFSKAEIDPMAMMLSKYSLPIV EEPEDYKEASVFYQNKIVGKTQLVDDFLDLDMAITGAPGIDAINMDSSPGFPYVQEKLTK RDLIWLDENGLLLGVHPRLAQRILFNTVMMENCSDLDVVFTTCPKDELRPLEKVLESKTR AIDACPLDYTILCRMYWGPAISYFHLNPGFHTGVAIGIDPDRQWDELFKTMIRFGDVGLD LDFSAFDASLSPFMIREAGRIMSELSGTPSHFGTALINTIIYSKHLLYNCCYHVCGSMPS GSPCTALLNSIINNINLYYVFSKIFGKSPVFFCQALRILCYGDDVLIVFSRDVQIDNLDL IGQKIVDEFKKLGMTATSADKNVPQLKPVSELTFLKRSFNLVEDRIRPAISEKTIWSLMA WQRSNAEFEQNLENAQWFAFMHGYEFYQKFYYFVQSCLEKEMIEYRLKSYDWWRMRFYDQ CFICDLS
- Number of residues
- 2227
- Molecular Weight
- 251897.82
- Theoretical pI
- 6.56
- GO Classification
- FunctionsATP binding / cysteine-type endopeptidase activity / ion channel activity / RNA binding / RNA helicase activity / RNA-directed RNA polymerase activity / structural molecule activityProcessespore formation by virus in membrane of host cell / protein oligomerization / RNA-protein covalent cross-linking / suppression by virus of host gene expression / suppression by virus of host MAVS activity / suppression by virus of host MAVS activity by MAVS proteolysis / transcription, DNA-templated / viral entry into host cell / viral RNA genome replication / virion attachment to host cellComponentshost cell cytoplasmic vesicle membrane / host cell mitochondrial outer membrane / integral to membrane of host cell / membrane / viral capsid
- General Function
- Capsid protein VP1 Capsid proteins VP1, VP2, and VP3 form a closed capsid enclosing the viral positive strand RNA genome. All these proteins contain a beta-sheet structure called beta-barrel jelly roll. Together they form an icosahedral capsid (T=3) composed of 60 copies of each VP1, VP2, and VP3, with a diameter of approximately 300 Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3 are located at the quasi-sixfold axes. The naked capsid interacts with the host receptor HAVCR1 to provide virion attachment to and probably entry into the target cell.
- Specific Function
- ATP binding
- Pfam Domain Function
- Signal Regions
- Not Available
- Transmembrane Regions
- Not Available
- Cellular Location
- Virion
- Gene sequence
>lcl|BSEQ0007431|6684 bp ACTCAGGGGCATTTAGGTTTTTCCTCATTCTTAAACAATAATGAATATGTCCAAACAAGG AATTTTCCAGACTGTTGGGAGTGGCCTTGACCACATCCTGTCTTTGGCAGATATTGAGGA AGAGCAAATGATTCAGTCCGTTGATAGGACTGCAGTGACTGGAGCTTCTTACTTCACTTC TGTGGACCAATCTTCAGTTCATACTGCTGAGGTTGGCTCACATCAAATTGAACCTTTGAA AACCTCTGTTGATAAACCTGGTTCTAAGAAAACTCAGGGGGAAAAGTTTTTCCTGATTCA TTCTGCTGATTGGCTCACTACACATGCTCTCTTTCATGAAGTTGCAAAATTGGATGTGGT GAAACTACTGTATAATGAGCAGTTTGCCGTCCAAGGTTTGTTGAGATACCATACATATGC AAGATTTGGCATTGAGATTCAAGTTCAGATAAATCCCACACCCTTTCAGCAAGGAGGACT AATTTGTGCCATGGTTCCTGGTGACCAAAGTTATGGTTCAATAGCATCCTTGACTGTTTA TCCTCATGGTCTGTTAAATTGCAATATCAACAATGTAGTTAGAATAAAGGTTCCATTTAT TTATACTAGAGGTGCTTATCATTTTAAAGATCCACAGTACCCAGTTTGGGAATTGACAAT CAGAGTTTGGTCAGAGTTGAATATTGGAACAGGAACTTCAGCTTACACTTCACTCAATGT TTTAGCTAGGTTTACAGATTTGGAGTTGCATGGATTAACTCCTCTTTCTACACAGATGAT GAGAAATGAATTTAGGGTCAGTACTACTGAAAATGTTGTAAATTTGTCAAATTATGAAGA TGCAAGGGCAAAAATGTCTTTTGCTTTGGATCAGGAAGATTGGAAGTCTGATCCTTCCCA AGGTGGTGGAATTAAAATTACTCATTTTACTACCTGGACATCCATTCCAACCTTAGCTGC TCAGTTTCCATTTAATGCTTCAGATTCAGTTGGACAACAAATTAAAGTTATTCCAGTGGA CCCATACTTTTTCCAAATGACAAACACTAATCCTGATCAAAAATGTATAACTGCCTTGGC CTCTATTTGTCAGATGTTCTGCTTTTGGAGGGGAGATCTTGTTTTTGATTTTCAGGTTTT TCCAACCAAATATCATTCAGGTAGACTGTTGTTTTGTTTTGTTCCTGGGAATGAGTTAAT AGATGTTACTGGAATTACATTAAAACAGGCAACTACTGCTCCTTGTGCAGTGATGGACAT TACAGGAGTGCAGTCAACCTTGAGATTTCGTGTTCCTTGGATTTCTGATACACCTTATCG AGTGAATAGGTACACGAAGTCAGCACATCAAAAAGGTGAGTACACTGCCATTGGGAAGCT TATTGTGTATTGTTATAACAGACTGACTTCTCCTTCTAATGTTGCCTCTCATGTTAGAGT TAATGTTTATCTTTCAGCAATTAATTTGGAATGTTTTGCTCCTCTTTACCATGCTATGGA TGTTACTACACAGGTTGGAGATGATTCAGGAGGTTTCTCAACAACAGTTTCTACAGAGCA GAATGTTCCTGATCCCCAAGTTGGGATAACAACCATGAGGGATTTAAAAGGAAAAGCCAA TAGGGGAAAGATGGATGTTTCAGGAGTGCAAGCACCTCGTGGGAGCTATCAGCAACAATT GAACGATCCAGTTTTAGCAAAGAAAGTACCTGAGACATTTCCTGAATTGAAGCCTGGAGA GTCCAGACATACATCAGATCACATGTCTATTTATAAATTCATGGGAAGGTCTCATTTTTT GTGCACTTTTACTTTCAATTCAAATAATAAAGAGTACACATTTCCAATAACCCTGTCTTC GACTTCTAATCCTCCTCATGGTTTACCATCAACATTAAGGTGGTTCTTCAATTTGTTTCA GTTGTATAGAGGACCATTGGATTTAACAATTATAATCACAGGAGCCACTGATGTGGATGG TATGGCCTGGTTTACTCCAGTGGGCCTTGCTGTCGACCCTTGGGTGGAAAAGGAGTCAGC TTTGTCTATTGATTATAAAACTGCCCTTGGAGCTGTTAGATTTAATACAAGAAGAACAGG AAACATTCAAATTAGATTGCCGTGGTATTCTTATTTGTATGCCGTGTCTGGAGCACTGGA TGGCTTGGGGGATAAGACAGATTCTACATTTGGATTGTTTCTATTCGAGATTGCAAATTA CAATCATTCTGATGAATATTTGTCCTTCAGTTGTTATTTGTCTGTCACAGAGCAATCAGA GTTCTATTTTCCTAGAGCTCCATTAAATTCAAATGCTATGTTGTCCACTGAATCCATGAT GAGTAGAATTGCAGCTGGAGACTTGGAGTCATCAGTGGATGATCCCAGATCAGAGGAGGA TAGAAGATTTGAGAGTCATATAGAATGTAGGAAACCATACAAAGAATTGAGACTGGAGGT TGGGAAACAAAGACTCAAATATGCTCAGGAAGAGTTATCAAATGAAGTGCTTCCACCTCC TAGGAAAATGAAGGGGTTATTTTCACAAGCTAAAATTTCTCTTTTTTATACTGAGGAGCA TGAAATAATGAAGTTTTCTTGGAGAGGAGTGACTGCTGATACTAGGGCTTTGAGAAGATT TGGATTCTCTCTGGCTGCTGGTAGAAGTGTGTGGACTCTTGAAATGGATGCTGGAGTTCT TACTGGAAGATTGATCAGATTGAATGATGAGAAATGGACAGAAATGAAGGATGATAAGAT TGTTTCATTAATTGAAAAGTTCACAAGCAATAAATATTGGTCTAAAGTGAATTTTCCACA TGGAATGTTGGATCTTGAAGAAATTGCTGCCAATTCTAAGGATTTTCCAAATATGTCTGA GACAGATTTGTGTTTCCTGTTACATTGGCTAAATCCAAAGAAAATCAATTTAGCAGATAG AATGCTTGGATTGTCTGGAGTGCAGGAAATTAAGGAACAGGGTGTTGGACTGATAGCAGA GTGTAGAACTTTCTTGGATTCTATTGCTGGGACTTTGAAATCTATGATGTTTGGGTTTCA TCATTCTGTGACTGTTGAAATTATAAATACTGTGCTTTGTTTTGTTAAGAGTGGAATCCT GCTTTATGTCATACAACAATTGAACCAAGATGAACACTCTCACATAATTGGTTTGTTGAG AGTTATGAATTATGCAGATATTGGCTGTTCAGTTATTTCATGTGGTAAAGTTTTTTCCAA AATGTTAGAAACAGTTTTTAATTGGCAAATGGATTCTAGAATGATGGAGCTGAGGACTCA GAGCTTCTCTAATTGGTTAAGAGATATTTGTTCAGGAATTACTATTTTTAAAAGTTTTAA GGATGCCATATATTGGTTATATACAAAATTGAAGGATTTTTATGAAGTAAATTATGGCAA GAAAAAGGATATTCTTAATATTCTCAAAGATAATCAGCAAAAAATAGAAAAAGCCATTGA AGAAGCAGACAATTTTTGCATTTTGCAAATTCAAGATGTAGAGAAATTTGATCAGTATCA GAAAGGGGTTGATTTAATACAAAAGCTGAGAACTGTCCATTCAATGGCGCAAGTTGACCC CAATTTGGGGGTTCATTTGTCACCTCTCAGAGATTGCATAGCAAGAGTCCACCAAAAGCT CAAGAATCTTGGATCTATAAATCAGGCCATGGTAACAAGATGTGAGCCAGTTGTTTGCTA TTTGTATGGCAAAAGAGGGGGAGGGAAAAGCTTGACTTCAATTGCATTGGCAACCAAAAT TTGTAAACACTATGGTGTTGAACCTGAGAAAAATATTTACACCAAACCTGTGGCCTCAGA TTATTGGGATGGATATAGTGGACAATTAGTTTGCATTATTGATGATATTGGCCAAAACAC AACAGATGAAGATTGGTCAGATTTTTGTCAATTAGTGTCAGGATGCCCAATGAGATTGAA TATGGCTTCTCTAGAGGAGAAGGGCAGACATTTTTCCTCTCCTTTTATAATAGCAACTTC AAATTGGTCAAATCCAAGTCCAAAAACAGTTTATGTTAAGGAAGCAATTGATCGTAGGCT TCATTTTAAGGTTGAAGTTAAACCTGCTTCATTTTTTAAAAATCCTCACAATGATATGTT GAATGTTAATTTGGCCAAAACAAATGATGCAATTAAGGACATGTCTTGTGTTGATTTAAT AATGGATGGACACAATATTTCATTGATGGATTTACTTAGTTCCTTAGTGATGACAGTTGA AATTAGGAAACAGAATATGAGTGAATTCATGGAGTTGTGGTCTCAGGGAATTTCAGATGA TGACAATGATAGTGCAGTGGCTGAGTTTTTCCAGTCTTTTCCATCTGGTGAACCATCAAA TTGGAAGTTATCTAGTTTTTTCCAATCTGTCACTAATCACAAGTGGGTTGCTGTGGGAGC TGCAGTTGGCATTCTTGGAGTGCTTGTGGGAGGATGGTTTGTGTATAAGCATTTTTCCCG CAAAGAGGAAGAACCAATTCCAGCTGAAGGGGTTTATCATGGCGTGACTAAGCCCAAACA AGTGATTAAATTGGATGCAGATCCAGTAGAGTCCCAGTCAACTCTAGAAATAGCAGGATT AGTTAGGAAAAATCTGGTTCAGTTTGGAGTTGGTGAGAAAAATGGATGTGTGAGATGGGT CATGAATGCCTTAGGAGTGAAGGATGATTGGTTGTTAGTACCTTCTCATGCTTATAAATT TGAAAAGGATTATGAAATGATGGAGTTTTACTTCAATAGAGGTGGAACTTACTATTCAAT TTCAGCTGGTAATGTTGTTATTCAATCTTTAGATGTGGGATTTCAAGATGTTGTTTTAAT GAAGGTTCCTACAATTCCCAAGTTTAGAGATATTACTCAACACTTTATTAAGAAAGGAGA TGTGCCTAGAGCCTTAAATCGCTTGGCAACATTAGTGACAACCGTTAATGGAACTCCTAT GTTAATTTCTGAGGGACCATTAAAGATGGAAGAAAAAGCCACTTATGTTCATAAGAAGAA TGATGGTACTACAGTTGATTTGACTGTAGATCAGGCATGGAGAGGAAAAGGTGAAGGTCT TCCTGGAATGTGTGGTGGGGCCCTAGTGTCATCAAATCAGTCCATACAGAATGCAATTTT GGGTATTCATGTTGCTGGAGGAAATTCAATTCTTGTGGCAAAGCTGGTTACTCAAGAAAT GTTTCAAAACATTGATAAGAAAATTGAAAGTCAGAGAATAATGAAAGTGGAATTTACTCA ATGTTCAATGAATGTAGTCTCCAAAACGCTTTTTAGAAAGAGTCCCATTCATCACCACAT TGATAAAACCATGATTAATTTTCCTGCAGCTATGCCTTTCTCTAAAGCTGAAATTGATCC AATGGCTATGATGTTGTCCAAATATTCATTACCTATTGTGGAGGAACCAGAGGATTACAA GGAAGCTTCAGTTTTTTATCAAAACAAAATAGTAGGCAAGACTCAGCTAGTTGATGACTT TTTAGATCTTGATATGGCTATTACAGGGGCTCCAGGCATTGATGCTATCAATATGGATTC ATCTCCTGGGTTTCCTTATGTTCAAGAAAAATTGACCAAAAGAGATTTAATTTGGTTGGA TGAAAATGGTTTGCTGTTAGGAGTTCACCCAAGATTGGCCCAGAGAATTTTATTTAATAC TGTCATGATGGAAAATTGTTCTGACTTAGATGTTGTTTTTACAACTTGTCCAAAAGATGA ATTGAGACCATTAGAGAAAGTTTTGGAATCAAAAACAAGAGCCATTGATGCTTGTCCTTT GGATTATACAATTCTATGTCGAATGTATTGGGGTCCAGCTATCAGTTATTTCCATTTGAA TCCAGGGTTTCACACAGGTGTTGCTATTGGCATAGATCCTGATAGACAGTGGGATGAATT ATTTAAAACAATGATAAGATTTGGAGATGTTGGTCTTGATTTAGATTTCTCTGCTTTTGA TGCCAGTCTTAGTCCATTTATGATTAGGGAAGCAGGTAGAATCATGAGTGAATTATCTGG AACACCATCTCATTTTGGAACAGCTCTTATCAATACTATCATTTATTCTAAACATCTGCT GTACAACTGTTGTTATCATGTTTGTGGTTCAATGCCTTCTGGGTCTCCTTGCACAGCTTT GTTGAATTCAATTATTAATAATATTAATCTGTATTATGTGTTTTCTAAAATATTTGGAAA GTCTCCAGTTTTCTTTTGTCAAGCTTTGAGGATCCTTTGTTACGGAGATGATGTTTTGAT AGTTTTTTCCAGAGATGTTCAAATTGACAATCTTGACTTGATTGGACAGAAAATTGTAGA TGAGTTCAAAAAACTTGGCATGACAGCCACCTCAGCTGATAAAAATGTGCCTCAACTGAA GCCAGTTTCAGAATTGACTTTTCTCAAAAGATCTTTCAATTTGGTGGAGGATAGAATTAG ACCTGCAATTTCAGAAAAGACAATTTGGTCTTTGATGGCTTGGCAGAGAAGTAACGCTGA GTTTGAGCAGAATTTAGAAAATGCTCAGTGGTTTGCTTTTATGCATGGCTATGAGTTCTA TCAGAAATTTTATTATTTTGTTCAGTCCTGTTTGGAGAAAGAGATGATAGAATATAGACT TAAATCTTATGATTGGTGGAGAAT
- Chromosome Location
- Not Available
- Locus
- Not Available
- External Identifiers
Resource Link UniProtKB ID P06441 UniProtKB Entry Name POLG_HAVLA GenBank Protein ID 329597 GenBank Gene ID K02990 PDB ID(s) 2H6M, 2H9H - General References
- Najarian R, Caput D, Gee W, Potter SJ, Renard A, Merryweather J, Van Nest G, Dina D: Primary structure and gene organization of human hepatitis A virus. Proc Natl Acad Sci U S A. 1985 May;82(9):2627-31. [Article]
Associated Data
- Drug Relations
Drug Drug group Pharmacological action? Type Actions Details N-BENZYLOXYCARBONYL-L-SERINE-BETALACTONE experimental unknown target Details