TCDB is operated by the Saier Lab Bioinformatics Group


1.A.53.1.6
Genome polyprotein [Cleaved into: Core protein p21 (Capsid protein C) (p21); Core protein p19; Envelope glycoprotein E1 (gp32) (gp35); Envelope glycoprotein E2 (NS1) (gp68) (gp70); p7; Protease NS2-3 (p23) (EC 3.4.22.-); Serine protease NS3 (EC 3.4.21.98) (EC 3.6.1.15) (EC 3.6.4.13) (Hepacivirin) (NS3P) (p70); Non-structural protein 4A (NS4A) (p8); Non-structural protein 4B (NS4B) (p27); Non-structural protein 5A (NS5A) (p56); RNA-directed RNA polymerase (EC 2.7.7.48) (NS5B) (p68)]

Accession Number: O92530
Protein Name: Genome polyprotein
Length: 3013
Molecular Weight: 328200.00
Species: Hepatitis C virus genotype 6d (isolate VN235) (HCV) [356422]
Number of TMSs: 15
Location / Topology / Orientation: Host endoplasmic reticulum membrane / Single-pass type I membrane protein
Substrate: cations

Cross database links:

Pfam: PF07652 PF01543 PF01542 PF01539 PF01560 PF01538 PF01006 PF01001 PF01506 PF08300 PF08301 PF12941 PF02907 PF00998

Gene Ontology

GO:0044167 C:host cell endoplasmic reticulum membrane
GO:0044186 C:host cell lipid particle
GO:0044191 C:host cell mitochondrial membrane
GO:0042025 C:host cell nucleus
GO:0044220 C:host cell perinuclear region of cytoplasm
GO:0020002 C:host cell plasma membrane
GO:0016021 C:integral to membrane
GO:0030529 C:ribonucleoprotein complex
GO:0019028 C:viral capsid
GO:0019031 C:viral envelope
GO:0055036 C:virion membrane
GO:0005524 F:ATP binding
GO:0008026 F:ATP-dependent helicase activity
GO:0004197 F:cysteine-type endopeptidase activity
GO:0005216 F:ion channel activity
GO:0003723 F:RNA binding
GO:0003968 F:RNA-directed RNA polymerase activity
GO:0004252 F:serine-type endopeptidase activity
GO:0070008 F:serine-type exopeptidase activity
GO:0005198 F:structural molecule activity
GO:0008270 F:zinc ion binding
GO:0006915 P:apoptotic process
GO:0030683 P:evasion by virus of host immune response
GO:0006508 P:proteolysis
GO:0006355 P:regulation of transcription, DNA-dependent
GO:0006351 P:transcription, DNA-dependent
GO:0019087 P:transformation of host cell by virus
GO:0019079 P:viral genome replication

References (4)

[1] “The entire nucleotide sequences of three hepatitis C virus isolates in genetic groups 7-9 and comparison with those in the other eight genetic groups.” Tokita H. et al. PMID: 9714232
[2] “Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes.” McLauchlan J. et al. PMID: 10718937
[3] “Structural biology of hepatitis C virus.” Penin F. et al. PMID: 14752815
[4] “An RNA-binding protein, hnRNP A1, and a scaffold protein, septin 6, facilitate hepatitis C virus replication.” Kim C.S. et al. PMID: 17229681

FASTA formatted sequence
1:	MSTLPKPQKR NQRNTNRRPQ DVKFPGGGQI VGGVYLLPRR GPRLGVRATR KTSERSQPRG 
61:	RRQPIPKARR QTGRTWAQPG YPWPLYGNEG CGWMGWLLSP RGSRPHWGPN DPRRRSRNLG 
121:	KVIDTLTCGF ADLMGYIPVV GAPLGGVAAA LAHGVRAVED GINYATGNLP GCSFSIFLLA 
181:	LLSCLTTPAS AVHYANKSGI YHLTNDCPNS SIVYEAEDFI MHLPGCVPCI KSGNGSSCWL 
241:	PATLTIAVPN ASIPVRGFRR HVDLMVGAAA FCSAMYVGDL CGGIFLVGQL FSFNPRRHWV 
301:	VQDCNCSIYV GHITGHRMAW DMMMNWSPTA TLVLSYVMRI PQVIMDIFTG GHWGILAGIL 
361:	YYSMVANWAK VLCILFLFAG VDATTRTTGA QAARATLGFT GLFQTGAKQN IHLINTNGSW 
421:	HINRTALNCN DSLNTGFMAA LFYLHKFNST GCPERLSACK SITQFAQGWG PVTYANVSGS 
481:	SEDRPYCWHY APRPCGVVSA RSVCGPVYCF TPSPVVVGTT DRRGVPTYTW GENESDVFLL 
541:	ESLRPPAGAW YGCTWMNSTG YTKTCGAPPC HIGPPDQFCP TDCFRKHPEA TYRKCGSGPW 
601:	LTPRCLVDYP YRLWHYPCTV NYTIHKVRLF INGLEHRFDA ACNWTRGERC ELEDRDRIEM 
661:	SPLLFSTTEL AILPCSFTTM PALSTGLVHL HQNIVDIQYL YGLAPALVSW AVRWEYVVLA 
721:	FLLLADARIC ACLWMVLLIS QVEAALENLI VLNAASAASS QGWIYCLVFI CCAWYIKGRV 
781:	VPGATYAILH LWPLLLLVLA LPQRAYAQDR EQGASIGVVV IAAITIFTLT PAYKTMLVHF 
841:	LWWNQYFIAR SEALIQQWVP SLRVRGGRDA VILLTCLLHP SLGFDITKML LALLGPLYLL 
901:	QVSLLRVPYY VRAHALLRVC ILVRRVAGGK YIQAALLKLG AWTGTYIYDH LAPLSTWASD 
961:	GLRDLAVAVE PVTFSPMEKK IITWGADTAA CGDILAGLPV SARLGHLLFL GPADDMKSMG 
1021:	WRLLAPITAY CQQTRGLLGT IVTSLTGRDR NVVEGEIQVL STATQSFLGT AINGVMWTVY 
1081:	HGAGSKTLAG PKGPVCQMYT NVDQDMVGWP APPGTRSLTP CTCGASDLYL VTRNADVIPA 
1141:	RRRGDTRAGL LSPRPLSTLK GSSGGPLMCP SDHVVGLFRA AVCTRGVAKA LDFVPVENME 
1201:	TTMRSPVFTD NSTPPAVPQT YQVGYLHAPT GSGKSTKVPA AYASQGYKVL VLNPSVAATL 
1261:	GFGSYMSTAH GIDPNIRTGV RTITTGGPIT YSTYGKFLAD GGCSGGAYDI IICDECHSTD 
1321:	PTTVLGIGTV LDQAETAGVR LTVLATATPP GSVTVPHPNI TETALPSTGE VPFYGKAIPL 
1381:	ECIKGGRHLI FCHSKKKCDE LAKQLRTLGL NAVAFYRGVD VSVIPTAGDV VVCATDALMT 
1441:	GYTGDFDSVI DCNVAVTQIV DFSLDPTFSI ETTTVPQDAV ARSQRRGRTG RGKPGVYRYV 
1501:	SQGERPSGMF DTVVLCEAYD VGCAWYELTP SETTVRLRAY LNTPGLPVCQ DHLEFWEGVF 
1561:	TGMTHIDAHF LSQTKQGGEN FAYLVAYQAT VCARAKAPPP SWDTMWKCLI RLKPMLTGPT 
1621:	PLLYRLGAVQ NEIITTHPIT KYIMTCMAAD LEVITSTWVL AGGIVAALAA YCLTVGSVVI 
1681:	CGRIVTSGKP VPLPDREVLY RQFDEMEECS RHIPYLAEGQ QIAEQFKQKI LGLLQNTAKQ 
1741:	AEDLKPAVQS AWPKLEQFWQ KHLWNFVSGV QYLAGLSTLP GNPAVASLMS FSAALTSPLS 
1801:	TSTTLLLNIL GGWVASQLAP PTASTAFVVS GLAGAAVGSI GLGKVIIDIL AGYGAGVSGA 
1861:	LVAFKIMSGE APAVEDMVNL LPALLSPGAL VVGVVCAAVL RRHVGPSEGA TQWMNRLIAF 
1921:	ASRGNHVSPT HYVPETDASR AVTTILSSLT ITSLLRRLHE WISGDWSAPC SCSWLKDVWD 
1981:	WVCTVLSDFK TWLRAKLVPT LPGIPFISCQ RGFRGVWRGD GVNYTTCSCG ANITGHVKNG 
2041:	SMKIVGPKMC SNVWNNRFPI NAITTGPSVP VPEPNYHKAL WRVSAEDYVE VVRVNDHHYI 
2101:	VGATADNLKC PCQVPAPEFF TEVDGVRLHR FAPPCRPLMR DDITFSVGLS TYVVGSQLPC 
2161:	EPEPDVVILT SMLTDPDHIT AETAARRLAR GSPPSLASSS ASQLSAPSLK ATCTTAGKHP 
2221:	DAELIEANLL WRQEVGGNIT RVESENKIIV LDSFDPLIAE TDDREISVGA ECFNPPRPKF 
2281:	PPALPVWARP DYNPPLLQPW KAPDYEPPLV HGCALPPKGL PPVPPPRKKR VVQLDEGSAK 
2341:	RALAELAQTS FPPSTATLSE DSGRETSTLS SDMTPPREEA DRASDDGSYS SMPPLEGEPG 
2401:	DPDLSSGSWS TVSEDHDSVV CCSMSYSWTG ALITPCAAEE EKLPISPLSN ALIRHHNLVY 
2461:	STTSRSASLR QKKVTFDRVQ VVDQHYYDVL KEIKTKASGV SAKLLSVEEA CALTPPHSAR 
2521:	SKFGYGAKEV RGLASKAVNH INSVWEDLLE DNSTPIPTTI MAKNEVFCVD AQKGGRKPAR 
2581:	LIVYPDLGVR VCEKRALYDV TQKLPIAVMG AAYGFQYSPK QRVDYLLKMW RSKKTPMGFS 
2641:	YDTRCFDSTV TERDIRTEED IYQCCQLDPV AKKAITSLTE RLYCGGPMYN SRGQSCGYRR 
2701:	CRASGVLTTS LGNTLTCYLK AQAACRAAKL KDFDMLVCGD DLVVISESMG VAEDASALRA 
2761:	FTEAMTRYSA PPGDDPQPEY DLELITSCSS NVSVAHDGAG QRYYYLTRDP LTPLSRAAWE 
2821:	TARHTPVNSW LGNIIMYAPT IWVRMVLMTH FFAILQSQEI LHKALDFDMY GVTYSVTPLD 
2881:	LPYIIQRLHG MAAFSLHGYS PGELNRVASC LRKLGAPPLR AWRHRARAVR AKLIAQGGKH 
2941:	AICGKYLFNW AVRTKLKLTP LRGAANLDLS GWFVSGGSGG DIFHSVSRAR PRNLLLCLLL 
3001:	LTVGVGIFLL PAR