TCDB is operated by the Saier Lab Bioinformatics Group

1.A.53.1.7
Genome polyprotein [Cleaved into: Core protein p21 (Capsid protein C) (p21); Core protein p19; Envelope glycoprotein E1 (gp32) (gp35); Envelope glycoprotein E2 (NS1) (gp68) (gp70); p7; Protease NS2-3 (p23) (EC 3.4.22.-); Serine protease NS3 (EC 3.4.21.98) (EC 3.6.1.15) (EC 3.6.4.13) (Hepacivirin) (NS3P) (p70); Non-structural protein 4A (NS4A) (p8); Non-structural protein 4B (NS4B) (p27); Non-structural protein 5A (NS5A) (p56); RNA-directed RNA polymerase (EC 2.7.7.48) (NS5B) (p68)]

Accession Number:O39927
Protein Name:Genome polyprotein
Length:3018
Molecular Weight:329020.00
Species:Hepatitis C virus genotype 6a (isolate EUHK2) (HCV) [356420]
Number of TMSs:12
Location / Topology / Orientation: Host endoplasmic reticulum membrane / Single-pass type I membrane protein
Substrate:cations

Cross database links:

Pfam: PF07652 PF01543 PF01542 PF01539 PF01560 PF01538 PF01006 PF01001 PF01506 PF08300 PF08301 PF12941 PF02907 PF00998

Gene Ontology

GO:0044167 C:host cell endoplasmic reticulum membrane
GO:0044186 C:host cell lipid particle
GO:0044191 C:host cell mitochondrial membrane
GO:0042025 C:host cell nucleus
GO:0044220 C:host cell perinuclear region of cytoplasm
GO:0020002 C:host cell plasma membrane
GO:0016021 C:integral to membrane
GO:0030529 C:ribonucleoprotein complex
GO:0019028 C:viral capsid
GO:0019031 C:viral envelope
GO:0055036 C:virion membrane
GO:0005524 F:ATP binding
GO:0008026 F:ATP-dependent helicase activity
GO:0004197 F:cysteine-type endopeptidase activity
GO:0005216 F:ion channel activity
GO:0003723 F:RNA binding
GO:0003968 F:RNA-directed RNA polymerase activity
GO:0004252 F:serine-type endopeptidase activity
GO:0070008 F:serine-type exopeptidase activity
GO:0005198 F:structural molecule activity
GO:0008270 F:zinc ion binding
GO:0006915 P:apoptotic process
GO:0030683 P:evasion by virus of host immune response
GO:0006508 P:proteolysis
GO:0006355 P:regulation of transcription, DNA-dependent
GO:0006351 P:transcription, DNA-dependent
GO:0019087 P:transformation of host cell by virus
GO:0019079 P:viral genome replication
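Each Gene Ontology line above follows the pattern "GO identifier, space, one-letter aspect code, colon, term name", where C, F and P denote cellular component, molecular function and biological process. A minimal sketch of splitting such a line into its parts, assuming exactly this layout (the names `GoAnnotation`, `ASPECTS` and `parse_go_line` are illustrative and not part of TCDB):

```python
from typing import NamedTuple

# One-letter aspect codes as used in the list above.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str


def parse_go_line(line: str) -> GoAnnotation:
    """Split 'GO:0005216 F:ion channel activity' into id, aspect and term."""
    # The identifier ends at the first space; the rest is 'X:term name'.
    go_id, rest = line.strip().split(" ", 1)
    # Split only on the first colon so term names keep any later punctuation.
    code, term = rest.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[code], term)


example = parse_go_line("GO:0005216 F:ion channel activity")
print(example.go_id, "|", example.aspect, "|", example.term)
```

Splitting on the first space and the first colon only keeps term names such as "regulation of transcription, DNA-dependent" intact.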

References (4)

[1] “Complete coding sequence of hepatitis C virus genotype 6a.” Adams A. et al. PMID: 9177282
[2] “Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes.” McLauchlan J. et al. PMID: 10718937
[3] “Structural biology of hepatitis C virus.” Penin F. et al. PMID: 14752815
[4] “An RNA-binding protein, hnRNP A1, and a scaffold protein, septin 6, facilitate hepatitis C virus replication.” Kim C.S. et al. PMID: 17229681

FASTA formatted sequence
1:	MSTLPKPQRK TKRNTNRRPM DVKFPGGGQI VGGVYLLPRK GPRLGVRATR KTSERSQPRG 
61:	RRQPIPKARQ PQGRHWAQPG YPWPLYGSEG CGWAGWLLSP RGSRPHWGPN DPRRRSRNLG 
121:	KVIDTLTCGF ADLMWYIPVV GAPLGGVAAA LAHGVRAIED GINYATGNLP GCSFSIFLLA 
181:	LLSCLTTPAS ALTYGNSSGL YHLTNDCSNS SIVLEADAMI LHLPGCLPCV RVGNQSTCWH 
241:	AVSPTLATPN ASTPATGFRR HVDLLAGAAV VCSSLYIGDL CGSLFLAGQL FAFQPRRHWT 
301:	VQDCNCSIYT GHVTGHKMAW DMMMNWSPTT TLVLSSILRV PEICASVIFG GHWGILLAVA 
361:	YFGMAGNWLK VLAVLFLFAG VEAQTMIAHG VSQTTSGFAS LLTPGAKQNI QLINTNGSWH 
421:	INRTALNCND SLQTGFLASL FYTHKFNSSG CPERMAACKP LAEFRQGWGQ ITHKNVSGPS 
481:	DDRPYCWHYA PRPCEVVPAR SVCGPVYCFT PSPVVVGTTD KRGNPTYTWG ENETDVFMLE 
541:	SLRPPTGGWF GCTWMNSTGF TKTCGAPPCQ IVPGNYNSSA NELLCPTDCF RKHPEATYQR 
601:	CGSGPWVTPR CLVDYAYRLW HYPCTVNFTL HKVRMFVGGT EHRFDVACNW TRGERCELHD 
661:	RNRIEMSPLL FSTTQLSILP CSFSTMPALS TGLIHLHQNI VDVQYLYGVS TNVTSWVVKW 
721:	EYIVLMFLVL ADARICTCLW LMLLISTVEA AVERLVVLNA ASAAGTAGWW WAVLFLCCVW 
781:	YVKGRLVPAC TYMALGMWPL LLTILALPPR AYAMDNEQAA SLGAVGLLVI TIFSITPMYK 
841:	KLLNCFIWWN QYFLARAEAM VHEWVPDLRV RGGRDSIILL TCLLHPQLGF EVTKILLAVL 
901:	APLYILQYSL LKVPYFVRAH ILLRACLLVR RLAGGKYVQA CLLRLGAWTG TFVYDHLAPL 
961:	SDWASDGLRD LAVAVEPVIF SPMEKKIITW GADTAACGDI LSGLPVSARL GNLVLLGPAD 
1021:	DMQRGGWKLL APITAYAQQT RGLVGTIVTS LTGRDKNEVE GEVQVVSTDT QSFVATSING 
1081:	VMWTVYHGPG FKTLAGPKGP VCQMYTNVDL DLVGWPSPPG ARSLTPCNCG SSDLYLVTRE 
1141:	ADVIPARRRG DSRAALLSPR PISTLKGSSG GPIMCPSGHV VGLFRAAVCT RGVAKSLDFI 
1201:	PVENMETTMR SPSFTDNSTP PAVPQTYQVG YLHAPTGSGK STRVPAAYAS QGYKVLVLNP 
1261:	SVAATLSFGS YMRQAYGVEP NIRTGVRTVT TGGAITYSTY GEFLADGGCS GGAYDIIICD 
1321:	ECHSTDPTTV LGVGTVLDQA ETAGVRLTVL PTATPPGSVT VPHPNITETA LPTTGEIPFY 
1381:	GKAIPLEYIK GGRHLIFCHS KKKCDELAGK LKSLGLNAVA FYRGVDVSVI PTSGDVVVCA 
1441:	TDALMTGYTG DFDSVIDCNV AVTQVVDFSL DPTFSIETTT VPQDAVSRSQ RRGRTGRGKP 
1501:	GVYRFVSQGE RPSGMFDTVV LCEAYDTGCA WYELTPSETT VRLRAYMNTP GLPVCQDHLE 
1561:	FWEGVFTGLT HIDAHFLSHT KQAGENFAYL VAYQATVCAR AKAPPPSWDM MWKCLIRLKP 
1621:	TLTGPTPLLY RLGAVQNGVI TTHPITKYIM TCMSADLEVI TSTWVLVGGV LAALAAYCLS 
1681:	VGCVVICGRI TLTGKPAVVP DREILYQQFD EMEECSRHIP YLAEGQQIAE QFRQKVLGLL 
1741:	QASAKQAEEL KPAVHSAWPR VEDFWRKHMW NFVSGIQYLA GLSTLPGNPA VASLMSFTAS 
1801:	LTSPLRTSQT LLLNILGGWI AAQVAPPPAS TAFVVSGLAG AAVGSIRLGR VLVDVLAGYG 
1861:	AGVSGALVAF KIMSGECPST EDMVNLLPAL LSPGVALVGV VCAAILRRHV GPAEGANQWM 
1921:	NRLIAFASRG NHVSPTHYVP ETDASKNVTQ ILTSLTITSL LRRLHQWVNE DTATPCATSW 
1981:	LRDVWDWVCT VLSDFKVWLQ AKLFPRLPGI PFLSCQAGYR GVWAGDGVCH TTCTCGAVIA 
2041:	GHVKNGTMKI TGPKTCSNTW HGTFPINATT TGPSTPRPAP NYQRALWRVS AEDYVEVRRL 
2101:	GDCHYVVGVT AEGLKCPCQV PAPEFFTEVD GVRIHRYAPP CKPLLRDEVT FSVGLSNYAV 
2161:	GSQLPCEPEP DVTVVTSMLT DPTHITAETA ARRLKKGSPP SLASSSANQL SAPSLRATCT 
2221:	TSQKHPEMEL LQANLLWKHE MGSHIPRVQS ENKVVVLDSF ELYPLEYEER EISVSVECHR 
2281:	QPRCKFPPVF PVWARPDNNP PFIQAWQMPG YEPPVVSGCA VAPPKPAPVP PPRRKRLVHL 
2341:	DESTVSHALA QLADKVFVES SNDPGPSSDS GLSITSPVPP DPTTPEDAGS EAESYSSMPP 
2401:	LEGEPGDPDL SSGSWSTVSD EDDVVCCSMS YSWTGALITP CAAEEEKLPI NPLSNSLVRH 
2461:	HNMVYSTTSR SASLRQKKVT FDRVQVFDQH YQDVLKEIKL RASTVQAKLL SIEEACDLTP 
2521:	SHSARSKYGY GAQDVRSRAS KAVDHIPSVW EGLLEDSDTP IPTTIMAKNE VFCVDPSKGG 
2581:	RKPARLIVYP DLGVRVCEKM ALYDVTQKLP QAVMGPAYGF QYSPNQRVEY LLKMWRSKKV 
2641:	PMGFSYDTRC FDSTVTERDI RTENDIYQSC QLDPVARRVV SSLTERLYVG GPMANSKGQS 
2701:	CGYRRCRASG VLPTSMGNTL TCYLKAQAAC RAANIKDCDM LVCGDDLVVI CESAGVQEDT 
2761:	ASLRAFTDAM TRYSAPPGDA PQPTYDLELI TSCSSNVSVA HEGNGKKYYY LTRDCTTPLA 
2821:	RAAWETARHT PVNSWLGNII MFAPTIWVRM VLMNHFFSIL QSQEQLEKAF DFDIYGVTYS 
2881:	VSPLDLPAII QRLHGMAAFS LHGYSPVELN RVGACLRKLG VLPSRAWRHR ARAVRAKLIA 
2941:	QGGKAAICGK YLFNWAVKTK LKLTPLVSAS KLDLSGWFVA GYDGGDIYHS VSQARPRFLL 
3001:	LGLLLLTVGV GIFLLPAR
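The listing above prefixes each row with the position of its first residue and groups residues in blocks of ten, so it has to be flattened before it can be used as a plain sequence string. A minimal sketch of that flattening, assuming every row has the "position, colon, whitespace, residue groups" layout shown here (the function name `parse_numbered_sequence` is illustrative):

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a position-numbered sequence listing into one residue string."""
    residues = []
    for line in block.splitlines():
        line = line.strip()
        if not line:
            continue
        # Drop the leading "NNN:" position label, if present.
        line = re.sub(r"^\d+:\s*", "", line)
        # Remove the spaces between the ten-residue groups.
        residues.append("".join(line.split()))
    return "".join(residues)


# First and last rows of the listing above, copied verbatim.
first_line = "1:\tMSTLPKPQRK TKRNTNRRPM DVKFPGGGQI VGGVYLLPRK GPRLGVRATR KTSERSQPRG"
last_line = "3001:\tLGLLLLTVGV GIFLLPAR"

print(len(parse_numbered_sequence(first_line)))  # 60
print(len(parse_numbered_sequence(last_line)))   # 18
```

The last row starts at position 3001 and holds 18 residues, which agrees with the Length field of 3018. Running the function over the full listing and comparing the result's length with that field is a quick check that no row was lost in copying.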