TCDB is operated by the Saier Lab Bioinformatics Group
« See all members of the family


1.G.20.1.3
M polyprotein, Gn-Gc-NSm, of 1403 aas and 5 - 7 TMSs.

Accession Number:J4FBX9
Protein Name:M polyprotein
Length:1403
Molecular Weight:158387.00
Species:Shamonda virus [159150]
Number of TMSs:7
Substrate small molecules

Cross database links:

External Searches:

  • Search: DB with
  • BLAST ExPASy (Swiss Institute of Bioinformatics (SIB) BLAST)
  • CDD Search (Conserved Domain Database)
  • Search COGs (Clusters of Orthologous Groups of proteins)
  • 2° Structure (Network Protein Sequence Analysis)

Analyze:

Predict TMSs (Predict number of transmembrane segments)
Window Size: Angle:  
Window Size: Angle:  
FASTA formatted sequence
1:	MKTILRIASI LAQCAIMICL PLKNSIGGRC FTGGEPFKTI NATSAPSEVC LRDDISMVKS 
61:	IGIHSRGDDS DMITSSVTFY RLYYVKDWHD CNPISDHMGT FLVLNIEDSG SIKAENYACR 
121:	TRCDISLNRD RGTIELTSTN LNHYSITGTT IASGWFKLKL EVQLLSTCES ISVTCGQKTL 
181:	EFKACFRQHR KCINYFHGSI LPEIMIEGIC ANLEVIILVF FICLNTILAI IITNTYVIYA 
241:	LIPLIYPFYK LYGIIYNKCL KKCKNCKLAI HPFTICPTKC ICGMVYNSTE ALYLHRQCNN 
301:	CTGYKALTHA RIACKKKIPN AIMAIFTTIL IFSFLTPVSA ECYNLSELPQ DYINMVNYIG 
361:	MRSLLGYVVA SLALLIAIVV LLQNKLAEQA LKLYYVNCAF CGMIHHKRNL ILEQGFTNQC 
421:	LTCICHDKNI HKATKKCTIR YKWHITNNIR WLVFVSILII MPASIYPMQC LRSEEITNLE 
481:	EASACISVYQ NVTQKKQYHE LIKSMSEQLS SDEVSILLPQ VVPSYINLIH EIENENDLHT 
541:	AIVKEIILAN LYPEIVKKYY SAAGPDTVKW RTILLNAGLH ICSEHVVKMI CRCALLQQEC 
601:	QSVTSDDGNQ IETYYKSHKE EFYEDMASIF KVIYTAFPGL TKFLLVRSMS SKALQDAVPV 
661:	LGKLKYYTRN NNHLNGIITF AEHIISKNVT SESRTINFEV RKLTGQQFTD KNVGSSGITT 
721:	CQTPKLVTCT GKRLRSLQKE YIACSNNGVK MYLKEDKIYC RVGADLCVGD KYCLISFTPI 
781:	TDKENVDKLI CYATEFRDQS NGMLKSSQSI RVKKLGSCAL KGQLVNIAMS SENLLYKYDT 
841:	IYHKKTPLVD EYCLSEKCTS DHYPYSSENL KNCVWTITNH KFQSQLHIDH QDIESFISGI 
901:	KLSLHNDLIT HNYKPTQNMP HIIPNYKSIS IAGSDNGNTI TDAYVLFTIP LTTGLSQGFS 
961:	VNTKTNKGLF DLVVYIKRAT IKAEYTFEYE TGPTIGINAV HSEKCTGFCP REIPHATNWL 
1021:	TFTKEHTSSW GCEEWGCFAI NTGCVYGSCQ DIIKPEGKVY KKIGSEAIDA EICITDSSET 
1081:	FCTEITSYNP ILGEKVQIEI MSQDSSLLPS NIFQKNNNIY KGDINPKGTF AKKCGSVQKV 
1141:	QDQIYGSGEP RFDYICHAAS RKDVVVRKCY DNAYISCSTL ESIQNMDLIK DESKWYLRQD 
1201:	TGLYGSVKVK LLLGDLNYKQ DDTTTTRITA KAVCGGCTDC FDDVSCRVDV TSNNIASCSV 
1261:	ESTCSPYINR LSLVEGSQQL HLKFKCKMEA IAFSICGIKA EIRSEIVKSH KILDLASLSQ 
1321:	TSYIREYDKK CGTWLCRVYN EGIEGIIGPI WKEFNIWLKY GTVVVVLIFS VILIVKVINP 
1381:	LIKLIINTLK HNEQMYLLES KQK