高级检索

Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/CHN/SH-HG20230504-480/2023 ORF1ab polyprotein (ORF1ab), ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6 protein (ORF6), ORF7a protein (ORF7a), and ORF7b (ORF7b) genes, complete cds; ORF8 gene, complete sequence; and nucleocapsid phosphoprotein (N) and ORF10 protein (ORF10) genes, complete cds.

GenBase:
C_AA014168.1

LOCUS       C_AA014168             29749 bp    ss-RNA  linear   VRL 11-MAY-2023
DEFINITION  Severe acute respiratory syndrome coronavirus 2 isolate
            SARS-CoV-2/human/CHN/SH-HG20230504-480/2023 ORF1ab polyprotein
            (ORF1ab), ORF1a polyprotein (ORF1ab), surface glycoprotein (S),
            ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein
            (M), ORF6 protein (ORF6), ORF7a protein (ORF7a), and ORF7b (ORF7b)
            genes, complete cds; ORF8 gene, complete sequence; and nucleocapsid
            phosphoprotein (N) and ORF10 protein (ORF10) genes, complete cds.
ACCESSION   C_AA014168
VERSION     C_AA014168.1
KEYWORDS    .
SOURCE      Severe acute respiratory syndrome coronavirus 2
  ORGANISM  Severe acute respiratory syndrome coronavirus 2
            Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
            Betacoronavirus; Sarbecovirus; Severe acute respiratory
            syndrome-related coronavirus.
REFERENCE   1  (bases 1 to 29749)
  AUTHORS   Zhang,W.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-MAY-2023) Microbe lab, Shanghai Municipal Center for
            Disease Control & Prevention, west zhongshan road 1380, Shanghai
            200336, China
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Consensus sequence method v. 940-000133-00
            Sequencing Technology :: MGISEQ-200
            ##Genome-Assembly-Data-END##
            .
FEATURES             Location/Qualifiers
     source          1..29749
                     /organism="Severe acute respiratory syndrome coronavirus 2"
                     /mol_type="genomic RNA"
                     /isolate="SARS-CoV-2/human/CHN/SH-HG20230504-480/2023"
                     /isolation_source="Pharyngeal swab"
                     /host="Homo sapiens; Host_age: 35; Host_sex: Male"
                     /country="China:Shanghai"
                     /collection_date="2023-05-01"
                     /note="Passage_details/history: Original;
                     Additional_host_information: Patient infected while
                     traveling in Thailand"
     gene            219..21499
                     /gene="ORF1ab"
     CDS             join(219..13412,13412..21499)
                     /gene="ORF1ab"
                     /ribosomal_slippage
                     /codon_start=1
                     /product="ORF1ab polyprotein"
                     /protein_id="C_AAB28906.1"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH
                     LRDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL
                     GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT
                     KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFI
                     DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII
                     KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT
                     CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK
                     GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK
                     VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK
                     KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY
                     SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF
                     KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII
                     IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT
                     EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT
                     FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF
                     ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED
                     EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG
                     QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTGNVYIKNADIVEE
                     AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG
                     HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV
                     DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR
                     KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP
                     YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK
                     TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI
                     QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE
                     AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS
                     TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL
                     HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT
                     DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD
                     AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG
                     QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE
                     LKHSTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT
                     TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD
                     NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW
                     HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV
                     VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP
                     NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM
                     PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN
                     IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP
                     CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI
                     MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN
                     SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF
                     ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN
                     LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD
                     SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF
                     VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN
                     AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA
                     LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR
                     DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT
                     NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT
                     NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG
                     VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA
                     IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT
                     FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF
                     STFEEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAAC
                     CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL
                     NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV
                     LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
                     LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD
                     MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL
                     TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL
                     ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR
                     RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY
                     CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMN
                     SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES
                     SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ
                     AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL
                     EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLN
                     IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS
                     PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTIKGG
                     RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL
                     NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK
                     MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT
                     CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPC
                     GTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQ
                     HEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCD
                     TLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNA
                     GIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVD
                     TDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLF
                     STVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAA
                     DPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSV
                     ELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVI
                     VNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNR
                     ARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENP
                     HLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGS
                     SLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYEC
                     LYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQ
                     NNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIV
                     KTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVML
                     TNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHV
                     IPTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLY
                     KNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIA
                     TVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVY
                     RGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQ
                     KVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDK
                     CSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVN
                     ARLCAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAE
                     IVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAW
                     RKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAI
                     TRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQA
                     PTHLSVDTKFKTEGLCVDVPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRH
                     VRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPP
                     PGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVK
                     IGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDL
                     YCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVK
                     AALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKF
                     TDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAF
                     VNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYL
                     DAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSII
                     NNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIW
                     DYKRDAPAHISTIGVCSMTDIAKKPIETICAPLTVFFDGRVDGQVDLFRNARNGVLITE
                     GSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFK
                     PRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPF
                     ELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTI
                     DYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSAT
                     LPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTL
                     LVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYI
                     CGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYL
                     GKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMI
                     LSLLSKGRLIIRENNRVVISSDVLVNN"
     mat_peptide     219..758
                     /gene="ORF1ab"
                     /product="leader protein"
     mat_peptide     759..2672
                     /gene="ORF1ab"
                     /product="nsp2"
     mat_peptide     2673..8507
                     /gene="ORF1ab"
                     /product="nsp3"
     mat_peptide     8508..10007
                     /gene="ORF1ab"
                     /product="nsp4"
     mat_peptide     10008..10925
                     /gene="ORF1ab"
                     /product="3C-like proteinase"
     mat_peptide     10926..11786
                     /gene="ORF1ab"
                     /product="nsp6"
     mat_peptide     11787..12035
                     /gene="ORF1ab"
                     /product="nsp7"
     mat_peptide     12036..12629
                     /gene="ORF1ab"
                     /product="nsp8"
     mat_peptide     12630..12968
                     /gene="ORF1ab"
                     /product="nsp9"
     mat_peptide     12969..13385
                     /gene="ORF1ab"
                     /product="nsp10"
     mat_peptide     join(13386..13412,13412..16180)
                     /gene="ORF1ab"
                     /product="RNA-dependent RNA polymerase"
     mat_peptide     16181..17983
                     /gene="ORF1ab"
                     /product="helicase"
     mat_peptide     17984..19564
                     /gene="ORF1ab"
                     /product="3'-to-5' exonuclease"
     mat_peptide     19565..20602
                     /gene="ORF1ab"
                     /product="endoRNAse"
     mat_peptide     20603..21496
                     /gene="ORF1ab"
                     /product="2'-O-ribose methyltransferase"
     CDS             219..13427
                     /gene="ORF1ab"
                     /codon_start=1
                     /product="ORF1a polyprotein"
                     /protein_id="C_AAB28909.1"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH
                     LRDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL
                     GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT
                     KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFI
                     DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII
                     KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT
                     CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK
                     GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK
                     VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK
                     KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY
                     SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF
                     KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII
                     IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT
                     EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT
                     FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF
                     ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED
                     EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG
                     QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTGNVYIKNADIVEE
                     AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG
                     HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV
                     DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR
                     KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP
                     YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK
                     TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI
                     QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE
                     AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS
                     TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL
                     HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT
                     DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD
                     AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG
                     QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE
                     LKHSTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT
                     TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD
                     NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW
                     HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV
                     VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP
                     NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM
                     PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN
                     IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP
                     CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI
                     MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN
                     SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF
                     ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN
                     LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD
                     SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF
                     VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN
                     AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA
                     LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR
                     DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT
                     NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT
                     NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG
                     VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA
                     IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT
                     FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF
                     STFEEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAAC
                     CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL
                     NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV
                     LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
                     LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD
                     MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL
                     TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL
                     ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR
                     RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY
                     CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMN
                     SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES
                     SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ
                     AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL
                     EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLN
                     IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS
                     PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTIKGG
                     RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL
                     NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK
                     MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT
                     CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV"
     mat_peptide     219..758
                     /gene="ORF1ab"
                     /product="leader protein"
     mat_peptide     759..2672
                     /gene="ORF1ab"
                     /product="nsp2"
     mat_peptide     2673..8507
                     /gene="ORF1ab"
                     /product="nsp3"
     mat_peptide     8508..10007
                     /gene="ORF1ab"
                     /product="nsp4"
     mat_peptide     10008..10925
                     /gene="ORF1ab"
                     /product="3C-like proteinase"
     mat_peptide     10926..11786
                     /gene="ORF1ab"
                     /product="nsp6"
     mat_peptide     11787..12035
                     /gene="ORF1ab"
                     /product="nsp7"
     mat_peptide     12036..12629
                     /gene="ORF1ab"
                     /product="nsp8"
     mat_peptide     12630..12968
                     /gene="ORF1ab"
                     /product="nsp9"
     mat_peptide     12969..13385
                     /gene="ORF1ab"
                     /product="nsp10"
     mat_peptide     13386..13424
                     /gene="ORF1ab"
                     /product="nsp11"
     stem_loop       13420..13447
                     /gene="ORF1ab"
                     /note="Coronavirus frameshifting stimulation element
                     stem-loop 1"
     stem_loop       13432..13486
                     /gene="ORF1ab"
                     /note="Coronavirus frameshifting stimulation element
                     stem-loop 2"
     gene            21507..25316
                     /gene="S"
     CDS             21507..25316
                     /gene="S"
                     /codon_start=1
                     /product="surface glycoprotein"
                     /protein_id="C_AAB28910.1"
                     /translation="MFVFLVLLPLVSSQCVNLITRTQSYTNSFTRGVYYPDKVFRSSVL
                     HSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPALPFNDGVYFASTEKSNIIRGWIFG
                     TTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDVYQKNNKSWMESEFRVYSSANNCT
                     FEYVSQPFLMDLEGKEGNFKNLREFVFKNIDGYFKIYSKHTPINLERDLPQGFSALEPL
                     VDLPIGINITRFQTLLALHRSYLTPVDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTI
                     TDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFHEVFNA
                     TTFASVYAWNRKRISNCVADYSVIYNFAPFFAFKCYGVSPTKLNDLCFTNVYADSFVIR
                     GNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKPSGNYNYLYRLFRKSKLK
                     PFERDISTEIYQAGNKPCNGVAGPNCYSPLQSYGFRPTYGVGHQPYRVVVLSFELLHAP
                     ATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQ
                     TLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTG
                     SNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHRRARSVASQSIIAYTMS
                     LGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYG
                     SFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKRS
                     FIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTS
                     ALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKI
                     QDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDILSRLDKVEAEVQI
                     DRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSF
                     PQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYE
                     PQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDIS
                     GINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVM
                     VTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT"
     gene            25325..26152
                     /gene="ORF3a"
     CDS             25325..26152
                     /gene="ORF3a"
                     /codon_start=1
                     /product="ORF3a protein"
                     /protein_id="C_AAB28911.1"
                     /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW
                     LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLEAP
                     FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV
                     TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD
                     IGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL"
     gene            26177..26404
                     /gene="E"
     CDS             26177..26404
                     /gene="E"
                     /codon_start=1
                     /product="envelope protein"
                     /protein_id="C_AAB28912.1"
                     /translation="MYSFVSEEIGALIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN
                     IVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
     gene            26455..27123
                     /gene="M"
     CDS             26455..27123
                     /gene="M"
                     /codon_start=1
                     /product="membrane glycoprotein"
                     /protein_id="C_AAB28913.1"
                     /translation="MADSNGTITVEELKKLLEEWNLVIGFLFLTWICLLQFAYANRNRF
                     LYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFA
                     RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD
                     LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ
                     "
     gene            27134..27319
                     /gene="ORF6"
     CDS             27134..27319
                     /gene="ORF6"
                     /codon_start=1
                     /product="ORF6 protein"
                     /protein_id="C_AAB28914.1"
                     /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT
                     ENKYSQLDEEQPMEIL"
     gene            27326..27691
                     /gene="ORF7a"
     CDS             27326..27691
                     /gene="ORF7a"
                     /codon_start=1
                     /product="ORF7a protein"
                     /protein_id="C_AAB28915.1"
                     /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP
                     FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV
                     AAIVFITLCFTLKRKTE"
     gene            27688..27819
                     /gene="ORF7b"
     CDS             27688..27819
                     /gene="ORF7b"
                     /codon_start=1
                     /product="ORF7b"
                     /protein_id="C_AAB28916.1"
                     /translation="MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA"
     gene            27826..28191
                     /gene="ORF8"
     misc_feature    27826..28191
                     /gene="ORF8"
                     /note="similar to ORF8 protein"
     gene            28206..29456
                     /gene="N"
     CDS             28206..29456
                     /gene="N"
                     /codon_start=1
                     /product="nucleocapsid phosphoprotein"
                     /protein_id="C_AAB28907.1"
                     /translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGGARSKQRRPQGLPNN
                     TASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPR
                     WYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTT
                     LPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALALLLLD
                     RLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQG
                     NFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKD
                     PNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDD
                     FSKQLQQSMSRADSTQA"
     gene            29481..29597
                     /gene="ORF10"
     CDS             29481..29597
                     /gene="ORF10"
                     /codon_start=1
                     /product="ORF10 protein"
                     /protein_id="C_AAB28908.1"
                     /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"
     stem_loop       29532..29567
                     /gene="ORF10"
                     /note="Coronavirus 3' UTR pseudoknot stem-loop 1"
     stem_loop       29552..29580
                     /gene="ORF10"
                     /note="Coronavirus 3' UTR pseudoknot stem-loop 2"
     stem_loop       29651..29665
                     /note="Coronavirus 3' stem-loop II-like motif (s2m)"
ORIGIN
        1 ctcttgtaga tctgttctct aaacgaactt taaaatctgt gtggctgtca ctcggctgca
       61 tgcttagtgc actcacgcag tataattaat aactaattac tgtcgttgac aggacacgag
      121 taactcgtct atcttctgca ggctgcttac ggtttcgtcc gtgttgcagc cgatcatcag
      181 cacatctagg ttttgtccgg gtgtgaccga aaggtaagat ggagagcctt gtccctggtt
      241 tcaacgagaa aacacacgtc caactcagtt tgcctgtttt acaggttcgc gacgtgctcg
      301 tacgtggctt tggagactcc gtggaggagg tcttatcaga ggcacgtcaa catcttagag
      361 atggcacttg tggcttagta gaagttgaaa aaggcgtttt gcctcaactt gaacagccct
      421 atgtgttcat caaacgttcg gatgctcgaa ctgcacctca tggtcatgtt atggttgagc
      481 tggtagcaga actcgaaggc attcagtacg gtcgtagtgg tgagacactt ggtgtccttg
      541 tccctcatgt gggcgaaata ccagtggctt accgcaaggt tcttcttcgt aagaacggta
      601 ataaaggagc tggtggccat aggtacggcg ccgatctaaa gtcatttgac ttaggcgacg
      661 agcttggcac tgatccttat gaagattttc aagaaaactg gaacactaaa catagcagtg
      721 gtgttacccg tgaactcatg cgtgagctta acggaggggc atacactcgc tatgtcgata
      781 acaacttctg tggccctgat ggctaccctc ttgagtgcat taaagacctt ctagcacgtg
      841 ctggtaaagc ttcatgcact ttgtccgaac aactggactt tattgacact aagaggggtg
      901 tatactgctg ccgtgaacat gagcatgaaa ttgcttggta cacggaacgt tctgaaaaga
      961 gctatgaatt gcagacacct tttgaaatta aattggcaaa gaaatttgac accttcaatg
     1021 gggaatgtcc aaattttgta tttcccttaa attccataat caagactatt caaccaaggg
     1081 ttgaaaagaa aaagcttgat ggctttatgg gtagaattcg atctgtctat ccagttgcgt
     1141 caccaaatga atgcaaccaa atgtgccttt caactctcat gaagtgtgat cattgtggtg
     1201 aaacttcatg gcagacgggc gattttgtta aagccacttg cgaattttgt ggcactgaga
     1261 atttgactaa agaaggtgcc actacttgtg gttacttacc ccaaaatgct gttgttaaaa
     1321 tttattgtcc agcatgtcac aattcagaag taggacctga gcatagtctt gccgaatacc
     1381 ataatgaatc tggcttgaaa accattcttc gtaagggtgg tcgcactatt gcctttggag
     1441 gctgtgtgtt ctcttatgtt ggttgccata acaagtgtgc ctattgggtt ccacgtgcta
     1501 gcgctaacat aggttgtaac catacaggtg ttgttggaga aggttccgaa ggtcttaatg
     1561 acaaccttct tgaaatactc caaaaagaga aagtcaacat caatattgtt ggtgacttta
     1621 aacttaatga agagatcgcc attattttgg catctttttc tgcttccaca agtgcttttg
     1681 tggaaactgt gaaaggtttg gattataaag cattcaaaca aattgttgaa tcctgtggta
     1741 attttaaagt tacaaaagga aaagctaaaa aaggtgcctg gaatattggt gaacagaaat
     1801 caatactgag tcctctttat gcatttgcat cagaggctgc tcgtgttgta cgatcaattt
     1861 tctcccgcac tcttgaaact gctcaaaatt ctgtgcgtgt tttacagaag gccgctataa
     1921 caatactaga tggaatttca cagtattcac tgagactcat tgatgctatg atgttcacat
     1981 ctgatttggc tactaacaat ctagttgtaa tggcctacat tacaggtggt gttgttcagt
     2041 tgacttcgca gtggctaact aacatctttg gcactgttta tgaaaaactc aaacccgtcc
     2101 ttgattggct tgaagagaag tttaaggaag gtgtagagtt tcttagagac ggttgggaaa
     2161 ttgttaaatt tatctcaacc tgtgcttgtg aaattgtcgg tggacaaatt gtcacctgtg
     2221 caaaggaaat taaggagagt gttcagacat tctttaagct tgtaaataaa tttttggctt
     2281 tgtgtgctga ctctatcatt attggtggag ctaaacttaa agccttgaat ttaggtgaaa
     2341 catttgtcac gcactcaaag ggattgtaca gaaagtgtgt taaatccaga gaagaaactg
     2401 gcctactcat gcctctaaaa gccccaaaag aaattatctt cttagaggga gaaacacttc
     2461 ccacagaagt gttaacagag gaagttgtct tgaaaactgg tgatttacaa ccattagaac
     2521 aacctactag tgaagctgtt gaagctccat tggttggtac accagtttgt attaacgggc
     2581 ttatgttgct cgaaatcaaa gacacagaaa agtactgtgc ccttgcacct aatatgatgg
     2641 taacaaacaa taccttcaca ctcaaaggcg gtgcaccaac aaaggttact tttggtgatg
     2701 acactgtgat agaagtgcaa ggttacaaga gtgtgaatat catttttgaa cttgatgaaa
数据集编号
序列分析