高级检索

Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/CHN/XZCDC_0026/2024 ORF1ab polyprotein (ORF1ab), ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6 protein (ORF6), ORF7a protein (ORF7a), ORF7b (ORF7b), ORF8 protein (ORF8), nucleocapsid phosphoprotein (N), and ORF10 protein (ORF10) genes, complete cds.

GenBase:
C_AA070748.1

LOCUS       C_AA070748             29822 bp    ss-RNA  linear   VRL 07-MAY-2024
DEFINITION  Severe acute respiratory syndrome coronavirus 2 isolate
            SARS-CoV-2/human/CHN/XZCDC_0026/2024 ORF1ab polyprotein (ORF1ab),
            ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein
            (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6
            protein (ORF6), ORF7a protein (ORF7a), ORF7b (ORF7b), ORF8 protein
            (ORF8), nucleocapsid phosphoprotein (N), and ORF10 protein (ORF10)
            genes, complete cds.
ACCESSION   C_AA070748
VERSION     C_AA070748.1
KEYWORDS    .
SOURCE      Severe acute respiratory syndrome coronavirus 2
  ORGANISM  Severe acute respiratory syndrome coronavirus 2
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
            Betacoronavirus; Sarbecovirus; Severe acute respiratory
            syndrome-related coronavirus.
REFERENCE   1  (bases 1 to 29822)
  AUTHORS   Hong,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-MAY-2024) inspection and verification office, Center
            for Disease Control and Prevention of Tibet Autonomous Region,
            Linkuo northroad 21, Lhasa, Tibet 850000, China
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: weiweilai v. 20240507
            Sequencing Technology :: Illumina
            ##Genome-Assembly-Data-END##
            .
FEATURES             Location/Qualifiers
     source          1..29822
                     /organism="Severe acute respiratory syndrome coronavirus 2"
                     /mol_type="genomic RNA"
                     /isolate="SARS-CoV-2/human/CHN/XZCDC_0026/2024"
                     /host="Homo sapiens; Host_age: 51 years; Host_age_unit:
                     year"
                     /country="China:Xizang"
                     /collection_date="2024-04-19"
     gene            254..21534
                     /gene="ORF1ab"
     CDS             join(254..13447,13447..21534)
                     /gene="ORF1ab"
                     /ribosomal_slippage
                     /codon_start=1
                     /product="ORF1ab polyprotein"
                     /protein_id="C_AAH32360.1"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH
                     LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL
                     GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT
                     KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKDSCTLSEQLDFI
                     DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII
                     KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT
                     CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK
                     GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK
                     VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK
                     KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY
                     SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF
                     KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII
                     IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT
                     EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT
                     FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF
                     ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED
                     EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG
                     QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE
                     AKKVKPTLVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG
                     HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV
                     DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR
                     KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP
                     YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK
                     TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI
                     QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE
                     AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS
                     TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL
                     HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT
                     DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD
                     AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG
                     QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE
                     LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT
                     TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD
                     NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYRHYTPSFKKGAKLLHKPIVW
                     HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV
                     VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP
                     NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM
                     PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN
                     IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP
                     CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI
                     MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN
                     SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF
                     ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN
                     LDSLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD
                     SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF
                     VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN
                     AQVAKSHNITLIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA
                     LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR
                     DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT
                     NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT
                     NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG
                     VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA
                     IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT
                     FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF
                     STFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAAC
                     CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL
                     NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV
                     LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
                     LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD
                     MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL
                     TILTSLLFLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL
                     ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR
                     RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY
                     CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFKYMN
                     SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES
                     SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ
                     AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL
                     EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLN
                     IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS
                     PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTIKGG
                     RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL
                     NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK
                     MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT
                     CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPC
                     GTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQ
                     HEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCD
                     TLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNA
                     GIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVD
                     TDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLF
                     STVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAA
                     DPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSV
                     ELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVI
                     VNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNR
                     ARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENP
                     HLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGG
                     SLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYEC
                     LYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQ
                     NNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIV
                     KTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVML
                     TNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHV
                     ISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLY
                     KNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIA
                     TVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVY
                     RGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQ
                     KVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDK
                     CSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVN
                     ARLCAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAE
                     IVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTHNPAW
                     RKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAI
                     TRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQA
                     PTHLSVDTKFKTEGLCVDVPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRH
                     VRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPP
                     PGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVK
                     IGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDL
                     YCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVK
                     AALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKF
                     TDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAF
                     VNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYL
                     DAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSII
                     NNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIW
                     DYKRDAPAHISTIGVCSMTDIAKKPIETICAPLTVFFDGRVDGQVDLFRNARNGVLITE
                     GSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFK
                     PRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPF
                     ELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTI
                     DYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSAT
                     LPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTL
                     LVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYI
                     CGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYL
                     GKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMI
                     LSLLSKGRLIIRENNRVVISSDVLVNN"
     mat_peptide     254..793
                     /gene="ORF1ab"
                     /product="leader protein"
     mat_peptide     794..2707
                     /gene="ORF1ab"
                     /product="nsp2"
     mat_peptide     2708..8542
                     /gene="ORF1ab"
                     /product="nsp3"
     mat_peptide     8543..10042
                     /gene="ORF1ab"
                     /product="nsp4"
     mat_peptide     10043..10960
                     /gene="ORF1ab"
                     /product="3C-like proteinase"
     mat_peptide     10961..11821
                     /gene="ORF1ab"
                     /product="nsp6"
     mat_peptide     11822..12070
                     /gene="ORF1ab"
                     /product="nsp7"
     mat_peptide     12071..12664
                     /gene="ORF1ab"
                     /product="nsp8"
     mat_peptide     12665..13003
                     /gene="ORF1ab"
                     /product="nsp9"
     mat_peptide     13004..13420
                     /gene="ORF1ab"
                     /product="nsp10"
     mat_peptide     join(13421..13447,13447..16215)
                     /gene="ORF1ab"
                     /product="RNA-dependent RNA polymerase"
     mat_peptide     16216..18018
                     /gene="ORF1ab"
                     /product="helicase"
     mat_peptide     18019..19599
                     /gene="ORF1ab"
                     /product="3'-to-5' exonuclease"
     mat_peptide     19600..20637
                     /gene="ORF1ab"
                     /product="endoRNAse"
     mat_peptide     20638..21531
                     /gene="ORF1ab"
                     /product="2'-O-ribose methyltransferase"
     CDS             254..13462
                     /gene="ORF1ab"
                     /codon_start=1
                     /product="ORF1a polyprotein"
                     /protein_id="C_AAH32364.1"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH
                     LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL
                     GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT
                     KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKDSCTLSEQLDFI
                     DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII
                     KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT
                     CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK
                     GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK
                     VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK
                     KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY
                     SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF
                     KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII
                     IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT
                     EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT
                     FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF
                     ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED
                     EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG
                     QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE
                     AKKVKPTLVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG
                     HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV
                     DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQR
                     KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP
                     YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK
                     TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI
                     QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE
                     AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS
                     TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL
                     HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT
                     DPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD
                     AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG
                     QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE
                     LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT
                     TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD
                     NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYRHYTPSFKKGAKLLHKPIVW
                     HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV
                     VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP
                     NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM
                     PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN
                     IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP
                     CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI
                     MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN
                     SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF
                     ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN
                     LDSLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD
                     SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF
                     VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN
                     AQVAKSHNITLIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA
                     LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR
                     DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT
                     NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT
                     NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG
                     VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA
                     IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT
                     FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF
                     STFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAAC
                     CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL
                     NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV
                     LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
                     LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD
                     MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL
                     TILTSLLFLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL
                     ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR
                     RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY
                     CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFKYMN
                     SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES
                     SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ
                     AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL
                     EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLN
                     IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS
                     PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTIKGG
                     RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL
                     NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK
                     MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT
                     CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV"
     mat_peptide     254..793
                     /gene="ORF1ab"
                     /product="leader protein"
     mat_peptide     794..2707
                     /gene="ORF1ab"
                     /product="nsp2"
     mat_peptide     2708..8542
                     /gene="ORF1ab"
                     /product="nsp3"
     mat_peptide     8543..10042
                     /gene="ORF1ab"
                     /product="nsp4"
     mat_peptide     10043..10960
                     /gene="ORF1ab"
                     /product="3C-like proteinase"
     mat_peptide     10961..11821
                     /gene="ORF1ab"
                     /product="nsp6"
     mat_peptide     11822..12070
                     /gene="ORF1ab"
                     /product="nsp7"
     mat_peptide     12071..12664
                     /gene="ORF1ab"
                     /product="nsp8"
     mat_peptide     12665..13003
                     /gene="ORF1ab"
                     /product="nsp9"
     mat_peptide     13004..13420
                     /gene="ORF1ab"
                     /product="nsp10"
     mat_peptide     13421..13459
                     /gene="ORF1ab"
                     /product="nsp11"
     stem_loop       13455..13482
                     /gene="ORF1ab"
                     /note="Coronavirus frameshifting stimulation element
                     stem-loop 1"
     stem_loop       13467..13521
                     /gene="ORF1ab"
                     /note="Coronavirus frameshifting stimulation element
                     stem-loop 2"
     gene            21542..25348
                     /gene="S"
     CDS             21542..25348
                     /gene="S"
                     /codon_start=1
                     /product="surface glycoprotein"
                     /protein_id="C_AAH32365.1"
                     /translation="MFVFLVLLPLVSSQCVXXXXXXXXXXXYTNSFTRGVYYPDKVFRS
                     SVLHSTQDLFLPFFSNVTWFHAISGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIF
                     GTTLDSKTQSLLIVNNATNVFIKVCEFQFCNDPFLDVYHKNNKSWMESESGVYSSANNC
                     TFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPIIGRDFPQGFSALEPL
                     VDLPIGINITRFQTLLALNRSYLTPGDSSSGWTAGAADYYVGYLQPRTFLLKYNENGTI
                     TDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNVTNLCPFHEVFNA
                     TRFASVYAWNRTRISNCVADYSVLYNFAPFFAFKCYGVSPTKLNDLCFTNVYADSFVIK
                     GNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKHSGNYDYWYRSFRKSKLK
                     PFERDISTEIYQAGNKPCKGKGPNCYFPLQSYGFRPTYGVGHQPYRVVVLSFELLHAPA
                     TVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTKSNKKFLPFQQFGRDIVDTTDAVRDPQT
                     LEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVSVAIHADQLTPTWRVYSTGS
                     NVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSRRRARSVASQSIIAYTMSL
                     GAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGS
                     FCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKRSF
                     IEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSA
                     LLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQ
                     DSLFSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDILSRLDKVEAEVQID
                     RLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFP
                     QSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEP
                     QIITTDNTFVSGNCDVVIGIVNNTVYDPLQLELDSFKEELDKYFKNHTSPDVDLGDISG
                     INASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMV
                     TIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT"
     gene            25357..26184
                     /gene="ORF3a"
     CDS             25357..26184
                     /gene="ORF3a"
                     /codon_start=1
                     /product="ORF3a protein"
                     /protein_id="C_AAH32366.1"
                     /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW
                     LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLEAP
                     FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV
                     TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD
                     IGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL"
     gene            26209..26436
                     /gene="E"
     CDS             26209..26436
                     /gene="E"
                     /codon_start=1
                     /product="envelope protein"
                     /protein_id="C_AAH32367.1"
                     /translation="MYSFVSEEIGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN
                     IVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
     gene            26487..27155
                     /gene="M"
     CDS             26487..27155
                     /gene="M"
                     /codon_start=1
                     /product="membrane glycoprotein"
                     /protein_id="C_AAH32368.1"
                     /translation="MAHSNGTITVEELKKLLEEWNLVIGFLFLAWICLLQFAYANRNRF
                     LYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFV
                     RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD
                     LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ
                     "
     gene            27166..27351
                     /gene="ORF6"
     CDS             27166..27351
                     /gene="ORF6"
                     /codon_start=1
                     /product="ORF6 protein"
                     /protein_id="C_AAH32369.1"
                     /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT
                     ENKYSQLDEEQPMEIL"
     gene            27358..27723
                     /gene="ORF7a"
     CDS             27358..27723
                     /gene="ORF7a"
                     /codon_start=1
                     /product="ORF7a protein"
                     /protein_id="C_AAH32370.1"
                     /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP
                     FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV
                     AAIVFITLCFTLKRKTE"
     gene            27720..27851
                     /gene="ORF7b"
     CDS             27720..27851
                     /gene="ORF7b"
                     /codon_start=1
                     /product="ORF7b"
                     /protein_id="C_AAH32371.1"
                     /translation="MIELSLIDFYLCFLAFLLLLVLIMLIIFWFSLELQDHNETCHA"
     gene            27858..28223
                     /gene="ORF8"
     CDS             27858..28223
                     /gene="ORF8"
                     /codon_start=1
                     /product="ORF8 protein"
                     /protein_id="C_AAH32361.1"
                     /translation="MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKW
                     YIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSF
                     YEDFLEYHDVRVVLDFI"
     gene            28238..29488
                     /gene="N"
     CDS             28238..29488
                     /gene="N"
                     /codon_start=1
                     /product="nucleocapsid phosphoprotein"
                     /protein_id="C_AAH32362.1"
                     /translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGGARSKQRRPQGLPNN
                     TASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPR
                     WYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTT
                     LPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALALLLLD
                     RLNKLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQG
                     NFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKD
                     PNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDD
                     FSKQLQQSMSRADSTQA"
     gene            29513..29629
                     /gene="ORF10"
     CDS             29513..29629
                     /gene="ORF10"
                     /codon_start=1
                     /product="ORF10 protein"
                     /protein_id="C_AAH32363.1"
                     /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"
     stem_loop       29564..29599
                     /gene="ORF10"
                     /note="Coronavirus 3' UTR pseudoknot stem-loop 1"
     stem_loop       29584..29612
                     /gene="ORF10"
                     /note="Coronavirus 3' UTR pseudoknot stem-loop 2"
     stem_loop       29683..29723
                     /note="Coronavirus 3' stem-loop II-like motif (s2m)"
ORIGIN
        1 taccttccta ggtaacaaac caaccaactt ttgatctctt gtagatctgt tctctaaacg
       61 aactttaaaa tctgtgtggc tgtcactcgg ctgcatgctt agtgcactca cgcagtataa
      121 ttaataacta attactgtcg ttgacaggac acgagtaact cgtctatctt ctgcaggctg
      181 cttacggttt cgtccgtgtt gcagccgatc atcagcacat ctaggttttg tccgggtgtg
      241 accgaaaggt aagatggaga gccttgtccc tggtttcaac gagaaaacac acgtccaact
      301 cagtttgcct gttttacagg ttcgcgacgt gctcgtacgt ggctttggag actccgtgga
      361 ggaggtctta tcagaggcac gtcaacatct taaagatggc acttgtggct tagtagaagt
      421 tgaaaaaggc gttttgcctc aacttgaaca gccctatgtg ttcatcaaac gttcggatgc
      481 tcgaactgca cctcatggtc atgttatggt tgagctggta gcagaactcg aaggcattca
      541 gtacggtcgt agtggtgaga cacttggtgt ccttgtccct catgtgggcg aaataccagt
      601 ggcttaccgc aaggttcttc ttcgtaagaa cggtaataaa ggagctggtg gccataggta
      661 cggcgccgat ctaaagtcat ttgacttagg cgacgagctt ggcactgatc cttatgaaga
      721 ttttcaagaa aactggaaca ctaaacatag cagtggtgtt acccgtgaac tcatgcgtga
      781 gcttaacgga ggggcataca ctcgctatgt cgataacaac ttctgtggcc ctgatggcta
      841 ccctcttgag tgcattaaag accttctagc acgtgctggt aaagattcat gcactttgtc
      901 cgaacaactg gactttattg acactaagag gggtgtatac tgctgccgtg aacatgagca
      961 tgaaattgct tggtacacgg aacgttctga aaagagctat gaattgcaga caccttttga
     1021 aattaaattg gcaaagaaat ttgacacctt caatggggaa tgtccaaatt ttgtatttcc
     1081 cttaaattcc ataatcaaga ctattcaacc aagggttgaa aagaaaaagc ttgatggctt
     1141 tatgggtaga attcgatctg tctatccagt tgcgtcacca aatgaatgca accaaatgtg
     1201 cctttcaact ctcatgaagt gtgatcattg tggtgaaact tcatggcaga cgggcgattt
     1261 tgttaaagcc acttgcgaat tttgtggcac tgagaatttg actaaagaag gtgccactac
     1321 ttgtggttac ttaccccaaa atgctgttgt taaaatttat tgtccagcat gtcacaattc
     1381 agaagtagga cctgagcata gtcttgccga ataccataat gaatctggct tgaaaaccat
     1441 tcttcgtaag ggtggtcgca ctattgcctt tggaggctgt gtgttctctt atgttggttg
     1501 ccataacaag tgtgcctatt gggttccacg tgctagcgct aacataggtt gtaaccatac
     1561 aggtgttgtt ggagaaggtt ccgaaggtct taatgacaac cttcttgaaa tactccaaaa
     1621 agagaaagtc aacatcaata ttgttggtga ctttaaactt aatgaagaga tcgccattat
     1681 tttggcatct ttttctgctt ccacaagtgc ttttgtggaa actgtgaaag gtttggatta
     1741 taaagcattc aaacaaattg ttgaatcctg tggtaatttt aaagttacaa aaggaaaagc
     1801 taaaaaaggt gcctggaata ttggtgaaca gaaatcaata ctgagtcctc tttatgcatt
     1861 tgcatcagag gctgctcgtg ttgtacgatc aattttctcc cgcactcttg aaactgctca
     1921 aaattctgtg cgtgttttac agaaggccgc tataacaata ctagatggaa tttcacagta
     1981 ttcactgaga ctcattgatg ctatgatgtt cacatctgat ttggctacta acaatctagt
     2041 tgtaatggcc tacattacag gtggtgttgt tcagttgact tcgcagtggc taactaacat
     2101 ctttggcact gtttatgaaa aactcaaacc cgtccttgat tggcttgaag agaagtttaa
     2161 ggaaggtgta gagtttctta gagacggttg ggaaattgtt aaatttatct caacctgtgc
     2221 ttgtgaaatt gtcggtggac aaattgtcac ctgtgcaaag gaaattaagg agagtgttca
     2281 gacattcttt aagcttgtaa ataaattttt ggctttgtgt gctgactcta tcattattgg
     2341 tggagctaaa cttaaagcct tgaatttagg tgaaacattt gtcacgcact caaagggatt
     2401 gtacagaaag tgtgttaaat ccagagaaga aactggccta ctcatgcctc taaaagcccc
     2461 aaaagaaatt atcttcttag agggagaaac acttcccaca gaagtgttaa cagaggaagt
     2521 tgtcttgaaa actggtgatt tacaaccatt agaacaacct actagtgaag ctgttgaagc
     2581 tccattggtt ggtacaccag tttgtattaa cgggcttatg ttgctcgaaa tcaaagacac
数据集编号
序列分析