LOCUS C_AA060754 29393 bp ss-RNA linear VRL 11-MAR-2024 DEFINITION Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/CHN/HainanCDC20230753/2023 ORF1ab polyprotein (ORF1ab) and ORF1a polyprotein (ORF1ab) genes, partial cds; surface glycoprotein (S), ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6 protein (ORF6), ORF7a protein (ORF7a), and ORF7b (ORF7b) genes, complete cds; ORF8 gene, complete sequence; and nucleocapsid phosphoprotein (N) and ORF10 protein (ORF10) genes, complete cds. ACCESSION C_AA060754 VERSION C_AA060754.1 KEYWORDS . SOURCE Severe acute respiratory syndrome coronavirus 2 ORGANISM Severe acute respiratory syndrome coronavirus 2 Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus; Severe acute respiratory syndrome-related coronavirus. REFERENCE 1 (bases 1 to 29393) AUTHORS Cui,l. TITLE Direct Submission JOURNAL Submitted (11-MAR-2024) Microbiology laboratory, HainanCDC, No.40, Haifu Road, Meilan District, Haikou, Hainan 570203, China COMMENT ##Genome-Assembly-Data-START## Assembly Method :: IPH-NANO v. 1.12 Sequencing Technology :: Nanopore ##Genome-Assembly-Data-END## . FEATURES Location/Qualifiers source 1..29393 /organism="Severe acute respiratory syndrome coronavirus 2" /mol_type="genomic RNA" /isolate="SARS-CoV-2/human/CHN/HainanCDC20230753/2023" /host="Homo sapiens; Host_age: Unknown" /country="China:Hainan,Haikou" /collection_date="2023-12-13" gene <1..21233 /gene="ORF1ab" CDS join(<1..13146,13146..21233) /gene="ORF1ab" /ribosomal_slippage /codon_start=1 /product="ORF1ab polyprotein" /protein_id="C_AAG31585.1" /translation="SLPVLQVRDVLVRGFGDSVEEVLSEARQHLRDGTCGLVEVEKGVL PQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRK VLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNG GAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEI AWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFM GRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATT CGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVG CHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIA IILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPL YAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITIXDGISQYSLRLIDAMMFTSDLAT NNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKF ISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFV THSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQP TSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDD TVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVS ELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQ YEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTI VEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVY LKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVN KGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNL YDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTT TLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVV IPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSI ISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDY GARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSV SSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVY YTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFG PTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTK KWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCAL ILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMG TLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNY QCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCT EIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKK PASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTW CIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVK TTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHG LAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRS TNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLI YSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPS LETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNS WLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRV ECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRP INPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPIN VIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTF SSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKL SHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVK DFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIK VTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADF DTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVG NICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDT RYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYY RSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRA FGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWM VMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKE MYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSD VLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVI CTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYK FVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCGSVGFNIDYDCVSFCY MHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFL NRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGR TILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWS LFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWV MRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVY YGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIML VYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKL NIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDIL LAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFA TAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQAR SEDKRAKVTSAMQTMLFTMLRKLDNDALXNIINNARDGCVPLNIIPLTTASKLMVVIPD YNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANS AVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWA RFPKSDGTGTXYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ AGNATEVPANSTVLSFCAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYRAFDIY NDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVA KHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDY FNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGN WYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDLTKPYIKWDLLKYD FTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPLTSFGPLVRKI FVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKR TTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAIS DYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWG KARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNR QFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPN MLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGSSLYVKPGGTSSGDATT AYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFY AYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDL TKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAI DAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEA MYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVIPTSHKLVLSVNPYVC NAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAI ATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWE VGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVL TSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGT GKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDK FKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLCAKHYVYIGDPAQ LPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKA HKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVA SKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDL YDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKTEGLC VDXPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATR EAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGL PWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATC FSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAI MTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGN PKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYP ANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPC ESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSLWVY KQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELF ENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVC SMTDIAKKPIETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQAS LNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRSQMEIDFLELAMDE FIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYF ITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHV ETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLC QYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADST LIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAI KITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANY IFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNR VVISSDVLVNN" mat_peptide <1..492 /gene="ORF1ab" /product="leader protein" mat_peptide 493..2406 /gene="ORF1ab" /product="nsp2" mat_peptide 2407..8241 /gene="ORF1ab" /product="nsp3" mat_peptide 8242..9741 /gene="ORF1ab" /product="nsp4" mat_peptide 9742..10659 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10660..11520 /gene="ORF1ab" /product="nsp6" mat_peptide 11521..11769 /gene="ORF1ab" /product="nsp7" mat_peptide 11770..12363 /gene="ORF1ab" /product="nsp8" mat_peptide 12364..12702 /gene="ORF1ab" /product="nsp9" mat_peptide 12703..13119 /gene="ORF1ab" /product="nsp10" mat_peptide join(13120..13146,13146..15914) /gene="ORF1ab" /product="RNA-dependent RNA polymerase" mat_peptide 15915..17717 /gene="ORF1ab" /product="helicase" mat_peptide 17718..19298 /gene="ORF1ab" /product="3'-to-5' exonuclease" mat_peptide 19299..20336 /gene="ORF1ab" /product="endoRNAse" mat_peptide 20337..21230 /gene="ORF1ab" /product="2'-O-ribose methyltransferase" CDS <1..>12755 /gene="ORF1ab" /codon_start=1 /product="ORF1a polyprotein" /protein_id="C_AAG31588.1" /translation="SLPVLQVRDVLVRGFGDSVEEVLSEARQHLRDGTCGLVEVEKGVL PQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRK VLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNG GAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEI AWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFM GRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATT CGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVG CHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNINIVGDFKLNEEIA IILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPL YAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITIXDGISQYSLRLIDAMMFTSDLAT NNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKF ISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGGAKLKALNLGETFV THSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQP TSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGGAPTKVTFGDD TVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVS ELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQ YEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTI VEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVY LKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVN KGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNL YDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTT TLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVV IPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSI ISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDY GARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSV SSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVY YTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFG PTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTK KWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCAL ILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMG TLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNY QCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCT EIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKK PASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTW CIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVK TTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHG LAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRS TNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLI YSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPS LETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNS WLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRV ECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRP INPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPIN VIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTF SSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKL SHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVK DFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLKQLIK VTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADF DTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVG NICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDT RYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYY RSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRA FGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWM VMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKE MYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSD VLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVI CTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYK FVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCGSVGFNIDYDCVSFCY MHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFL NRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGR TILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTILTSLLVLVQSTQWS LFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWV MRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVY YGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIML VYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKL NIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDIL LAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIASEFSSLPSYAAFA TAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQAR SEDKRAKVTSAMQTMLFTMLRKLDNDALXNIINNARDGCVPLNIIPLTTASKLMVVIPD YNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANS AVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWA RFPKSDGTGTXYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ AGNATEVPANSTVLSFCA" mat_peptide <1..492 /gene="ORF1ab" /product="leader protein" mat_peptide 493..2406 /gene="ORF1ab" /product="nsp2" mat_peptide 2407..8241 /gene="ORF1ab" /product="nsp3" mat_peptide 8242..9741 /gene="ORF1ab" /product="nsp4" mat_peptide 9742..10659 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10660..11520 /gene="ORF1ab" /product="nsp6" mat_peptide 11521..11769 /gene="ORF1ab" /product="nsp7" mat_peptide 11770..12363 /gene="ORF1ab" /product="nsp8" mat_peptide 12364..12702 /gene="ORF1ab" /product="nsp9" mat_peptide 12703..>12755 /gene="ORF1ab" /product="nsp10" stem_loop <13211..13220 /gene="ORF1ab" /note="Coronavirus frameshifting stimulation element stem-loop 2" gene 21241..25053 /gene="S" CDS 21241..25053 /gene="S" /codon_start=1 /product="surface glycoprotein" /protein_id="C_AAG31589.1" /translation="MFVFLVLLPLVSSQCVNLITRTQSYTNSFTRGVYYPDKVFRSSVL HSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPALPFNDGVYFASTEKSNIIRGWIFG TTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDVYYXKNNKSWMESEFRVYSSANNC TFEYVSQPFLMDLEGKEGNFKNLREFVFKNIDGYFKIYSKHTPINLERDLPQGFSALEP LVDLPIGINITRFQTLLALHRSYLTPVDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGT ITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFHEVFN ATTFASVYAWXRKRISNCVADYSVIYNFAPFFAFKCYGVSPTKLNDLCFTNVYADSFVI RGNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKPSGNYNYLYRFLRKSKL KPFERDISTEIYQVGNKPCNGVAGPNCYSPLQSYGFRPTYGVGHQPYRVVVLSFELLHA PATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTIDAVRDP QTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYST GSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHXRARSVASQSIIAYTM SLGAENLVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQY GSFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKR SFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYT SALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGK IQDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDILSRLDKVEAEVQ IDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMS FPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFY EPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDI SGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIV MVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT" gene 25062..25889 /gene="ORF3a" CDS 25062..25889 /gene="ORF3a" /codon_start=1 /product="ORF3a protein" /protein_id="C_AAG31590.1" /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVIVYSHLLLVAAGLEAP FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD IGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL" gene 25914..26141 /gene="E" CDS 25914..26141 /gene="E" /codon_start=1 /product="envelope protein" /protein_id="C_AAG31591.1" /translation="MYSFVSEEIGALIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN IVNVSLVKPSFYVYSRVKNLNSSRVPDLLV" gene 26192..26860 /gene="M" CDS 26192..26860 /gene="M" /codon_start=1 /product="membrane glycoprotein" /protein_id="C_AAG31592.1" /translation="MADSNGTITVEELKKLLEEWNLVIGFLFLTWICLLQFAYANRNRF LYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFA RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ " gene 26871..27056 /gene="ORF6" CDS 26871..27056 /gene="ORF6" /codon_start=1 /product="ORF6 protein" /protein_id="C_AAG31593.1" /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT ENKYSQLDEEQPMEIL" gene 27063..27428 /gene="ORF7a" CDS 27063..27428 /gene="ORF7a" /codon_start=1 /product="ORF7a protein" /protein_id="C_AAG31594.1" /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV AAIVFITLCFTLKRKTE" gene 27425..27556 /gene="ORF7b" CDS 27425..27556 /gene="ORF7b" /codon_start=1 /product="ORF7b" /protein_id="C_AAG31595.1" /translation="MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA" gene 27563..27928 /gene="ORF8" misc_feature 27563..27928 /gene="ORF8" /note="similar to ORF8 protein" gene 27943..29193 /gene="N" CDS 27943..29193 /gene="N" /codon_start=1 /product="nucleocapsid phosphoprotein" /protein_id="C_AAG31586.1" /translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGGARSKQRRPQGLPNN TASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPR WYXYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTT LPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALALLLLD RLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQG NFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKD PNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDD FSKQLQQSMSRADSTQA" gene 29218..29334 /gene="ORF10" CDS 29218..29334 /gene="ORF10" /codon_start=1 /product="ORF10 protein" /protein_id="C_AAG31587.1" /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT" stem_loop 29269..29304 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 1" stem_loop 29289..29317 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 2" stem_loop 29388..>29393 /note="Coronavirus 3' stem-loop II-like motif (s2m)" ORIGIN 1 agtttgcctg ttttacaggt tcgcgacgtg ctcgtacgtg gctttggaga ctccgtggag 61 gaggtcttat cagaggcacg tcaacatctt agagatggca cttgtggctt agtagaagtt 121 gaaaaaggcg ttttgcctca acttgaacag ccctatgtgt tcatcaaacg ttcggatgct 181 cgaactgcac ctcatggtca tgttatggtt gagctggtag cagaactcga aggcattcag 241 tacggtcgta gtggtgagac acttggtgtc cttgtccctc atgtgggcga aataccagtg 301 gcttaccgca aggttcttct tcgtaagaac ggtaataaag gagctggtgg ccataggtac 361 ggcgccgatc tcaagtcatt tgacttaggc gacgagcttg gcactgatcc ttatgaagat 421 tttcaagaaa actggaacac taaacatagc agtggtgtta cccgtgaact catgcgtgag 481 cttaacggag gggcatacac tcgctatgtc gataacaact tctgtggccc tgatggctac 541 cctcttgagt gcattaaaga ccttctagca cgtgctggta aagcttcatg cactttgtcc 601 gaacaactgg actttattga cactaagagg ggtgtatact gctgccgtga acatgagcat 661 gaaattgctt ggtacacgga acgttctgaa aagagctatg aattgcagac accttttgaa 721 attaaattgg caaagaaatt tgacaccttc aatggggaat gtccaaattt tgtatttccc 781 ttaaattcca taatcaagac tattcaacca agggttgaaa agaaaaagct tgatggcttt 841 atgggtagaa ttcgatctgt ctatccagtt gcgtcaccaa atgaatgcaa ccaaatgtgc 901 ctttcaactc tcatgaagtg tgatcattgt ggtgaaactt catggcagac gggcgatttt 961 gttaaagcca cttgcgaatt ttgtggcact gagaatttga ctaaagaagg tgccactact 1021 tgtggttact taccccaaaa tgctgttgtt aaaatttatt gtccagcatg tcacaattca 1081 gaagtaggac ctgagcatag tcttgccgaa taccataatg aatctggctt gaaaaccatt 1141 cttcgtaagg gtggtcgcac tattgccttt ggaggctgtg tgttctctta tgttggttgc 1201 cataacaagt gtgcctattg ggttccacgt gctagcgcta acataggttg taaccataca 1261 ggtgttgttg gagaaggttc cgaaggtctt aatgacaacc ttcttgaaat actccaaaaa 1321 gagaaagtca acatcaatat tgttggtgac tttaaactta atgaagagat cgccattatt 1381 ttggcatctt tttctgcttc cacaagtgct tttgtggaaa ctgtgaaagg tttggattat 1441 aaagcattca aacaaattgt tgaatcctgt ggtaatttta aagttacaaa aggaaaagct 1501 aaaaaaggtg cctggaatat tggtgaacag aaatcaatac tgagtcctct ttatgcattt 1561 gcatcagagg ctgctcgtgt tgtacgatca attttctccc gcactcttga aactgctcaa 1621 aattctgtgc gtgttttaca gaaggccgct ataacaatan tagatggaat ttcacagtat 1681 tcactgagac tcattgatgc tatgatgttc acatctgatt tggctactaa caatctagtt 1741 gtaatggcct acattacagg tggtgttgtt cagttgactt cgcagtggct aactaacatc 1801 tttggcactg tttatgaaaa actcaaaccc gtccttgatt ggcttgaaga gaagtttaag 1861 gaaggtgtag agtttcttag agacggttgg gaaattgtta aatttatctc aacctgtgct 1921 tgtgaaattg tcggtggaca aattgtcacc tgtgcaaagg aaattaagga gagtgttcag 1981 acattcttta agcttgtaaa taaatttttg gctttgtgtg ctgactctat cattattggt 2041 ggagctaaac ttaaagcctt gaatttaggt gaaacatttg tcacgcactc aaagggattg 2101 tacagaaagt gtgttaaatc cagagaagaa actggcctac tcatgcctct aaaagcccca 2161 aaagaaatta tcttcttaga gggagaaaca cttcccacag aagtgttaac agaggaagtt 2221 gtcttgaaaa ctggtgattt acaaccatta gaacaaccta ctagtgaagc tgttgaagct 2281 ccattggttg gtacaccagt ttgtattaac gggcttatgt tgctcgaaat caaagacaca 2341 gaaaagtact gtgcccttgc acctaatatg atggtaacaa acaatacctt cacactcaaa 2401 ggcggtgcac caacaaaggt tacttttggt gatgacactg tgatagaagt gcaaggttac 2461 aagagtgtga atatcatttt tgaacttgat gaaaggattg ataaagtact taatgagaag 2521 tgctctgcct atacagttga actcggtaca gaagtaaatg agttcgcctg tgttgtggca 2581 gatgctgtca taaaaacttt gcaaccagta tctgaattac ttacaccact gggcattgat 2641 ttagatgagt ggagtatggc tacatactac ttatttgatg agtctggtga gtttaaattg 2701 gcttcacata tgtattgttc tttttaccct ccagatgagg atgaagaaga aggtgattgt 2761 gaagaagaag agtttgagcc atcaactcaa tatgagtatg gtactgaaga tgattaccaa 2821 ggtaaacctt tggaatttgg tgccacttct gctgctcttc aacctgaaga agagcaagaa 2881 gaagattggt tagatgatga tagtcaacaa actgttggtc aacaagacgg cagtgaggac 2941 aatcagacaa ctactattca aacaattgtt gaggttcaac ctcaattaga gatggaactt 3001 acaccagttg ttcagactat tgaagtgaat agttttagtg gttatttaaa acttactgac 3061 aatgtataca ttaaaaatgc agacattgtg gaagaagcta aaaaggtaaa accaacagtg 3121 gttgttaatg cagccaatgt ttaccttaaa catggaggag gtgttgcagg agccttaaat 3181 aaggctacta acaatgccat gcaagttgaa tctgatgatt acatagctac taatggacca 3241 cttaaagtgg gtggtagttg tgttttaagc ggacacaatc ttgctaaaca ctgtcttcat 3301 gttgtcggcc caaatgttaa caaaggtgaa gacattcaac ttcttaagag tgcttatgaa 3361 aattttaatc agcacgaagt tctacttgca ccattattat cagctggtat ttttggtgct 3421 gaccctatac attctttaag agtttgtgta gatactgttc gcacaaatgt ctacttagct 3481 gtctttgata aaaatctcta tgacaaactt gtttcaagct ttttggaaat gaagagtgaa 3541 aagcaagttg aacaaaagat cgctgagatt cctaaagagg aagttaagcc atttataact