LOCUS C_AA014298 29766 bp ss-RNA linear VRL 11-MAY-2023 DEFINITION Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/CHN/SH-XG2305-3651/2023 ORF1ab polyprotein (ORF1ab), ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6 protein (ORF6), ORF7a protein (ORF7a), ORF7b (ORF7b), ORF8 protein (ORF8), nucleocapsid phosphoprotein (N), and ORF10 protein (ORF10) genes, complete cds. ACCESSION C_AA014298 VERSION C_AA014298.1 KEYWORDS . SOURCE Severe acute respiratory syndrome coronavirus 2 ORGANISM Severe acute respiratory syndrome coronavirus 2 Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus; Severe acute respiratory syndrome-related coronavirus. REFERENCE 1 (bases 1 to 29766) AUTHORS Zhang,W. TITLE Direct Submission JOURNAL Submitted (11-MAY-2023) Microbe lab, Shanghai Municipal Center for Disease Control & Prevention, west zhongshan road 1380, Shanghai 200336, China COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Consensus sequence method v. 940-000133-00 Sequencing Technology :: MGISEQ-200 ##Genome-Assembly-Data-END## . FEATURES Location/Qualifiers source 1..29766 /organism="Severe acute respiratory syndrome coronavirus 2" /mol_type="genomic RNA" /isolate="SARS-CoV-2/human/CHN/SH-XG2305-3651/2023" /isolation_source="Nasopharyngeal swab" /host="Homo sapiens; Host_age: 8; Host_sex: Female" /country="China:Shanghai" /collection_date="2023-04-27" /note="Passage_details/history: Original" gene 242..21513 /gene="ORF1ab" CDS join(242..13426,13426..21513) /gene="ORF1ab" /ribosomal_slippage /codon_start=1 /product="ORF1ab polyprotein" /protein_id="C_AAB30347.1" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHVVELVAELEGIQYGRSGETLGVL VPYVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNTKHS SGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTK RGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTI QPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEF CGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGR TIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNI NIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGA WNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLR LIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEG VEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGG AKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEV VLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNIFTL KGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEFACV VADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEE GDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQD GSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKK VKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNL AKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTV RTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPLVEQRKQD DKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIV GDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVL KKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRK YKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAAR YMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQL GIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQ VVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDSS FLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYY RARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQ TTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKH GTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTI KPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIK FADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVN NATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVEN PTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNEL SRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYF FTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIII WFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSV CLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQL FFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSST CMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISD EVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDN LRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAE VAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDS DVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQV AKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKG GKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIA STDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGD FLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVL EGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCV STSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVAIVV TCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLTFYL TNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTF EEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAACCHL AKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGL WLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKL KVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCGSV GFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAW LYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCA SLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTIL TSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMIFVKHKHAFLCLFLLPSLATV AYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGARRVW TLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPI FFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQG LLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSK LWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIA SEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKM ADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLSIIP LTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNL AWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFV LALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRG MVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLC THTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAN DPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPCGTG TSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEE TIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLK EILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIV GVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDL TKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTV FPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPA MHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELK HFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNN LDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRART VAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLM GWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGSSLY VKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYR NRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNV FMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTD GTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTND NTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVIST SHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNT CVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVR EVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGT TTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVG MQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSR IIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARL CAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVD TVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKA VFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRA KVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTH LSVDTKFKTEGLCVDVPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRA WIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGD QFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGP ERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQ VHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAAL LADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDG VCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNL KQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAY NMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNT VYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYK RDAPAHISTIGVCSMTDIAKKPIETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSV KGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRS QMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELE DFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYT EISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPK GIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVD SDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGF IQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKP REQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSL LSKGRLIIRENNRVVISSDVLVNN" mat_peptide 242..772 /gene="ORF1ab" /product="leader protein" mat_peptide 773..2686 /gene="ORF1ab" /product="nsp2" mat_peptide 2687..8521 /gene="ORF1ab" /product="nsp3" mat_peptide 8522..10021 /gene="ORF1ab" /product="nsp4" mat_peptide 10022..10939 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10940..11800 /gene="ORF1ab" /product="nsp6" mat_peptide 11801..12049 /gene="ORF1ab" /product="nsp7" mat_peptide 12050..12643 /gene="ORF1ab" /product="nsp8" mat_peptide 12644..12982 /gene="ORF1ab" /product="nsp9" mat_peptide 12983..13399 /gene="ORF1ab" /product="nsp10" mat_peptide join(13400..13426,13426..16194) /gene="ORF1ab" /product="RNA-dependent RNA polymerase" mat_peptide 16195..17997 /gene="ORF1ab" /product="helicase" mat_peptide 17998..19578 /gene="ORF1ab" /product="3'-to-5' exonuclease" mat_peptide 19579..20616 /gene="ORF1ab" /product="endoRNAse" mat_peptide 20617..21510 /gene="ORF1ab" /product="2'-O-ribose methyltransferase" CDS 242..13441 /gene="ORF1ab" /codon_start=1 /product="ORF1a polyprotein" /protein_id="C_AAB30351.1" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHVVELVAELEGIQYGRSGETLGVL VPYVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNTKHS SGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTK RGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTI QPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEF CGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGR TIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEKVNI NIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGA WNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLR LIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEG VEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSIIIGG AKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEV VLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNIFTL KGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEFACV VADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEE GDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQD GSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKK VKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNL AKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTV RTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPLVEQRKQD DKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIV GDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVL KKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRK YKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAAR YMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQL GIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQ VVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDSS FLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYY RARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQ TTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKH GTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTI KPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIK FADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVN NATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVEN PTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNEL SRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYF FTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIII WFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSV CLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQL FFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSST CMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISD EVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDN LRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAE VAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDS DVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQV AKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKG GKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIA STDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGD FLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVL EGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCV STSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVAIVV TCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLTFYL TNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTF EEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAACCHL AKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGL WLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKL KVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCGSV GFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAW LYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCA SLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLLTIL TSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMIFVKHKHAFLCLFLLPSLATV AYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGARRVW TLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPI FFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQG LLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSK LWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQAIA SEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKM ADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLSIIP LTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNL AWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFV LALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRG MVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLC THTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAN DPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV" mat_peptide 242..772 /gene="ORF1ab" /product="leader protein" mat_peptide 773..2686 /gene="ORF1ab" /product="nsp2" mat_peptide 2687..8521 /gene="ORF1ab" /product="nsp3" mat_peptide 8522..10021 /gene="ORF1ab" /product="nsp4" mat_peptide 10022..10939 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10940..11800 /gene="ORF1ab" /product="nsp6" mat_peptide 11801..12049 /gene="ORF1ab" /product="nsp7" mat_peptide 12050..12643 /gene="ORF1ab" /product="nsp8" mat_peptide 12644..12982 /gene="ORF1ab" /product="nsp9" mat_peptide 12983..13399 /gene="ORF1ab" /product="nsp10" mat_peptide 13400..13438 /gene="ORF1ab" /product="nsp11" stem_loop 13434..13461 /gene="ORF1ab" /note="Coronavirus frameshifting stimulation element stem-loop 1" stem_loop 13446..13500 /gene="ORF1ab" /note="Coronavirus frameshifting stimulation element stem-loop 2" gene 21521..25333 /gene="S" CDS 21521..25333 /gene="S" /codon_start=1 /product="surface glycoprotein" /protein_id="C_AAB30352.1" /translation="MFVFLVLLPLVSSQCVNLITRTQSYTNSFTRGVYYPDKVFRSSVL HSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFG TTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDVYYHENNKSRMESELRVYSSANNC TFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPVNLGRDLPQGFSALEP LVDLPIGINITRFQTLLALHRSYLTPGDSSSSWTAGAAAYYVGYLQPRTFLLKYNENGT ITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFHEVFN ATTFASVYAWNRTRISNCVADYSVLYNFAPFFAFKCYGVSPTKLNDLCFTNVYADSFVI RGNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKVSGNYNYLYRLFRKSKL KPFERDISTEIYQAGNKPCNGVAGFNCYSPLQSYGFRPTYGVGHQPYRVVVLSFELLHA PATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDP QTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYST GSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHRRARSVASQSIIAYTM SLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQY GSFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKR SFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYT SALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGK IQDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDILSRLDKVEAEVQ IDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMS FPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFY EPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDI SGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIV MVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT" gene 25342..26169 /gene="ORF3a" CDS 25342..26169 /gene="ORF3a" /codon_start=1 /product="ORF3a protein" /protein_id="C_AAB30353.1" /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLEAP FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD IGVEHVIFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL" gene 26194..26421 /gene="E" CDS 26194..26421 /gene="E" /codon_start=1 /product="envelope protein" /protein_id="C_AAB30354.1" /translation="MYSFVSEEIGALIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN IVNVSLVKPSFYVYSRVKNLNSSRVPDLLV" gene 26472..27140 /gene="M" CDS 26472..27140 /gene="M" /codon_start=1 /product="membrane glycoprotein" /protein_id="C_AAB30355.1" /translation="MADSNGTITVEELKKLLEEWNLVIGFLFLTWICLLQFAYANRNRF LYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFA RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ " gene 27151..27336 /gene="ORF6" CDS 27151..27336 /gene="ORF6" /codon_start=1 /product="ORF6 protein" /protein_id="C_AAB30356.1" /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT ENKYSQLDEEQPMEIL" gene 27343..27708 /gene="ORF7a" CDS 27343..27708 /gene="ORF7a" /codon_start=1 /product="ORF7a protein" /protein_id="C_AAB30357.1" /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV AAIVFITLCFTLKRKTE" gene 27705..27836 /gene="ORF7b" CDS 27705..27836 /gene="ORF7b" /codon_start=1 /product="ORF7b" /protein_id="C_AAB30358.1" /translation="MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETWHA" gene 27843..28208 /gene="ORF8" CDS 27843..28208 /gene="ORF8" /codon_start=1 /product="ORF8 protein" /protein_id="C_AAB30348.1" /translation="MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKW YIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSF YQDFLEYHDVRVVLDFI" gene 28223..29473 /gene="N" CDS 28223..29473 /gene="N" /codon_start=1 /product="nucleocapsid phosphoprotein" /protein_id="C_AAB30349.1" /translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGGARSKQRRPQGLPNN TASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPR WYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTT LPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALALLLLD RLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQG NFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKD PNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDD FSKQLQQSMSRADSTQA" gene 29498..29614 /gene="ORF10" CDS 29498..29614 /gene="ORF10" /codon_start=1 /product="ORF10 protein" /protein_id="C_AAB30350.1" /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT" stem_loop 29549..29584 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 1" stem_loop 29569..29597 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 2" stem_loop 29668..29682 /note="Coronavirus 3' stem-loop II-like motif (s2m)" ORIGIN 1 taacaaacca accaactttt gatctcttgt agatctgttc tctaaacgaa ctttaaaatc 61 tgtgtggctg tcactcggct gcatgcttag tgcactcacg cagtataatt aataactaat 121 tactgtcgtt gacaggacac gagtaactcg tctatcttct gcaggctgct tacggtttcg 181 tccgtgttgc agccgatcat cagcacatct aggttttgtc cgggtgtgac cgaaaggtaa 241 gatggagagc cttgtccctg gtttcaacga gaaaacacac gtccaactca gtttgcctgt 301 tttacaggtt cgcgacgtgc tcgtacgtgg ctttggagac tccgttgagg aggtcttatc 361 agaggcacgt caacatctta aagatggcac ttgtggctta gtagaagttg aaaaaggcgt 421 tttgcctcaa cttgaacagc cctatgtgtt catcaaacgt tcggatgctc gaactgcacc 481 tcatgtggtt gagctggtag cagaactcga aggcattcag tacggtcgta gtggtgagac 541 acttggtgtc cttgtccctt atgtgggcga aataccagtg gcttaccgca aggttcttct 601 tcgtaagaac ggtaataaag gagctggtgg ccataggtac ggcgccgatc taaagtcatt 661 tgacttaggc gacgagcttg gcactgatcc ttatgaagat tttcaagaaa actggaacac 721 taaacatagc agtggtgtta cccgtgaact catgcgtgag cttaacggag gggcatacac 781 tcgctatgtc gataacaact tctgtggccc tgatggctac cctcttgagt gcattaaaga 841 ccttctagca cgtgctggta aagcttcatg cactttgtcc gaacaactgg actttattga 901 cactaagagg ggtgtatact gctgccgtga acatgagcat gaaattgctt ggtacacgga 961 acgttctgaa aagagctatg aattgcagac accttttgaa attaaattgg caaagaaatt 1021 tgacaccttc aatggggaat gtccaaattt tgtatttccc ttaaattcca taatcaagac 1081 tattcaacca agggttgaaa agaaaaagct tgatggcttt atgggtagaa ttcgatctgt 1141 ctatccagtt gcgtcaccaa atgaatgcaa ccaaatgtgc ctttcaactc tcatgaagtg 1201 tgatcattgt ggtgaaactt catggcagac gggcgatttt gttaaagcca cttgcgaatt 1261 ttgtggcact gagaatttga ctaaagaagg tgccactact tgtggttact taccccaaaa 1321 tgctgttgtt aaaatttatt gtccagcatg tcacaattca gaagtaggac ctgagcatag 1381 tcttgccgaa tatcataatg aatctggctt gaaaaccatt cttcgtaagg gtggtcgcac 1441 tattgccttt ggaggctgtg tgttctctta tgttggttgc cataacaagt gtgcctattg 1501 ggttccacgt gctagcgcta acataggttg taaccataca ggtgttgttg gagaaggttc 1561 cgaaggtctt aatgacaacc ttcttgaaat actccaaaaa gagaaagtca acatcaatat 1621 tgttggtgac tttaaactta atgaagagat cgccattatt ttggcatctt tttctgcttc 1681 cacaagtgct tttgtggaaa ctgtgaaagg tttggattat aaagcattca aacaaattgt 1741 tgaatcctgt ggtaatttta aagttacaaa aggaaaagct aaaaaaggtg cctggaatat 1801 tggtgaacag aaatcaatac tgagtcctct ttatgcattt gcatcagagg ctgctcgtgt 1861 tgtacgatca attttctccc gcactcttga aactgctcaa aattctgtgc gtgttttaca 1921 gaaggccgct ataacaatac tagatggaat ttcacagtat tcactgagac tcattgatgc 1981 tatgatgttc acatctgatt tggctactaa caatctagtt gtaatggcct acattacagg 2041 tggtgttgtt cagttgactt cgcagtggct aactaacatc tttggcactg tttatgaaaa 2101 actcaaaccc gtccttgatt ggcttgaaga gaagtttaag gaaggtgtag agtttcttag 2161 agacggttgg gaaattgtta aatttatctc aacctgtgct tgtgaaattg tcggtggaca 2221 aattgtcacc tgtgcaaagg aaattaagga gagtgttcag acattcttta agcttgtaaa 2281 taaatttttg gctttgtgtg ctgactctat cattattggt ggagctaaac ttaaagcctt 2341 gaatttaggt gaaacatttg tcacgcactc aaagggattg tacagaaagt gtgttaaatc 2401 cagagaagaa actggcctac tcatgcctct aaaagcccca aaagaaatta tcttcttaga 2461 gggagaaaca cttcccacag aagtgttaac agaggaagtt gtcttgaaaa ctggtgattt 2521 acaaccatta gaacaaccta ctagtgaagc tgttgaagct ccattggttg gtacaccagt