LOCUS C_AA014190 29776 bp ss-RNA linear VRL 11-MAY-2023 DEFINITION Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/CHN/SH-HG20230504-502/2023 ORF1ab polyprotein (ORF1ab), ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein (ORF3a), envelope protein (E), membrane glycoprotein (M), ORF6 protein (ORF6), and ORF7a protein (ORF7a) genes, complete cds; ORF7b gene, complete sequence; and ORF8 protein (ORF8), nucleocapsid phosphoprotein (N), and ORF10 protein (ORF10) genes, complete cds. ACCESSION C_AA014190 VERSION C_AA014190.1 KEYWORDS . SOURCE Severe acute respiratory syndrome coronavirus 2 ORGANISM Severe acute respiratory syndrome coronavirus 2 Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus; Severe acute respiratory syndrome-related coronavirus. REFERENCE 1 (bases 1 to 29776) AUTHORS Zhang,W. TITLE Direct Submission JOURNAL Submitted (11-MAY-2023) Microbe lab, Shanghai Municipal Center for Disease Control & Prevention, west zhongshan road 1380, Shanghai 200336, China COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Consensus sequence method v. 940-000133-00 Sequencing Technology :: MGISEQ-200 ##Genome-Assembly-Data-END## . FEATURES Location/Qualifiers source 1..29776 /organism="Severe acute respiratory syndrome coronavirus 2" /mol_type="genomic RNA" /isolate="SARS-CoV-2/human/CHN/SH-HG20230504-502/2023" /isolation_source="Pharyngeal swab" /host="Homo sapiens; Host_age: 34; Host_sex: Male" /country="China:Shanghai" /collection_date="2023-05-02" /note="Passage_details/history: Original; Additional_host_information: Patient infected while traveling in Singapore" gene 242..21522 /gene="ORF1ab" CDS join(242..13435,13435..21522) /gene="ORF1ab" /ribosomal_slippage /codon_start=1 /product="ORF1ab polyprotein" /protein_id="C_AAB29151.1" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLVRAGKASCTLSEQLDFI DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPLVEQR KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT DSSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF STFEEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAAC CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMN SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLS IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGG RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNRVCGVSAARLTPC GTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQ HEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCD TLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNA GIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVD TDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLF STVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAA DPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSV ELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVI VNNLDKSAGFPFNKWGKARLYYDSMSYEDQDVLFAYTKRNVIPTITQMNLKYAISAKNR ARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENP HLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGS SLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYEC LYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQ NNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIV KTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVML TNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHV ISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLY KNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIA TVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVY RGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQ KVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDK CSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVN ARLCAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAE IVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAW RKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAI TRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQA PTHLSVDTKFKTEGLCVDVPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRH VRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPP PGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVK IGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDL YCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVK AALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKF TDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAF VNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYL DAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSII NNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIW DYKRDAPAHISTIGVCSMTDIAKKPIETICAPLTVFFDGRVDGQVDLFRNARNGVLITE GSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFK PRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPF ELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTI DYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSAT LPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTL LVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYI CGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYL GKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMI LSLLSKGRLIIRENNRVVISSDVLVNN" mat_peptide 242..781 /gene="ORF1ab" /product="leader protein" mat_peptide 782..2695 /gene="ORF1ab" /product="nsp2" mat_peptide 2696..8530 /gene="ORF1ab" /product="nsp3" mat_peptide 8531..10030 /gene="ORF1ab" /product="nsp4" mat_peptide 10031..10948 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10949..11809 /gene="ORF1ab" /product="nsp6" mat_peptide 11810..12058 /gene="ORF1ab" /product="nsp7" mat_peptide 12059..12652 /gene="ORF1ab" /product="nsp8" mat_peptide 12653..12991 /gene="ORF1ab" /product="nsp9" mat_peptide 12992..13408 /gene="ORF1ab" /product="nsp10" mat_peptide join(13409..13435,13435..16203) /gene="ORF1ab" /product="RNA-dependent RNA polymerase" mat_peptide 16204..18006 /gene="ORF1ab" /product="helicase" mat_peptide 18007..19587 /gene="ORF1ab" /product="3'-to-5' exonuclease" mat_peptide 19588..20625 /gene="ORF1ab" /product="endoRNAse" mat_peptide 20626..21519 /gene="ORF1ab" /product="2'-O-ribose methyltransferase" CDS 242..13450 /gene="ORF1ab" /codon_start=1 /product="ORF1a polyprotein" /protein_id="C_AAB29154.1" /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQH LKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETL GVLVPHVGEIPVAYRKVLLRKNGNKGAGGHRYGADLKSFDLGDELGTDPYEDFQENWNT KHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLVRAGKASCTLSEQLDFI DTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSII KTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKAT CEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRK GGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQKEK VNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAK KGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQY SLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKF KEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLALCADSII IGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLT EEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNT FTLKGGAPTKVTFGDDTVIEVQGYKSVNIIFELDERIDKVLNEKCSAYTVELGTEVNEF ACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDED EEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVG QQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEE AKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSG HNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCV DTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPLVEQR KQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAP YIVGDVVQEGVLTAVVIPTKKASGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAK TVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTI QRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEE AARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQS TQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINL HTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTT DSSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQD AYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCG QQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYE LKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYT TTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCD NIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVW HVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEV VENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKP NELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYM PYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLIN IIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIP CSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAI MQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCN SSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTF ISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVN LDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGD SAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGF VDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHIN AQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIA LKGGKIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTR DIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTT NGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDT NVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAG VCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLFTNMFTPLIQPIGALDISASIVAGGIVA IVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFIVLCLTPVYSFLPGVYSVIYLYLT FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSF STFEEAALCTFLLNKEMYLKLRSDVLLPFTQYNRYLALYNKYKYFSGAMDTTSYREAAC CHLAKALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTL NGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCV LKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSC GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLD MCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWLLL TILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSL ATVAYFNMVYMPASWVMRIMTWLDMVDTSLKLKDCVMYASAVVLLILMTARTVYDDGAR RVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEY CPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMN SQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVES SSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKL EKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLS IIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNS PNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGG RFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNL NRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVK MLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTT CANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQSFLNGFAV" mat_peptide 242..781 /gene="ORF1ab" /product="leader protein" mat_peptide 782..2695 /gene="ORF1ab" /product="nsp2" mat_peptide 2696..8530 /gene="ORF1ab" /product="nsp3" mat_peptide 8531..10030 /gene="ORF1ab" /product="nsp4" mat_peptide 10031..10948 /gene="ORF1ab" /product="3C-like proteinase" mat_peptide 10949..11809 /gene="ORF1ab" /product="nsp6" mat_peptide 11810..12058 /gene="ORF1ab" /product="nsp7" mat_peptide 12059..12652 /gene="ORF1ab" /product="nsp8" mat_peptide 12653..12991 /gene="ORF1ab" /product="nsp9" mat_peptide 12992..13408 /gene="ORF1ab" /product="nsp10" mat_peptide 13409..13447 /gene="ORF1ab" /product="nsp11" stem_loop 13443..13470 /gene="ORF1ab" /note="Coronavirus frameshifting stimulation element stem-loop 1" stem_loop 13455..13509 /gene="ORF1ab" /note="Coronavirus frameshifting stimulation element stem-loop 2" gene 21530..25342 /gene="S" CDS 21530..25342 /gene="S" /codon_start=1 /product="surface glycoprotein" /protein_id="C_AAB29155.1" /translation="MFVFLVLLPLVSSQCVNLITRTQSYTNSFTRGVYYPDKVFRSSVL HSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFG TTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDVYYHENNKSRMESELRVYSSANNC TFEYVSQPFLMDLEGKQGNFENLREFVFKNIDGYFKIYSKHTPVNLGRDLPQGFSALEP LVDLPIGINITRFQTLLALHRSYLTPGDSSSSWTAGAAAYYVGYLQPRTFLLKYNENGT ITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFHEVFN ATTFASVYAWNRTRISNCVADYSVLYNFAPFFAFKCYGVSPTKLNDLCFTNVYADSFVI RGNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKVSGNYNYLYRLFRKSKL KPFERDISTEIYQAGNKPCNGVAGFNCYSPLQSYGFRPTYGVGHQPYRVVVLSFELLHA PATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDP QTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYST GSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHRRARSVASQSIIAYTM SLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQY GSFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKR SFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYT SALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGK IQDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDILSRLDKVEAEVQ IDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMS FPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFY EPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDI SGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIV MVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT" gene 25351..26178 /gene="ORF3a" CDS 25351..26178 /gene="ORF3a" /codon_start=1 /product="ORF3a protein" /protein_id="C_AAB29156.1" /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFGW LIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLEAP FLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNSV TSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQLSTD IGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL" gene 26203..26430 /gene="E" CDS 26203..26430 /gene="E" /codon_start=1 /product="envelope protein" /protein_id="C_AAB29157.1" /translation="MYSFVSEEIGALIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCN IVNVSLVKPSFYVYSRVKNLNSSRVPGLLV" gene 26481..27149 /gene="M" CDS 26481..27149 /gene="M" /codon_start=1 /product="membrane glycoprotein" /protein_id="C_AAB29158.1" /translation="MADSNGTITVEELKKLLEEWNLVIGFLFLTWICLLQFAYANRNRF LYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFA RTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKD LPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ " gene 27160..27345 /gene="ORF6" CDS 27160..27345 /gene="ORF6" /codon_start=1 /product="ORF6 protein" /protein_id="C_AAB29159.1" /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLT ENKYSQLDEEQPMEIL" gene 27352..27717 /gene="ORF7a" CDS 27352..27717 /gene="ORF7a" /codon_start=1 /product="ORF7a protein" /protein_id="C_AAB29160.1" /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSP FHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIV AAIVFITLCFTLKRKTE" gene 27714..27846 /gene="ORF7b" misc_feature 27714..27846 /gene="ORF7b" /note="similar to ORF7b" gene 27853..28218 /gene="ORF8" CDS 27853..28218 /gene="ORF8" /codon_start=1 /product="ORF8 protein" /protein_id="C_AAB29161.1" /translation="MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKW YIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSF YEDFLEYHDVRVVLDFI" gene 28233..29483 /gene="N" CDS 28233..29483 /gene="N" /codon_start=1 /product="nucleocapsid phosphoprotein" /protein_id="C_AAB29152.1" /translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGEARSKQRRPQGLPNN TASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPR WYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTT LPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALALLLLD RLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQG NFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKD PNFKDQVILLNKHIDAYKTFPPIEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDD FSKQLQQSMSRADSTQA" gene 29508..29624 /gene="ORF10" CDS 29508..29624 /gene="ORF10" /codon_start=1 /product="ORF10 protein" /protein_id="C_AAB29153.1" /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT" stem_loop 29559..29594 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 1" stem_loop 29579..29607 /gene="ORF10" /note="Coronavirus 3' UTR pseudoknot stem-loop 2" stem_loop 29678..29692 /note="Coronavirus 3' stem-loop II-like motif (s2m)" ORIGIN 1 taacaaacca accaactttt gatctcttgt agatctgttc tctaaacgaa ctttaaaatc 61 tgtgtggctg tcactcggct gcatgcttag tgcactcacg cagtataatt aataactaat 121 tactgtcgtt gacaggacac gagtaactcg tctatcttct gcaggctgct tacggttttg 181 tccgtgttgc agccgatcat cagcacatct aggttttgtc cgggtgtgac cgaaaggtaa 241 gatggagagc cttgtccctg gtttcaacga gaaaacacac gtccaactca gtttgcctgt 301 tttacaggtt cgtgacgtgc tcgtacgtgg ctttggagac tccgtggagg aggtcttatc 361 agaggcacgt caacatctta aagatggcac ttgtggctta gtagaagttg aaaaaggcgt 421 tttgcctcaa cttgaacagc cctatgtgtt catcaaacgt tcggatgctc gaactgcacc 481 tcatggtcat gttatggttg agctggtagc agaactcgaa ggcattcagt acggtcgtag 541 tggtgagaca cttggtgtcc ttgtccctca tgtgggcgaa ataccagtgg cttaccgcaa 601 ggttcttctt cgtaagaacg gtaataaagg agctggtggc cataggtacg gcgccgatct 661 aaagtcattt gacttaggcg acgagcttgg cactgatcct tatgaagatt ttcaagaaaa 721 ctggaacact aaacatagca gtggtgttac ccgtgaactc atgcgtgagc ttaacggagg 781 ggcatacact cgctatgtcg ataacaactt ctgtggccct gatggctacc ctcttgagtg 841 cattaaagac cttctagtac gtgctggtaa agcttcatgc actttgtccg aacaactgga 901 ctttattgac actaagaggg gtgtatactg ctgccgtgaa catgagcatg aaattgcttg 961 gtacacggaa cgttctgaaa agagctatga attgcagaca ccttttgaaa ttaaattggc 1021 aaagaaattt gacaccttca atggggaatg tccaaatttt gtatttccct taaattccat 1081 aatcaagact attcaaccaa gggttgaaaa gaaaaagctt gatggcttta tgggtagaat 1141 tcgatctgtc tatccagttg cgtcaccaaa tgaatgcaac caaatgtgcc tttcaactct 1201 catgaagtgt gatcattgtg gtgaaacttc atggcagacg ggcgattttg ttaaagccac 1261 ttgcgaattt tgtggcactg agaatttgac taaagaaggt gccactactt gtggttactt 1321 accccaaaat gctgttgtta aaatttattg tccagcatgt cacaattcag aagtaggacc 1381 tgagcatagt cttgccgaat accataatga atctggcttg aaaaccattc ttcgtaaggg 1441 tggtcgtact attgcctttg gaggctgtgt gttctcttat gttggttgtc ataacaagtg 1501 tgcctattgg gttccacgtg ctagcgctaa cataggttgt aaccatacag gtgttgttgg 1561 agaaggttcc gaaggtctta atgacaacct tcttgaaata ctccaaaaag agaaagtcaa 1621 catcaatatt gttggtgact ttaaacttaa tgaagagatc gccattattt tggcatcttt 1681 ttctgcttcc acaagtgctt ttgtggaaac tgtgaaaggt ttggattata aagcattcaa 1741 acaaattgtt gaatcctgtg gtaattttaa agttacaaaa ggaaaagcta aaaaaggtgc 1801 ctggaatatt ggtgaacaga aatcaatact gagtcctctt tatgcatttg catcagaggc 1861 tgctcgtgtt gtacgatcaa ttttctcccg cactcttgaa actgctcaaa attctgtgcg 1921 tgttttacag aaggccgcta taacaatact agatggaatt tcacagtatt cactgagact 1981 cattgatgct atgatgttca catctgattt ggctactaac aatctagttg taatggccta 2041 cattacaggt ggtgttgttc agttgacttc gcagtggcta actaacatct ttggcactgt 2101 ttatgaaaaa ctcaaacccg tccttgattg gcttgaagag aagtttaagg aaggtgtaga 2161 gtttcttaga gacggttggg aaattgttaa atttatctca acctgtgctt gtgaaattgt 2221 cggtggacaa attgtcacct gtgcaaagga aattaaggag agtgttcaga cattctttaa 2281 gcttgtaaat aaatttttgg ctttgtgtgc tgactctatc attattggtg gagctaaact 2341 taaagccttg aatttaggtg aaacatttgt cacgcactca aagggattgt acagaaagtg 2401 tgttaaatcc agagaagaaa ctggcctact catgcctcta aaagccccaa aagaaattat 2461 cttcttagag ggagaaacac ttcccacaga agtgttaaca gaggaagttg tcttgaaaac 2521 tggtgattta caaccattag aacaacctac tagtgaagct gttgaagctc cattggttgg 2581 tacaccagtt tgtattaacg ggcttatgtt gctcgaaatc aaagacacag aaaagtactg