SNHG20
Annotated Information
Name
Approved symbol: SNHG20
Approved name: small nucleolar RNA host gene 20
HGNC ID: HGNC:33099
Previous symbols: C17orf86; NCRNA00338; LINC00338
Previous names: chromosome 17 open reading frame 86; non-protein coding RNA 338; long intergenic non-protein coding RNA 338; small nucleolar RNA host gene 20 (non-protein coding)
Alias symbols: PRO0872; FLJ25582; DKFZp686L05235; SCARNA16HG
RefSeq ID: NR_027058
Ensembl ID: ENSG00000234912
LncBook ID: HSALNT0245543
Chromosome
17q25.2
Disease
Hepatocellular Carcinoma
PubMed IDs
16373490
Sequence
>gi|224451087|ref|NR_027058.1| Homo sapiens small nucleolar RNA host gene 20 (SNHG20), long non-coding RNA
000001 AAGTTGCTGA CGGAGCTACT TCCGCCCAGG GGAGTGGGAA GTAAGTGGAG ACACGTGCTT TGGCCTGTTG GAGGGGAAAC 000080
000081 CCGCTCTCGC CTCCTGGTGG TCGCCGACTC GCAGTCCGCA GGATGACTCA GGGCAGCCCT GACCACAGTT CCGCCGCCAT 000160
000161 CGGCCCTAGC CTGCGGAATT GGGCTCGCCC CGGGACGATA ACAGAGCTCT GCCGGGGGCT GGAGGCACTG ACCGGGTGAC 000240
000241 CAGAGACCCA GAGACCAGAC CCCCTCCACG GCGCCCGGGA TTTCGGGGAC GGCTTCTCCC ATCGCAAGTT TCAACAGAAA 000320
000321 ATGAGAAATA TCCCCCGACG ATTGGCTGAA AACATGCAGC AACCACTATT TTCTTCCTGC CCCTCGTTGA TGAGAGCATT 000400
000401 CGAAGTGACC TCAGCAGGGC ATCCAGGTCA GTTTCTGGAA GACTTGTGTG TGTGATGAAT GAATATCTGG TTTTGTCTCT 000480
000481 GCTGGGCCTG TGTGCCTGGA AAGGAATTGT TTTGGCCTAG GATCATCCAG GTTTGTTTGG TTTAGTTCTT GACCACATAT 000560
000561 TTTTTGAATG GTGACTGCTT AAGACCCTGT TGTGTATGGC TATAAATAGA TACACGCCAA GGTGACCACA TACTCTGTGG 000640
000641 TTCCTGTGCC CGCCGGCCAT GTGTCTGAGT GTGTAGCCTC TCATCATCTA AAGGAACTTT GGCTGCAGAG GGGAGGCCTG 000720
000721 GTCCCATGGG AAGTTTTGGG AGCGCAGCAG CAGGTGGGTT CAAACGACCC AGCAAGTGCC TCCTTAGCAC TTAGAGGCGG 000800
000801 AGGCCACCAC GTTGTCCACG TGGGGTTTCT GATCCCAGCT CCCCACCAGC CTGCTGGACC TCGGGCAGGT CCCTCCCTGT 000880
000881 TTGTACCTCC ATTACTTCTT CAGTAAGATG GGGACACTGA AGATGACCAT GCCTCCCACC AGATTGGTGC TTTTTTTGTT 000960
000961 TTAATAAGGG ACAGAGTCTC ACTATGTTGC CTATAGGCTG GTCTCAAGCT CTTGGCCTCA GTTTTCCTGC CTCAGCCTTC 001040
001041 AAATATGCTG GGGTGACAGG CATGAGCCAC TGCACTGGCC CAGATTGGTA CATTTGAGGG TTAAATGAGC AAATCCTATA 001120
001121 AAGCACTCAG GAGAGGGCTG TGGGGTTTGG GCTGGGGCCT GGGAGCACTT TAGACGTGGT AGCTATGTTG TTGTCACCTT 001200
001201 TCTCCTCTTT CAAAGCATTT TTTGTTTGTT TGTTTTTGTT TTTGAGACAG AGTTTCGCTC TTGTGGCCCA GGCTGGAGTG 001280
001281 CGGTGGTGTG ACCTCGGCTC ACTGCATCCT CCGCCTCCCG GGTTCAAGTG ATGGGATTAC AGGTGCCTGC CACAACGCCC 001360
001361 AGCTAATTGT TTGTATTTTT AGTAGAGACG AGGTTTCACC ATGTTGGCCA GGCTGGTCTC GAACTCCTGA CCTCAGGTGA 001440
001441 TCCACCTGCC TTGGCCTCCC AAAGTGCTGG GATTACAGGT ATGAGCTACC GCGCCCGGCA CAAAGCATTT TCAGCTGTGA 001520
001521 AATTCCCTGG GGTGCTGCAG TAGAGGAAGG TTTGGGGTCT TGACGGGTGC TATTTGCCAC GGAAAGATGC CTTCCTGCTC 001600
001601 CCAGGGCAGG AGAGCCGAGG TAAGACTTAC TGTAGGCTGT CGTTTTTTTT GTTTGTTTTT TGTCTTTGCG ATGGAGTCTC 001680
001681 ACTCTGTCGC CAGGCTGGAG TGCAGTGGCA TGATCTTGGC TCACTGCAGA ACCTCCACCT CCCAGGTTCA AGCGATTCTC 001760
001761 CTGCCTCAGC CTCCCAAGTA ACTGGGATTA CAGGCACATG CCCCCACAAC CAGCTAATTT TTTATTTTTA GTAGAGACAG 001840
001841 GGTTTCACAT GTTGGCCAGG CTGGTCTTGA ACTCCTGACC TCAGGTGATC CGCCCGCCTC GGCCTCCCAA AGTGCTGGGA 001920
001921 TTACAGACAT GAGCCACTGC GCCCAGCCAG GCTGTTGTTT TTTTACCTCC TTGTTTGCAC AATTTGGGCC ACTCACAAGA 002000
002001 GTGTATACCC TGTGATAAAC AGTTACCTAC ATTCTCCTCT GCATGCTTGT CTTTAGAGGA AGGAAATGTA TTAATTGCCC 002080
002081 AAAGTAATAT ATTGTGTTAA GATGTGATAT ATACTGGGGA AAAAAAAAGT GTATATTGAC ATTTCTGGAA TAAACCACTT 002160
002161 TGATTCCCAA AAAAAAAAAA AAA
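The listing above wraps the transcript into rows of eight 10-base blocks, with a six-digit position counter at each end. A minimal sketch of how such rows could be collapsed into a plain sequence string follows. The function name `parse_numbered_rows` is hypothetical, and the sketch assumes the rows contain only A/C/G/T/N, counters, and spaces. Only the first and last rows of the listing are embedded here.

```python
import re

def parse_numbered_rows(rows):
    """Join position-numbered sequence rows into one uppercase string.

    Assumption: everything that is not a nucleotide letter
    (position counters, spaces) is layout and can be dropped.
    """
    return "".join(re.sub(r"[^ACGTNacgtn]", "", row) for row in rows).upper()

# First and last rows copied from the listing above.
first_row = ("000001 AAGTTGCTGA CGGAGCTACT TCCGCCCAGG GGAGTGGGAA "
             "GTAAGTGGAG ACACGTGCTT TGGCCTGTTG GAGGGGAAAC 000080")
last_row = "002161 TGATTCCCAA AAAAAAAAAA AAA"

first_seq = parse_numbered_rows([first_row])
last_seq = parse_numbered_rows([last_row])

print(len(first_seq))  # 80
print(len(last_seq))   # 23

# The last row starts at position 2161, so the listed transcript
# spans 2160 + 23 = 2183 bases.
total_length = 2160 + len(last_seq)
print(total_length)    # 2183
```

Feeding every row of the listing to the same function would return the complete transcript as a single string.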
Predicted Small Protein
Name | SNHG20_smProtein_1586:1780 |
Length (aa) | 64 |
Molecular weight (Da) | 7131.3835 |
Aromaticity | 0.078125 |
Instability index | 66.890625 |
Isoelectric point | 9.13336181641 |
Runs | 12 |
Runs residual | 0.03890625 |
Runs probability | 0.0389654360243 |
Amino acid sequence | MPSCSQGRRAEVRLTVGCRFFCLFFVFAMESHSVARLECSGMILAHCRTSTSQVQAILLP QPPK |
Secondary structure | LLLLLLLLEEEEEEEELLLEEEEEEEEEELLLLEEEEEELLLEEEELLLLLHHHHEEELL LLLL |
PRMN | LLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLL LLLL |
PiMo | iiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooooooooooooooooooooooo oooo |
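The values in the table above can be cross-checked against the transcript. The sketch below rests on one assumption that the record does not state: the coordinates in the name `SNHG20_smProtein_1586:1780` are read as a 0-based start and an inclusive 0-based end that covers the stop codon, which corresponds to bases 1587 to 1781 of the listing (195 nt). The sketch translates that slice with the standard genetic code, compares the result with the listed amino acid sequence, and recomputes length and aromaticity (taken here as the fraction of F, W, and Y residues). The names `ORF_ROWS`, `translate`, and `LISTED_PROTEIN` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Bases 1587..1781 of the transcript listing, copied block by block.
ORF_ROWS = [
    "ATGC CTTCCTGCTC",
    "CCAGGGCAGG AGAGCCGAGG TAAGACTTAC TGTAGGCTGT "
    "CGTTTTTTTT GTTTGTTTTT TGTCTTTGCG ATGGAGTCTC",
    "ACTCTGTCGC CAGGCTGGAG TGCAGTGGCA TGATCTTGGC "
    "TCACTGCAGA ACCTCCACCT CCCAGGTTCA AGCGATTCTC",
    "CTGCCTCAGC CTCCCAAGTA A",
]
orf = "".join(ORF_ROWS).replace(" ", "")

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# Amino acid sequence as listed in the table (wrap space removed).
LISTED_PROTEIN = ("MPSCSQGRRAEVRLTVGCRFFCLFFVFAMESHSVARLECSGMILAHCRTSTSQVQAILLP"
                  "QPPK")

protein = translate(orf)
aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(len(orf))                   # 195
print(protein == LISTED_PROTEIN)  # True
print(len(protein))               # 64
print(aromaticity)                # 0.078125
```

Under this reading of the coordinates, the translated product, its length, and its aromaticity agree with the table rows. The molecular weight, instability index, and isoelectric point depend on the scale chosen by the prediction tool and are not recomputed here.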