LINC00470
Contents
Annotated Information
Approved Symbol
LINC00470
Approved Name
long intergenic non-protein coding RNA 470
Previous Symbols
C18orf2
Synonyms
_
Chromosome
18p11.32
RefSeq ID
NR_023925
OMIM ID
_
Ensembl ID
ENSG00000132204
pubmed IDs
11173868
Sequence
>gi|193083199|ref|NR_023925.1| Homo sapiens long intergenic non-protein coding RNA 470 (LINC00470), transcript variant 1, long non-coding RNA
000001 AATGCCTTAA AGAAAGGACT GATCTCCCCT GAAGAAGAGA GAATTCTGCC TCTAGATTGC TTACAATTTG AACTGGAACA 000080
000081 TCAGCTGTTC TCTAGGTCTT AAGTCTGCTG CTCTAACCTA CAAATTTTGG ACATGCCAAT CTCCACAGTC ACATGGGCCA 000160
000161 ATTCCTTAGT TTCTCTGGAG AATCCTGACT AATGTATGCA ATGTCTTTAT TTCATCCTCA TTCTTAAAGG ATATTTTCAT 000240
000241 CAGCAACACC ACACCCTCCT GATCACCATA GCTTTATGAT CTGAGCTTTC ATCTGGTATC ATTTCCTTTC CGTCTGAAGA 000320
000321 ACTTGCTTTC CTTGCAACAA CAAGAAACAA ATTTATTAGC TAACCTAACC ACTAATGACG CAAGAGACAA TTCTAAGGAC 000400
000401 TTTCAAAACA GCAAAGTAGG AGCAGCTGCT ACCTCTAGGG ATGAGGGAGA AACTCAAAAA TGCATACAAA AACCATTGCA 000480
000481 TGAAAAGTGA CTGGATTTGT ACATAGCGTC AGGAGATGTG CGTAGTGTCA AAGTATCTCA TCACACATTA CTTGATAATT 000560
000561 ACAAATGAGA AAAATGAACC TTCACAGTGG CAAGACTTGA CTTCTACCTC TTTCAAAAAG ATGCAATTGT CCAATTATTG 000640
000641 GTGAAATTGT CATTTCATGC TATTGGCTAT TTGAAATTCC TCCTCTAATT TCAGAATAAA TCACTGAAAT TGACATCGGC 000720
000721 CAGTCTGAAT TTCAAGAAAT TACCTGCTGA AGACAAGAGG GATCTCTTCT TCAGATTTGC AGTCTGGGGA AGACACAGCC 000800
000801 TCTACTGTAC TTTAGAACCT GAGATATGGT GGTGGAGGGA GCCCTGGGTC GAGTGGTAAG ATTCACCCTT AGGTTAGTAT 000880
000881 TGACGTAAGG TGACGAGGAG CTGTAGACAA AAGATTGTAA CCATAAGAAC TTCATAGTTT TTGTATTTTC ACCGAGCTTA 000960
000961 TATTTGGTGT GTTTTTTGTC TTTTCTTTAT GATTATCAAT AAAATGCTTG AAAGGAGATG AGGTTGGGGA ATAATTTTTG 001040
001041 GGAATACCAC AAAAGACACT TTTGTGATGG AAATCCTTAA AAAGACACAA TCCATTACCT CATTGGGTTC AAAAGGCAAT 001120
001121 TGTGAACTAC TGTGGAGTTT GGAAAGAAGC AATGAGGTAA TCAAGGATAC TGTTGACAAT CTAGCTTATC CTATGGATGG 001200
001201 AAGGAAATTG AAACTAATGG AGGCGAGGCT GGTAAAACAA TAGGGTTTGA GACAATTCTG TGGCATTAGA AATGAAAGAG 001280
001281 GAAGGTGAAT CCGGGAGACG GAGCTTGCAG TGAGCTGAGA TCGCGCCACT GCACTCCAGC CTGGGCGACA GAGCGAGACT 001360
001361 CCATCTCAAA AAAAAAAAAA AAGAAAGAAA GAAAAGAAAG AGGAAGTAAT AATCTGTGAA ATTTTTCCTT AGGAACTTAT 001440
001441 TGGCAATTTA AAAATGAATT TGTTAAGCCA TGCTGGTTCT GACCCAAAAG CCATTCCCCA GCCTTCCTCA CTCCCCTCTT 001520
001521 TCACTACTGG CAGAGATTGT CTCTCATTTT ACAAGCTGAA AATGCCAGAT GCTTGCTTTT ACAGTCTTCC TTACACCCAG 001600
001601 AGCATGTGCA TATGTTTAAA CGGTCAAGAA GAAGTCATAA CATGGGTGTC TGGGAGAGCT TTTATCCCAC AAAAACAACA 001680
001681 CTTCACTCAA AAAACAAACC AAACAAAGAA AAATTCTCCT TCCTGCCATT GGATGTGAGG CTCAGACCTC TAGTAACCAT 001760
001761 TTTGTGACCA CAAAGCAACA AGCCTGAGGA AAAGTCCTAC ACGCTGAGCA ACAGGCAGAA ATATTGCCAT CGCTGAGTTG 001840
001841 CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG ATTACAAGTG GGTTTACTGT TCACTTTTCA 001920
001921 AATCTTATTT CTCTATGATG TTTAGCTTGG GTAAATTTTA CCTTAAATCC ACTTTTTTAT GTAAGGTAAC ATATTTGTCG 002000
002001 GTTTCAAGGA TTAAGATGTG GGCATACTTG GAGGCCATTA TTTTGCCCAC CACAGGTGAA AAAGGAAGTG TTATTCTTAA 002080
002081 ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC TGTCATTCTT TTGATATGGA CTCGAATTTC 002160
002161 CACTTCATGG TTGTCTGCTT CCTTTTTAGA GTATTATTTA TCCTCCTAAT AAAAAGAAAG TGAAATTTCC C
000081 TCAGCTGTTC TCTAGGTCTT AAGTCTGCTG CTCTAACCTA CAAATTTTGG ACATGCCAAT CTCCACAGTC ACATGGGCCA 000160
000161 ATTCCTTAGT TTCTCTGGAG AATCCTGACT AATGTATGCA ATGTCTTTAT TTCATCCTCA TTCTTAAAGG ATATTTTCAT 000240
000241 CAGCAACACC ACACCCTCCT GATCACCATA GCTTTATGAT CTGAGCTTTC ATCTGGTATC ATTTCCTTTC CGTCTGAAGA 000320
000321 ACTTGCTTTC CTTGCAACAA CAAGAAACAA ATTTATTAGC TAACCTAACC ACTAATGACG CAAGAGACAA TTCTAAGGAC 000400
000401 TTTCAAAACA GCAAAGTAGG AGCAGCTGCT ACCTCTAGGG ATGAGGGAGA AACTCAAAAA TGCATACAAA AACCATTGCA 000480
000481 TGAAAAGTGA CTGGATTTGT ACATAGCGTC AGGAGATGTG CGTAGTGTCA AAGTATCTCA TCACACATTA CTTGATAATT 000560
000561 ACAAATGAGA AAAATGAACC TTCACAGTGG CAAGACTTGA CTTCTACCTC TTTCAAAAAG ATGCAATTGT CCAATTATTG 000640
000641 GTGAAATTGT CATTTCATGC TATTGGCTAT TTGAAATTCC TCCTCTAATT TCAGAATAAA TCACTGAAAT TGACATCGGC 000720
000721 CAGTCTGAAT TTCAAGAAAT TACCTGCTGA AGACAAGAGG GATCTCTTCT TCAGATTTGC AGTCTGGGGA AGACACAGCC 000800
000801 TCTACTGTAC TTTAGAACCT GAGATATGGT GGTGGAGGGA GCCCTGGGTC GAGTGGTAAG ATTCACCCTT AGGTTAGTAT 000880
000881 TGACGTAAGG TGACGAGGAG CTGTAGACAA AAGATTGTAA CCATAAGAAC TTCATAGTTT TTGTATTTTC ACCGAGCTTA 000960
000961 TATTTGGTGT GTTTTTTGTC TTTTCTTTAT GATTATCAAT AAAATGCTTG AAAGGAGATG AGGTTGGGGA ATAATTTTTG 001040
001041 GGAATACCAC AAAAGACACT TTTGTGATGG AAATCCTTAA AAAGACACAA TCCATTACCT CATTGGGTTC AAAAGGCAAT 001120
001121 TGTGAACTAC TGTGGAGTTT GGAAAGAAGC AATGAGGTAA TCAAGGATAC TGTTGACAAT CTAGCTTATC CTATGGATGG 001200
001201 AAGGAAATTG AAACTAATGG AGGCGAGGCT GGTAAAACAA TAGGGTTTGA GACAATTCTG TGGCATTAGA AATGAAAGAG 001280
001281 GAAGGTGAAT CCGGGAGACG GAGCTTGCAG TGAGCTGAGA TCGCGCCACT GCACTCCAGC CTGGGCGACA GAGCGAGACT 001360
001361 CCATCTCAAA AAAAAAAAAA AAGAAAGAAA GAAAAGAAAG AGGAAGTAAT AATCTGTGAA ATTTTTCCTT AGGAACTTAT 001440
001441 TGGCAATTTA AAAATGAATT TGTTAAGCCA TGCTGGTTCT GACCCAAAAG CCATTCCCCA GCCTTCCTCA CTCCCCTCTT 001520
001521 TCACTACTGG CAGAGATTGT CTCTCATTTT ACAAGCTGAA AATGCCAGAT GCTTGCTTTT ACAGTCTTCC TTACACCCAG 001600
001601 AGCATGTGCA TATGTTTAAA CGGTCAAGAA GAAGTCATAA CATGGGTGTC TGGGAGAGCT TTTATCCCAC AAAAACAACA 001680
001681 CTTCACTCAA AAAACAAACC AAACAAAGAA AAATTCTCCT TCCTGCCATT GGATGTGAGG CTCAGACCTC TAGTAACCAT 001760
001761 TTTGTGACCA CAAAGCAACA AGCCTGAGGA AAAGTCCTAC ACGCTGAGCA ACAGGCAGAA ATATTGCCAT CGCTGAGTTG 001840
001841 CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG ATTACAAGTG GGTTTACTGT TCACTTTTCA 001920
001921 AATCTTATTT CTCTATGATG TTTAGCTTGG GTAAATTTTA CCTTAAATCC ACTTTTTTAT GTAAGGTAAC ATATTTGTCG 002000
002001 GTTTCAAGGA TTAAGATGTG GGCATACTTG GAGGCCATTA TTTTGCCCAC CACAGGTGAA AAAGGAAGTG TTATTCTTAA 002080
002081 ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC TGTCATTCTT TTGATATGGA CTCGAATTTC 002160
002161 CACTTCATGG TTGTCTGCTT CCTTTTTAGA GTATTATTTA TCCTCCTAAT AAAAAGAAAG TGAAATTTCC C
Predicted Small Protein
Name | LINC00470_smProtein_1883:2104 |
Length | 73 |
Molecular weight | 8461.072 |
Aromaticity | 0.191780821918 |
Instability index | 13.9932876712 |
Isoelectric point | 9.65484619141 |
Runs | 13 |
Runs residual | 0.0339372514361 |
Runs probability | 0.0446713682008 |
Amino acid sequence | MGDYKWVYCSLFKSYFSMMFSLGKFYLKSTFLCKVTYLSVSRIKMWAYLEAIILPTTGEK GSVILKSFGRISV |
Secondary structure | LLLEEEEEHHHHHHHHHHHHHHLHHHEEEEEEEEEEEEEHHHHHHHHHHHHHEELLLLLL LEEEEELLLEEEL |
PRMN | LLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLL LLLLLLLLLLLLL |
PiMo | iiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooooooooooo ooooooooooooo |