LINC00470

From LncRNAWiki
Revision as of 12:10, 11 August 2019 by Lin Liu (talk | contribs)

Annotated Information

Name

Approved symbol: LINC00470

Approved name: long intergenic non-protein coding RNA 470

HGNC ID: HGNC:1225

Previous names: chromosome 18 open reading frame 2

Previous symbols: C18orf2

RefSeq ID: NR_023925

LncBook ID: HSALNT0246856

Chromosome

18p11.32

Ensembl ID

ENSG00000132204

Sequence

>gi|193083199|ref|NR_023925.1| Homo sapiens long intergenic non-protein coding RNA 470 (LINC00470), transcript variant 1, long non-coding RNA

000001 AATGCCTTAA AGAAAGGACT GATCTCCCCT GAAGAAGAGA GAATTCTGCC TCTAGATTGC TTACAATTTG AACTGGAACA 000080
000081 TCAGCTGTTC TCTAGGTCTT AAGTCTGCTG CTCTAACCTA CAAATTTTGG ACATGCCAAT CTCCACAGTC ACATGGGCCA 000160
000161 ATTCCTTAGT TTCTCTGGAG AATCCTGACT AATGTATGCA ATGTCTTTAT TTCATCCTCA TTCTTAAAGG ATATTTTCAT 000240
000241 CAGCAACACC ACACCCTCCT GATCACCATA GCTTTATGAT CTGAGCTTTC ATCTGGTATC ATTTCCTTTC CGTCTGAAGA 000320
000321 ACTTGCTTTC CTTGCAACAA CAAGAAACAA ATTTATTAGC TAACCTAACC ACTAATGACG CAAGAGACAA TTCTAAGGAC 000400
000401 TTTCAAAACA GCAAAGTAGG AGCAGCTGCT ACCTCTAGGG ATGAGGGAGA AACTCAAAAA TGCATACAAA AACCATTGCA 000480
000481 TGAAAAGTGA CTGGATTTGT ACATAGCGTC AGGAGATGTG CGTAGTGTCA AAGTATCTCA TCACACATTA CTTGATAATT 000560
000561 ACAAATGAGA AAAATGAACC TTCACAGTGG CAAGACTTGA CTTCTACCTC TTTCAAAAAG ATGCAATTGT CCAATTATTG 000640
000641 GTGAAATTGT CATTTCATGC TATTGGCTAT TTGAAATTCC TCCTCTAATT TCAGAATAAA TCACTGAAAT TGACATCGGC 000720
000721 CAGTCTGAAT TTCAAGAAAT TACCTGCTGA AGACAAGAGG GATCTCTTCT TCAGATTTGC AGTCTGGGGA AGACACAGCC 000800
000801 TCTACTGTAC TTTAGAACCT GAGATATGGT GGTGGAGGGA GCCCTGGGTC GAGTGGTAAG ATTCACCCTT AGGTTAGTAT 000880
000881 TGACGTAAGG TGACGAGGAG CTGTAGACAA AAGATTGTAA CCATAAGAAC TTCATAGTTT TTGTATTTTC ACCGAGCTTA 000960
000961 TATTTGGTGT GTTTTTTGTC TTTTCTTTAT GATTATCAAT AAAATGCTTG AAAGGAGATG AGGTTGGGGA ATAATTTTTG 001040
001041 GGAATACCAC AAAAGACACT TTTGTGATGG AAATCCTTAA AAAGACACAA TCCATTACCT CATTGGGTTC AAAAGGCAAT 001120
001121 TGTGAACTAC TGTGGAGTTT GGAAAGAAGC AATGAGGTAA TCAAGGATAC TGTTGACAAT CTAGCTTATC CTATGGATGG 001200
001201 AAGGAAATTG AAACTAATGG AGGCGAGGCT GGTAAAACAA TAGGGTTTGA GACAATTCTG TGGCATTAGA AATGAAAGAG 001280
001281 GAAGGTGAAT CCGGGAGACG GAGCTTGCAG TGAGCTGAGA TCGCGCCACT GCACTCCAGC CTGGGCGACA GAGCGAGACT 001360
001361 CCATCTCAAA AAAAAAAAAA AAGAAAGAAA GAAAAGAAAG AGGAAGTAAT AATCTGTGAA ATTTTTCCTT AGGAACTTAT 001440
001441 TGGCAATTTA AAAATGAATT TGTTAAGCCA TGCTGGTTCT GACCCAAAAG CCATTCCCCA GCCTTCCTCA CTCCCCTCTT 001520
001521 TCACTACTGG CAGAGATTGT CTCTCATTTT ACAAGCTGAA AATGCCAGAT GCTTGCTTTT ACAGTCTTCC TTACACCCAG 001600
001601 AGCATGTGCA TATGTTTAAA CGGTCAAGAA GAAGTCATAA CATGGGTGTC TGGGAGAGCT TTTATCCCAC AAAAACAACA 001680
001681 CTTCACTCAA AAAACAAACC AAACAAAGAA AAATTCTCCT TCCTGCCATT GGATGTGAGG CTCAGACCTC TAGTAACCAT 001760
001761 TTTGTGACCA CAAAGCAACA AGCCTGAGGA AAAGTCCTAC ACGCTGAGCA ACAGGCAGAA ATATTGCCAT CGCTGAGTTG 001840
001841 CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG ATTACAAGTG GGTTTACTGT TCACTTTTCA 001920
001921 AATCTTATTT CTCTATGATG TTTAGCTTGG GTAAATTTTA CCTTAAATCC ACTTTTTTAT GTAAGGTAAC ATATTTGTCG 002000
002001 GTTTCAAGGA TTAAGATGTG GGCATACTTG GAGGCCATTA TTTTGCCCAC CACAGGTGAA AAAGGAAGTG TTATTCTTAA 002080
002081 ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC TGTCATTCTT TTGATATGGA CTCGAATTTC 002160
002161 CACTTCATGG TTGTCTGCTT CCTTTTTAGA GTATTATTTA TCCTCCTAAT AAAAAGAAAG TGAAATTTCC C
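The listing above uses a numbered layout: a 6-digit start coordinate, groups of 10 bases, and a 6-digit end coordinate on every full row. A minimal sketch of flattening such rows into a plain string is shown below. The function name `flatten_numbered_rows` is hypothetical, and only the first two rows are pasted in for illustration.

```python
def flatten_numbered_rows(text: str) -> str:
    """Join coordinate-numbered sequence rows into one uppercase string."""
    sequence = ""
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and a FASTA-style header line.
        if not tokens or tokens[0].startswith(">"):
            continue
        # A leading all-digit token is read as the 1-based coordinate
        # of the row's first base; check it against what we have so far.
        if tokens[0].isdigit() and int(tokens[0]) != len(sequence) + 1:
            raise ValueError(
                f"row starts at {tokens[0]}, expected {len(sequence) + 1}"
            )
        # Keep only the base groups, dropping the coordinate tokens.
        sequence += "".join(t for t in tokens if not t.isdigit()).upper()
    return sequence


# First two rows of the listing, copied as shown on this page.
rows = """
000001 AATGCCTTAA AGAAAGGACT GATCTCCCCT GAAGAAGAGA GAATTCTGCC TCTAGATTGC TTACAATTTG AACTGGAACA 000080
000081 TCAGCTGTTC TCTAGGTCTT AAGTCTGCTG CTCTAACCTA CAAATTTTGG ACATGCCAAT CTCCACAGTC ACATGGGCCA 000160
"""

first_rows = flatten_numbered_rows(rows)
print(len(first_rows), first_rows[:10])
```

The coordinate check is a design choice: it catches a dropped or duplicated row when the listing is copied by hand. The final row of the listing has no trailing coordinate, which this sketch tolerates because it only validates the leading one.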

Predicted Small Protein

Name: LINC00470_smProtein_1883:2104
Length: 73
Molecular weight: 8461.072
Aromaticity: 0.191780821918
Instability index: 13.9932876712
Isoelectric point: 9.65484619141
Runs: 13
Runs residual: 0.0339372514361
Runs probability: 0.0446713682008
Amino acid sequence: MGDYKWVYCSLFKSYFSMMFSLGKFYLKSTFLCKVTYLSVSRIKMWAYLEAIILPTTGEKGSVILKSFGRISV
Secondary structure: LLLEEEEEHHHHHHHHHHHHHHLHHHEEEEEEEEEEEEEHHHHHHHHHHHHHEELLLLLLLEEEEELLLEEEL
PRMN: LLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo: iiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooooooooooooooooooooooooo
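The suffix `1883:2104` in the small protein's name appears to locate the open reading frame within the transcript. The sketch below reads it as 0-based offsets, so the ATG would sit at 1-based position 1884. That reading is an assumption, not something this page states. The code translates from that position with the standard genetic code, compares the result with the listed amino acid sequence, and recomputes aromaticity as the fraction of F, W and Y residues. This is an independent cross-check and not necessarily how the values in the table were produced. All names in the code are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Transcript rows 001841-002160, copied from the Sequence section.
WINDOW_START = 1841  # 1-based coordinate of the first base in WINDOW
WINDOW = (
    "CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG ATTACAAGTG GGTTTACTGT TCACTTTTCA "
    "AATCTTATTT CTCTATGATG TTTAGCTTGG GTAAATTTTA CCTTAAATCC ACTTTTTTAT GTAAGGTAAC ATATTTGTCG "
    "GTTTCAAGGA TTAAGATGTG GGCATACTTG GAGGCCATTA TTTTGCCCAC CACAGGTGAA AAAGGAAGTG TTATTCTTAA "
    "ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC TGTCATTCTT TTGATATGGA CTCGAATTTC"
).replace(" ", "")

# Assumption: "1883" in the protein name is a 0-based offset,
# which corresponds to 1-based position 1884.
ORF_START = 1884

# Amino acid sequence as listed in the table (two wrapped lines joined).
LISTED_PROTEIN = (
    "MGDYKWVYCSLFKSYFSMMFSLGKFYLKSTFLCKVTYLSVSRIKMWAYLEAIILPTTGEK"
    "GSVILKSFGRISV"
)


def translate_until_stop(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


orf = WINDOW[ORF_START - WINDOW_START:]
translated = translate_until_stop(orf)

# Aromaticity taken here as the relative frequency of Phe, Trp and Tyr.
aromaticity = sum(translated.count(r) for r in "FWY") / len(translated)

print(translated == LISTED_PROTEIN, len(translated), aromaticity)
```

Under this reading the translation reproduces the listed 73-residue sequence, and 14 aromatic residues out of 73 give the listed aromaticity. The stop codon then ends at 1-based position 2105, which matches the `2104` in the name if that is also a 0-based offset.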