LINC00471

From LncRNAWiki
Jump to: navigation, search

Annotated Information

Approved Symbol

LINC00471

Approved Name

long intergenic non-protein coding RNA 471

Previous Symbols

C2orf52

Synonyms

MGC43122

Chromosome

2q37.1

RefSeq ID

NM_173513

OMIM ID

_

Ensembl ID

ENSG00000181798

pubmed IDs

12477932

Sequence

>gi|34222230|ref|NM_173513.2| Homo sapiens chromosome 2 open reading frame 52 (C2orf52), mRNA

000001 GAGGGCGTCA CCAGCGTGTC GGCGCCTGCG GTTTTCTTCG GGCTGTCCCC AGGCGTCGTG CCCGCGTCCG TCACGCGCGG 000080
000081 TGGCTTGCAG GGGCGCACCG CGACAGCCTT AAAGGAAAAC CTGTTAGGGA CAAGGGCTGC TGTGGGCAGC GAGGGTTATA 000160
000161 GACTCATGTG TGTTCTGAAT TCTGGTGGGG GTAGTGATGG ACGCTGACCC AGTTCACAGA ATTATCTGAC TGGCGAAGTA 000240
000241 AAGGTACTTG AATTGCAGGA AAAAAGGTCA AGGTGAACTG AAGAAAATAA GTCTTTCCCC AAAGGAGGTA GAAGATGGGG 000320
000321 AAGATGAGAT GTCAGAAGCT AAGGACAATG GCAGTAGAGA TGAGGTTCTT GTCCCTCATA AGAATTGCAG GAAGAATACC 000400
000401 ACTGTCCCGG GAAAGAAAGG GGAGGAAAAG TCTCTGGCTC CTGTGTTTGC TGAGAAGTTA ATATCACCAA GCAGGAGGGG 000480
000481 AGCCAAGCTC AAGGACCGTG AGAGCCACCA GGAGAATGAA GACAGGAACA GTGAGTTGGA CCAGGATGAG GAAGATAAAG 000560
000561 AATCCTTCTG TAGGGGGTTC CCGATGAGTG GCTGTGAGTT AGAGACAAGC TGCTGTGTGT GCCATTCTAC AGCACTTGGG 000640
000641 GAGAGGTTCT GTTAAAAATC CATCCCTCTG AAAACTGCCT TGGCAATTCC CGGCTGATTC TTCGGGATGG GCCTCATTGA 000720
000721 TGGGGAGTCT CAGTGGATGT TTGACTTTTG CTTTCTACCT GACCCTAGTC AGAGATTTTT TCTTTTTCCT TTTTTTTTTT 000800
000801 TTTTTTTTTT CTGAGACAGA GTCTTGCTCT GTCACCCAGG TTGGAGTGCA TTGGTGTGGT CTTGGCTCAC TGCAGCCTCT 000880
000881 GCCTCCTGAG TAGCTGGGAC TACAGGCGCA CACCACCACA CTGGCTAATT TTTGTATTTT TAGTAGAGAT GGGGTTTCGC 000960
000961 CATGTTGGCC AGGCCATCTT GAACTCCTGA CCTGAAGTGA CCTTCCCACC TTGGCCCCCC AAAGTGCTGG GATTACAGGC 001040
001041 ATGAGCCACC ATGCCTAGCC CAGAAATTTT CTCTTTGAAT ACTATACATT GATGAGTTCT GTCTTTATGC TTATGATCCA 001120
001121 CGCAGGTAGG TCGTCTCAGA TTTAATTTTC AGTGGTTTTT TTCTGTCTTG ATGGAGTCTG TGTAACATTT AAAATATTGC 001200
001201 TACCAAAGCA GTAATTCTTA CTATAGTTAT TAAAATGCAA GAACAATATA TTTAAAATTA TTTCATAATA AAGTTAAAAT 001280
001281 GAGAAAAAAA AAAAAAAAA

Predicted Small Protein

Name LINC00471_smProtein_719:892
Length 57
Molecular weight 6502.4657
Aromaticity 0.245614035088
Instability index 48.3966666667
Isoelectric point 4.60174560547
Runs 6
Runs residual 0.0486842105263
Runs probability 0.0511835364778
Amino acid sequence MGSLSGCLTFAFYLTLVRDFFFFLFFFFFFSETESCSVTQVGVHWCGLGSLQPLPPE
Secondary structure LLLHHHHHHHHHHHHHHHHHHHHHHEEEEELLLLEEEEEEEEEEEELLLLLLLLLLL
PRMN LLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooooooooTTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiiiiiii