LINC01019

From LncRNAWiki

Annotated Information

Approved Symbol

LINC01019

Approved Name

long intergenic non-protein coding RNA 1019

Previous Symbols

_

Synonyms

_

Chromosome

5p15.33

RefSeq ID

NR_033898

OMIM ID

_

Ensembl ID

ENSG00000248118

PubMed IDs

12477932

Sequence

>gi|299829234|ref|NR_033898.1| Homo sapiens long intergenic non-protein coding RNA 1019 (LINC01019), long non-coding RNA

000001 ATGCGCGGCC TGCCGGGGGC GGCCTCTTGC AGTTCCCGCC CTGACGCAGC GCGAGATTAG AGCTCCTGCT AGTCCCCAGC 000080
000081 TGACCCTGGG CCTCAAAGGC AAGCCCTGAC CTCAGTCACA CGGGGTGGCG GTGATATGGT TTGGCTCTGT GTGCTTCACT 000160
000161 CTGCCTCACT CGCCTTCCGC CTGCTGGACG CTGCCCTGTT CCCCGCAGGC GGCCGCCGTG GAGGGCAGGC ACCGCGCTCC 000240
000241 CTGGCTGCTC CCGGGGCCTT CCAAGCACAG CCACGGCCAT GGGGCCGAAA CCCAGACCTC GAGGGGCCTG CGGGCAGAGC 000320
000321 TCGACAAGCC CAGCTGCTAC ACTGAAGGGA AAAACCTGAG GAAGCCCCTG ACCTTCTTGA ATAACCTGGA TATTGCAGAA 000400
000401 TCATGCCTGG CACACGGCAC TGGCAGAGAC GCAGAAGGGA AAGTGTCCAG CGATGAGAGA TACCGGCCAG GGGGCAAATC 000480
000481 TTCAGCTTGC AGCACTCACT TTGCAGATCC AGTTGGAATT ATTTGTGTTG ACGCTCTCTG CTCTCTCCGC TCTCCCCTGG 000560
000561 AAATGACGAG GAACTGAGTA CAGTGGACAA TGGCTTCCGT GTCCCTAGCT CCCCCCGAGC CATGAGCTGC TACATCAAAA 000640
000641 GTCTGCAGTG GGCTGCCCTT CCCATAATAT TTCAATTTAA GCACCGACTT GCTGCTGCGG AGTCATATCT CATTCATGCT 000720
000721 GCAGCCTTTG CAGGAGTCAG GGATCATTAT GGAGCAAGCA CTGAGAAAAA ACAGATTACA GCTTGGCACC GAGCAGCCAG 000800
000801 GTTGCACCCC AGATGCTTCA GGGACATGGT GCCTGCTCTG GAGAATGGGG CAGCTTCCAC ACTGCCCAGG AGCCAGGGCT 000880
000881 TCTGACCCAG GAGCGAAAGT CTGTCTCTTC CATTTCTGGG AGCTGGCAGT ATTTGCACGA CTTTCTGGGC CTCAGGCATC 000960
000961 CCATTGCCCA CCAGGAATTA CCTTCCTCCA GGATCATGGT GAAGATGACA TGAGATGCTA ATGCTTGGCC TGCGGTTGCT 001040
001041 ATTTTTAGTT CTCCACTTTC CTGGAAAGCA TGCAGTTGAA GTTCATGACG GTATTCTCTC AACGTGTGAC TCAAAGACAG 001120
001121 AGGCCGTTTC CCATCCACCG GTTCTCACAC TGCCCCATTC TGCAGCTTCA CCATCCACGC GCTAAGAAGG CAGGCCCCAA 001200
001201 AGTCTCAGCA CCCAGCCGCT GGGGTCCTCG CTCTTATGGA CAGAAAACTC AAGAATGAAT ATGTTTCCCC ATCTTGTGCC 001280
001281 CTTTTGCCTG GAAACTATAC AGAGCAGGGG TCCCTGCTGG CATTCCTTCT CTGTACTAGT ACGGCTTCGG AAAGTACTGG 001360
001361 TTCTCTGAGG AGCGCTCAGT TGCATCTGGA AGGTGCAGTG TGCACAGCGA TGCCCCTTGA CACCAAGGTG TGAGCATTTG 001440
001441 AGCTGTGCTG GAAGGTTCTC CATGGAATAG TGCCCACCCT GGAGGGTCCT GTGCCGCGCA CTTGAGGAGG CTGTGGTGGT 001520
001521 GAGGAGAGAC TCTGGAGCAA CCCACACTCC CGATAAGTCA GATAAGAGTG CCTCCTCTAG GTAAGTTTAT GTTGCAAATT 001600
001601 TAAAGAGAGG AACCAAGAAA CAGCAAGGAC ACTGAGCCAG GACTGAGAGT TTGGGTCTTA GGTGGCTTTG GGTCATTACG 001680
001681 CCCTCTCAAG TGGATTGCTC TAAGCCCTGT TTATTTGTTA ATGTGAATAG TTGGATGGAA TTGCCTTTGT TCTAAAATCT 001760
001761 TTTATTCTGC CTAAAGTCTC TGGGTCAGCA AGCACATGCC ACATGGCTTC TCTCTCTGCA TGGGAATGCA CACCTGAGTA 001840
001841 GGGAGGCTGG CCAGCCCGTG CTGCCCTGGT GGGGTGAGTG TTAGCTGGCT AGGGTTGCTG GAACAGAGGG GATTCAACCA 001920
001921 CAGGAATTTA TTCTCTCACA CCATGGAGGC CCAAAGCCCC AGATCATGGT GCTGTAGGGC CAGGCTCCCT CTGAGGGCAC 002000
002001 TAGGGAGGAC CTGTGCCCAT CCTCTCCAGC TCCTGGTGGT TCCTTGGCTC GTGGCAGCAC AGCTTGTACC TTCACGTGGC 002080
002081 ACTCTCCCTT TATGTGTGTG GATGTGTCCA ATTTTCCTTT TCTGTTGGGG CCCCAGCCGT AGTGGCTTAG AGGCTCACCT 002160
002161 ACTCTAACAG GACCTCGTCT GAACTCATGA CATCTGCAAG ACTGTGTATC CAAATAAGAG CACATTCTGA GATGCTGGGG 002240
002241 GTTAGGACTT AAGCACAGAA GATTTGTGGG GAGGTACACA GTTCAATCCA TCTGGAAGGT GGATAGCCAT GGCAGGGTGG 002320
002321 AAAGACGTGT GTGGGTATAG GCCGGAGGTG GCCTCACTGG AGAAACCTGT GCACGGCCCC ACAAGGCAGC AGCTGAGCGT 002400
002401 GGAAGGGGCT GGAGCTGCCC CGTGGTAGGG AAGGGGCCGT GTGGCCAGCA GCAGCATCTG CCTTTGTCAT CACTGGGTGG 002480
002481 GACCACGAGG GCAATGCTAC TCTTCCCCTC ACACCTTCTA CAGGCAAGTT GGCCAGGTGT CAATGCAGCC AGGTGACACG 002560
002561 GCCGCCTGGA GGGCAGGAGC AGACAGCCCC ATGGAGGCTC TAGGAGATAG ACAGGGAGGC CGCAGCTGTC CAGGAGCTGA 002640
002641 AGTGAGCAGC AGAGGGCGCC GGGGGAGATG CGCCCTAGCA CCAGGCTCTG AGTGCCCCAG CGAGCTTCCT TGCCTTGACT 002720
002721 TTCCTCACTG GGAACCCGGG CCTGGAGTCC ATATGAAGCC TCAGCCCAGG GGAGGGTCTA CCTGTCCAGG AGGCTCCCAC 002800
002801 CATCCAGCTC GCCCTGGATG GTGCCCTCAG AGGTGCCTGG GCATGGGGTC ACCTCGGAGA GTCTGGAGGG GCCTCCGGAC 002880
002881 AGTGCGGCCA CGGGCTGTCC TCTGCTTCTC AGTGCCCAAG ACAGGGAACC AGAGCACAGC ACTGAACACG TTTTAGTGCT 002960
002961 CTTTTAGTTC ATGTGCTGCT TGCTGAATTT CCCTGAATTC AGAGAAGAGC CTGACGCAGA GGAAGCATCT GAGATCTTCT 003040
003041 GTATGTTTTC TCGTCTGTGC AAAGTGTTTT CAACTCTTTC AAAGGTCGGC ATTCACTGAA TTTCAGCTCC CCTTGGCTTG 003120
003121 ATCTCTCATA CAAAGGTTCT CAAATTCCCT GAAAATATGC AATGATTTTC TCACTCAATA AACGACTTTT CTGTCACCAA 003200
003201 AAAAAAAAAA AAAA
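
The listing above is laid out in groups of ten bases, with a six-digit coordinate at the start of each line and, on full lines, at the end. A minimal sketch of how such a listing could be collapsed into a plain sequence string follows. The function name `parse_numbered_sequence` and the shortened two-line sample (including its end coordinate 000020) are hypothetical and only illustrate the layout; they are not part of the record.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join the base groups of a coordinate-numbered sequence listing."""
    bases = []
    for line in text.splitlines():
        # Skip a FASTA-style header line if one is present.
        if line.startswith(">"):
            continue
        for token in line.split():
            # Keep nucleotide groups; the numeric coordinate columns do not match.
            if re.fullmatch(r"[ACGTUNacgtun]+", token):
                bases.append(token.upper())
    return "".join(bases)


# Shortened, illustrative sample in the same layout as the listing.
sample = """\
000001 ATGCGCGGCC TGCCGGGGGC 000020
003201 AAAAAAAAAA AAAA
"""
sequence = parse_numbered_sequence(sample)
print(len(sequence), sequence)
```

Filtering on the token alphabet rather than on column position means the final, shorter line (which has no trailing coordinate) is handled the same way as the full lines.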

Predicted Small Protein

Name LINC01019_smProtein_812:1012
Length 66 aa
Molecular weight 7348.2582 Da
Aromaticity 0.0757575757576
Instability index 43.4712121212
Isoelectric point 6.95941162109
Runs 7
Runs residual 0.0445472690982
Runs probability 0.0190400484518
Amino acid sequence MLQGHGACSGEWGSFHTAQEPGLLTQERKSVSSISGSWQYLHDFLGLRHPIAHQELPSSRIMVKMT
Secondary structure LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEEELLLLHHHHHHHHLLLLLLLLLLLLLLLEEEELL
PRMN -
PiMo -
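
The suffix 812:1012 in the predicted protein name appears to give the position of the open reading frame in the transcript. Read as a zero-based start with an inclusive end that covers the stop codon, it corresponds to bases 813 to 1013 of the listing (1-based). That reading is an inference from the sequence, not something the entry states. The sketch below translates that slice with the standard genetic code and recomputes the length and aromaticity fields. Aromaticity is assumed here to mean the fraction of Phe, Trp and Tyr residues. The names `CODON_TABLE`, `ORF_GROUPS` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Bases 813-1013 (1-based) of the NR_033898.1 listing, kept in its 10-base groups.
# Assumption: this is the span the name suffix 812:1012 refers to.
ORF_GROUPS = """
ATGCTTCA GGGACATGGT GCCTGCTCTG GAGAATGGGG CAGCTTCCAC ACTGCCCAGG AGCCAGGGCT
TCTGACCCAG GAGCGAAAGT CTGTCTCTTC CATTTCTGGG AGCTGGCAGT ATTTGCACGA CTTTCTGGGC
CTCAGGCATC CCATTGCCCA CCAGGAATTA CCTTCCTCCA GGATCATGGT GAAGATGACA TGA
"""
ORF = "".join(ORF_GROUPS.split())


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


protein = translate(ORF)
# Assumed definition: relative frequency of Phe + Trp + Tyr.
aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(len(ORF), len(protein))
print(protein)
print(aromaticity)
```

Under these assumptions the slice is 201 bases long (66 codons plus a TGA stop), and the translation matches the amino acid sequence listed in the entry. The molecular weight, instability index and isoelectric point are not recomputed here, since each depends on reference tables the entry does not specify.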