LINC00299
Contents
Annotated Information
Approved Symbol
LINC00299
Approved Name
long intergenic non-protein coding RNA 299
Previous Symbols
C2orf46, NCRNA00299
Synonyms
FLJ45673
Chromosome
2p25.1
RefSeq ID
NR_034135
OMIM ID
_
Ensembl ID
ENSG00000236790
pubmed IDs
12477932
Sequence
>gi|300796743|ref|NR_034135.1| Homo sapiens long intergenic non-protein coding RNA 299 (LINC00299), long non-coding RNA
000001 GACACTGAGC TAGGGTGGCC ATGCCTCGCC CTGACCTCAG GCCCATGGGC CCCGTGGATG AGCAGGGGCA TGCAGGACAG 000080
000081 GCCTGCAGGG CAGGGCAGGG CAGTAGGGTT TTCATCGGCA AAGCCCCGTG ATCCTCACAG TAGCCCCAGG AGTTGGACCA 000160
000161 GCCTGGACCC TTACCCCATT TTACAGCCAG AAACAAGATA AGCGCCTTGC AGGGGCCCTT AACTGACTGC TGGGTTCGAT 000240
000241 GGTTCAAAGA GAAAAAGCAC GTGATAACTT TGAAGGAGGC TGCCTGGCTG AGCTGATCGG ATCACCTAGA GACTGGAAAT 000320
000321 GCTTTCTGGT CTGAAGTCAC CTGCCCTATC TGGACATTCC ATAGACTACC ACTTTTACCC ACGACTTCGA TGTGGCATGC 000400
000401 TCATTGGACC AGATAAGCAA GCAGTGGCAA GTGGATTGGA GGTCTTGGAA TATGAAGGCC TGGCCCTCGT CTCAGCTTGG 000480
000481 AATAAATCTG AAGGCCCATC ACAGCTTCAG AGATGCTTGG AGAAGGCTGA GGTCTCCCCT GTAACCTTAT GAGAACCTTG 000560
000561 AGTTGTTGAA CTCATTAATT CTTTGGGACC ATCTTCCTGA TGTTCAAGTT TGTGAAACAG GCTTTCCCTT TAAGATTTGT 000640
000641 TTTCCTCAAA ATACAAGACT ATTGATGCTT CAGTTTTGAA GTGGATTCTG CACTCAAGAA ATGTACCATT GAAGAATGGG 000720
000721 ACTGGTAGGA AGGGAAAAAA ACTAAAAGGA AACGCATTTT CGAGTTGCAC ATCATGATCA GGCACAGAGT CCAGATACAC 000800
000801 TTGGAGTAGC AAGCATCCAT CCTTGTGGAG ACGCACATGA ACCAGGACTC TCTGCACCCA CCCCTCTCAA GTTTTATTTT 000880
000881 CCGACGGCTG TAATGTTCCA GGACACCTGC CATCATGGAG GGTTATCCAG CACCTGGAGG ATACCTGCTA CCTATTTTGT 000960
000961 GAACAATTTG GCATGAGGAG AGAAAGGAAC CTGAAGTTTT TAGCAAGGAA GCTTCCTGTC TCCAAGGGGA TTTTTTCCCA 001040
001041 CAGAAATATA TTGCTGATTG GATGTTGCCT CTCTAATTTT GGAGACCATA TGTTCTAGGT ATTCCGGGAA AAATCTGCAT 001120
001121 TTCAAATCCT GGATCCATTG TTCCCAAAAG CCATGTAAGA ATCCCAACAG AGATAGGTCC CTCTCATGGA TATGTCCTAT 001200
001201 GTCCCAGGGA TACCGCTCAA AGCATTTTGT GGCTGAAGAC ATTTAATCTT TCATAGAAAC TCCTGAAGAA GATGTTATTG 001280
001281 CTATTATAAT CATATGTATT TGGAAATGAA GACTTTGAAA GGTTCTAACT TATTCAACAT TTCTCAACCA GCATGTGGCA 001360
001361 AAGTTCTGAA AGTCTGTTAT TCTAACCAAC ATCCTACCTT GAGGAGCAAT GATGTTCACC TGTCTGGTTG CCACAACTCA 001440
001441 CGTCCTTCTG TGGGCAGCAG AGTTCCAAGA CGTTCCCCAA GATCTCATTT TCTGGAGGTG CATGTCTCCC GTGACCCCCT 001520
001521 CTTTGGATTG CCCGCAGAGC CCGTGAAGAT GGTGTTATCA CTCCTGTGAT TACTTTACTG ATCAGGTGAC TTTGAGTCAA 001600
001601 TCAAAAGGTA GATTATCCAG GTGTGCCTGA TTTGATCAGG TGGTCCCTTA AGGAGGCTTA AAATGACCCT TTCTGAAGTA 001680
001681 GAGTAATTGG AAAAGTAAGA GGGTCTATGG GTGGGGTCAC CTGGCAAGGA ACTGAACTCA GCCTCCATGA GCTCTGGCCA 001760
001761 CCAGCTGACC TTTAGCAAGA AAGCAAATCT TTCTTTGGTC AGTCTCCACA ACAGGACGAA GCTGGCTGAG CCCTTGCCTT 001840
001841 TGGCCCTGTG AGATGCTGAC CCGAGTATCC AGCGAACACG TGCCAGAGTC CTGACCCATG GAAACTGAGA TGATGAGTCT 001920
001921 GTGTTGCTTT AAGCCACTGT GTCTACAGTA ATTGGTTAGA CAGCAATAGA AAACTAATAA ACCTCCCTCT TTTTCATTTA 002000
000081 GCCTGCAGGG CAGGGCAGGG CAGTAGGGTT TTCATCGGCA AAGCCCCGTG ATCCTCACAG TAGCCCCAGG AGTTGGACCA 000160
000161 GCCTGGACCC TTACCCCATT TTACAGCCAG AAACAAGATA AGCGCCTTGC AGGGGCCCTT AACTGACTGC TGGGTTCGAT 000240
000241 GGTTCAAAGA GAAAAAGCAC GTGATAACTT TGAAGGAGGC TGCCTGGCTG AGCTGATCGG ATCACCTAGA GACTGGAAAT 000320
000321 GCTTTCTGGT CTGAAGTCAC CTGCCCTATC TGGACATTCC ATAGACTACC ACTTTTACCC ACGACTTCGA TGTGGCATGC 000400
000401 TCATTGGACC AGATAAGCAA GCAGTGGCAA GTGGATTGGA GGTCTTGGAA TATGAAGGCC TGGCCCTCGT CTCAGCTTGG 000480
000481 AATAAATCTG AAGGCCCATC ACAGCTTCAG AGATGCTTGG AGAAGGCTGA GGTCTCCCCT GTAACCTTAT GAGAACCTTG 000560
000561 AGTTGTTGAA CTCATTAATT CTTTGGGACC ATCTTCCTGA TGTTCAAGTT TGTGAAACAG GCTTTCCCTT TAAGATTTGT 000640
000641 TTTCCTCAAA ATACAAGACT ATTGATGCTT CAGTTTTGAA GTGGATTCTG CACTCAAGAA ATGTACCATT GAAGAATGGG 000720
000721 ACTGGTAGGA AGGGAAAAAA ACTAAAAGGA AACGCATTTT CGAGTTGCAC ATCATGATCA GGCACAGAGT CCAGATACAC 000800
000801 TTGGAGTAGC AAGCATCCAT CCTTGTGGAG ACGCACATGA ACCAGGACTC TCTGCACCCA CCCCTCTCAA GTTTTATTTT 000880
000881 CCGACGGCTG TAATGTTCCA GGACACCTGC CATCATGGAG GGTTATCCAG CACCTGGAGG ATACCTGCTA CCTATTTTGT 000960
000961 GAACAATTTG GCATGAGGAG AGAAAGGAAC CTGAAGTTTT TAGCAAGGAA GCTTCCTGTC TCCAAGGGGA TTTTTTCCCA 001040
001041 CAGAAATATA TTGCTGATTG GATGTTGCCT CTCTAATTTT GGAGACCATA TGTTCTAGGT ATTCCGGGAA AAATCTGCAT 001120
001121 TTCAAATCCT GGATCCATTG TTCCCAAAAG CCATGTAAGA ATCCCAACAG AGATAGGTCC CTCTCATGGA TATGTCCTAT 001200
001201 GTCCCAGGGA TACCGCTCAA AGCATTTTGT GGCTGAAGAC ATTTAATCTT TCATAGAAAC TCCTGAAGAA GATGTTATTG 001280
001281 CTATTATAAT CATATGTATT TGGAAATGAA GACTTTGAAA GGTTCTAACT TATTCAACAT TTCTCAACCA GCATGTGGCA 001360
001361 AAGTTCTGAA AGTCTGTTAT TCTAACCAAC ATCCTACCTT GAGGAGCAAT GATGTTCACC TGTCTGGTTG CCACAACTCA 001440
001441 CGTCCTTCTG TGGGCAGCAG AGTTCCAAGA CGTTCCCCAA GATCTCATTT TCTGGAGGTG CATGTCTCCC GTGACCCCCT 001520
001521 CTTTGGATTG CCCGCAGAGC CCGTGAAGAT GGTGTTATCA CTCCTGTGAT TACTTTACTG ATCAGGTGAC TTTGAGTCAA 001600
001601 TCAAAAGGTA GATTATCCAG GTGTGCCTGA TTTGATCAGG TGGTCCCTTA AGGAGGCTTA AAATGACCCT TTCTGAAGTA 001680
001681 GAGTAATTGG AAAAGTAAGA GGGTCTATGG GTGGGGTCAC CTGGCAAGGA ACTGAACTCA GCCTCCATGA GCTCTGGCCA 001760
001761 CCAGCTGACC TTTAGCAAGA AAGCAAATCT TTCTTTGGTC AGTCTCCACA ACAGGACGAA GCTGGCTGAG CCCTTGCCTT 001840
001841 TGGCCCTGTG AGATGCTGAC CCGAGTATCC AGCGAACACG TGCCAGAGTC CTGACCCATG GAAACTGAGA TGATGAGTCT 001920
001921 GTGTTGCTTT AAGCCACTGT GTCTACAGTA ATTGGTTAGA CAGCAATAGA AAACTAATAA ACCTCCCTCT TTTTCATTTA 002000
Predicted Small Protein
Name | LINC00299_smProtein_20:130 |
Length | 36 |
Molecular weight | 3788.3047 |
Aromaticity | 0.0277777777778 |
Instability index | 58.7666666667 |
Isoelectric point | 9.48956298828 |
Runs | 5 |
Runs residual | 0.0415695415695 |
Runs probability | 0.0475357710653 |
Amino acid sequence | MPRPDLRPMGPVDEQGHAGQACRAGQGSRVFIGKAP |
Secondary structure | LLLLLLLLLLLLLLLLLHHHHHHLLLLLEEEEEELL |
PRMN | - |
PiMo | - |