NONHSAT107139

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT107139

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2419 nt

Genomic location

chr6+:3476798..3480072

Exon number

9

Exons

3476798..3477560,3477572..3477676,3477784..3477885,3477974..3478075,3478177..3478263,3478433..3478633,3478834..3479012,3479096..3479199,3479297..3480072

Genome context

Sequence
000001 CCAAAGTTCT GGGATTACAG GCGTGAGCCA CTGCACCCAG CCTGCTCCAT GATTTCTGAC ACTGTGGTGG AAATTCCTTT 000080
000081 CTACTCCTCC CTCCTCCCAG CCCCACCTGT TCTGCAGCAG AACCCCCATT CTGATACTGG TGTGTATCTC CTCTACCCAT 000160
000161 GCATAGGTGT ACATAATCAA CATTTAGTAA TGTTTTCTAT TTTATACAAA TAATACTTCA TACAGTTTTC TACAGCTTAC 000240
000241 ACTTCACTCA GTTCCATGTT TTCAAGACTT GTCTCAGTTG ATATTTGGAG ATCTAGTGTA TTCATTTTGA ACATCTATTT 000320
000321 CTGCATTATT TCTATGAATT TATTAAAATG TATGCATTCG TTTACTTATT GGTGAATATG TAAGTCATTT TTAATTGTTC 000400
000401 ATTCTCAAAA ACAATGATTT GCAGGGATTT TTGTGTATCC GGAATACTCA TTCCCTATCA GTTTCATGCA TTACAAATAT 000480
000481 ATTCCTCCAA TCTGTGGCTC CTTTTGTCCT TGATACTGGC TTTAACTTTG TTTCTGTTGT CTTTTGTTAT ATAAAAATAT 000560
000561 TACATTTTCA TTTTGTCAAA CTTATCAATC TTTTTCTGTT GTGGCTTATG CTTCTTTTAT CTTATTTAAG AAATGATAAA 000640
000641 GATGCACTCT GTAACTCAGG ATTTTGTAAA CAGTAAGCAT CAGTGGAGAA GAATTAATAC AATACTGTGG GTGAACTTGA 000720
000721 CTGCAGTTTT CTTAACAATT TTCTTAACTA TTAAGAACAT TCTCCAGACT TACAGAATCA GAATCCACAT TTTCACAAGC 000800
000801 CCCAGGTCAT TCCTAAGTGC ATTAAAACCT AGGTATAGGC CACACGAAGC TGCTTCTGCT GCCTCTGTAT CAGGCGTTAG 000880
000881 ATTCTCATAA GGAGCATGCA ACCTGGATCC CTTGCACGCA CAGTTCACGA CAGGGTTTGT GCTCCTGTGA GAATCTAATG 000960
000961 CCCCCACTGA CTGGAACCAG GGGTCAGACC CCTGCCCTAA AGGGCTGGCA AATTCTTCCA GTTAGAAATA GAGGCAAGTC 001040
001041 TTGATGCGGT CCTCTTTACT TCTTAATCGT CTGTGAGTGG ACTCACTCCC AGGTGCCCAA ACACCTGCCC TCTGCTGGGC 001120
001121 TGCCTTCCCT CTGATCTGGT ACAATTATTG TGGCAATCTA ACGTTCTATC ACTTGTGTTA ATAATTGCCT CTACTTTCAC 001200
001201 AGTCTCTTTT TCACAGAGAA ACTTTAGAGA AACATTTTGG GCTTTTAATG GGAGGCATTT GTGTACTGAA CTAAGTATCT 001280
001281 GGGTGGCGGA GTCAATACAG TATAAATGTT TTTAGGTCAT CTGTCTTTGA GGTGGGCTGT TCACCTCCTT GTTCCCTTGT 001360
001361 GTATGTGGCT GACCCAACTG ATACCCTGAC TTCTCAGCTC CTTGACTTCC TCAGCTTGTG ACCTTTTTCT CCACTCCATC 001440
001441 TTTCCCACAG CCATGATTCT AGTAGTCAGC TCAGGCTGCC ATAAAAAATA CTATTGACTG GGTGGCTTAA GAAACAGAAA 001520
001521 TTAATTTCTC CCAGTTCCGT CACCAACTCA CCTATCTAGG AGAGTGAGAG AGAAAGCTCT CTTTCTTCCT CTTATTATAA 001600
001601 GGCCACAGTT CTATTGTATT AGGGATGAAT CTTTATGATC TAAGTCTCTA GCATCCCACC CCTGCCCCCC AAATTCATGT 001680
001681 TCTTATCTCA TGCAAAATAT ATTCATTTCA TCCCAACAGC TCCAAAAGTT TTAACTCATT TCAGCATCAA CTCCAAAGTC 001760
001761 TAAAGTCCAA AGGCTCATCT AAATATCATC TAAATCAGGT ATGAGTGAGA TTTGATGATG AATCTTGAGG CAAAATTCCT 001840
001841 CTCTGGCTGA AACCTGTGAA CTCACGCAGG TTATCTGTTT CTAAAATACA ATGGTGGGAC AGGCATAGGA GGGACATTCC 001920
001921 CATTCCAAAA GAGATAAATT GAACAGAAGA AAGGGACAGC AGGCACCAAG CAAGTCCAAA ACCTAGTTAG GCAAATCCCA 002000
002001 ATATATCTTA AGGTTCTAGG ATAATCCTCT AGTTTGTGCC CTGCCTTCCA GACCCACTGG CACAGCTGTT TGGAAGGGGC 002080
002081 ATCTGTAATA TGAATAGGGT GAGAATTTCC CAAATCTTCA AGTTCTGGTT CCTTTTTAAT AATTCCTTTC TTCTTCTTCT 002160
002161 TCTTCTTTTT GAGATGGAGT TTCGCACTGT CGTGCCCTGG CTGGCGTGCA ATGGCGCAAT CTCGGCTCAC TGCAACCTCC 002240
002241 GCCTCCCAGG TTCAAGTGAT TGTCCTGCCT CAGCCTCCCA AGCAGCTAGG ATTACAGGCG CCCACCACCA CGCCTGGCTA 002320
002321 ATTTTTTGTA TTTTTAGTAG AGATGGGGTT TCACTATGTT GGCCAGGCTG TTCTGGAACT CCTGACCTCA TGATCTGCCC 002400
002401 GCCTCGGCCT CCCAAAGTG
[back to top]

Predicted Small Protein

Name NONHSAT107139_smProtein_632:889
Length 86
Molecular weight 9929.5426
Aromaticity 0.117647058824
Instability index 49.7152941176
Isoelectric point 11.7997436523
Runs 15
Runs residual 0.0399286987522
Runs probability 0.0349584878996
Amino acid sequence MIKMHSVTQDFVNSKHQWRRINTILWVNLTAVFLTIFLTIKNILQTYRIRIHIFTSPRSF
LSALKPRYRPHEAASAASVSGVRFS
Secondary structure LEEEEEELLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLEEEEEELLLHHH
HHHLLLLLLLHHHHLHHHLLLEELL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiii