NONHSAT059014

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT059014

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3382 nt

Genomic location

chr18+:36786770..36807345

Exon number

3

Exons

36786770..36789951,36791504..36791653,36807296..36807345

Genome context

Sequence
000001 gtttgagcca ctgagcccag ccCAAACACA AGTATCTTAA TTACTTATTC GAATGTTTTC CCACTATATC CCACTAGAGG 000080
000081 GAGGGGAAAA AAAACAGAAA ATACAAATAA ATTGCTGTCA ATTTTGATTG GCTTTATTTT GATGTGAGTC ATTAACATTT 000160
000161 AATAAGAACT TAAAATCTAT TTTCAAAAAA TAAAacaata tagtgattaa gagttcattc tctattgtct tggtcaaatt 000240
000241 ctgtcattta ttacatgaat aatattgaca aagttgtctt tagtattcct ttctttaaaa tggaaggaat aatgttatcc 000320
000321 acttcaaaga acagtaatga ggtttaagca ttgaacacag tgcctggcct atattaaggg ctaacaaatg ttTTCTACGT 000400
000401 TACCGTTTGT TGTAATTTAC TAGAGCTTAC CActcttcct tcttcttatg tattttttct cttttccttt tcttctgctt 000480
000481 ttccttattc ttcttAACCA TAAAAATTCA ATAAATGGTA GATAAtgcca ggaaatactc taaatgcttc ttttgtatta 000560
000561 tctgacttaa tcctcaccac aatcttatag gttagtacta tcattattac aattttacaa atggaataat tgaggcacta 000640
000641 aaaaggttgc ttaacctggt tgatcactga gatagtgttt tatctgagat ttgtattcag aaagaactat gtagtcctgc 000720
000721 tcataagcac tatgccatat tgCTTCTTAC TTTAGTATTT TTAACATCAG AGCAAGGTTT TATCTTAAGT TACAATGCAA 000800
000801 CATTAAAATC TAAGTGGAAA TGAGATTTAG AGCTTTAATT TTTTAAAGGA TTGAGAAAGA AAGGAAAAAT CGTACATACC 000880
000881 AGCTTCATGG ATGAGGTGAG ATAATTAAGA GAAAGGATAC TGAAAGGCTA GAACAAATGT CCTTGAGGCT GTCTGTGTCC 000960
000961 GGTGCCTTCC CTATAAGTAA AGGCAACAAG CTCTGTCTGT CCAGACAAAC TCACTTGACT CAAGGTTATG ACTATAAAAC 001040
001041 AGGACTCCAC AACAACATAG CTGTCCATGT CCTTCCTTCT TTCGGTCAAC CTCTTCTAAT CCTGCAAGTG TGAACTTTGC 001120
001121 TTCAGCACGT GCTTCAGTCT ATGGAAACTG TCTCCTCCAA GCTCTGCATC TCACTGATTG TTCTGTTTTC ATGGAAGTCC 001200
001201 AGATGGCTGC TGGAGGTTTA CTTGAGGTCT CTTTCTGTAG GATATAGTTA TACTTTTGTC AACTCGGTGG TACATAAGTT 001280
001281 TGTTCACTCT AAAAGAAGTT GCCTTAAAAA AAAAAGCTTT CAGATGTTAG AGAGAACTGA GATGCTGATG TCAACACATG 001360
001361 AAAAATTAGA TGCTTAACAA TGCAAGAATA CTTATGGTAT AGATTTTAAC ATAATTATCA ATACATTCCT TTCAGCTATT 001440
001441 TTCTCATTTT AATTGCATAC AGTTTATGTC TTTGTTTTAT GTAGCTTTTT TTGATGAAAA GAGAATGGAA GCAGCAAGAT 001520
001521 GAAAATGAAA GAATTATGCA AAGGAACCGA AGCCTTGCCA AATACAGAAT TTGTGGTTTC TACTTTGTGC ATGATTTATG 001600
001601 CACAATTACA TTTTTCAATG TATTATCCTG CAAGCTACTT TGCTGTATAC TCCTGGTTTG TCTTTCTGGT GTCTGTGTTT 001680
001681 GGATTGAGAC AAATATGCTT CTTTTGTTTT CATTCTTGAA GCCAATAACT TTTTAGGGAG ATGTGGAATT TCTTTATTTT 001760
001761 AGATTAAGCA CTATCCGTAA TTCCTTGTAT TACATTTGGC TTGATGTTGA AGAACTTTGT CACCACTCTG GTTGGTCTCC 001840
001841 GCTGAACTTT TGCCAAGAAT AAAAGCAGAt ccacttcaaa agctgatctt ttttgctgct caagattaaa tccagaattt 001920
001921 agctggtttc tgccttagct tGGTCAACAG ATAAAGGGTC AAGACTGGAA AATTGCTTGT ACCCCTAGGA GACATGGATA 002000
002001 GGAAAAAAAG ACTGATGAGT TTTTAAGGTA CAAACAGAGA CCATCACAGC ACATTTCACC ACCTAGATTA AATAACTACT 002080
002081 AGGCTGAAGA TGGAAAGTAA AAGGGAAGTC ACATATTGAG CAAGAGATGT TACCCCAAAG TCAAAGGAAC ACTTATATGG 002160
002161 GGAAATGGAC AAATACAAGC AGGCAGACAA TCATGATTAG GGATGATATG TCAACAGGAA ATGGGATCCA GCCATCAGGC 002240
002241 TCAGCTTTAT TGCTACGACT ACAAGTCAGG AGATTATGAT CATAGCAGAA CTACAGTTCT AGAGGATGCT AAAAATGCAC 002320
002321 GCTGCCAAAT TGGTGGAGAG GTGTAGTGTT TGGCGCTGGC TTCTAGTGCA AAACTCTGTT ACCAGAAGTC TTACTCATGG 002400
002401 AGGTAAGAAA GGGAATTAAG GTATTTGAAT GAGAAAGACT GGTGTTTAGG ACAGGAAAAC ATCATCACGT TTAGATGAAC 002480
002481 AGTCAGTCGG GGAAGCATGA TGGAAAACCA TATAATGATT CTAAAAGGGC TTGGATATGT CATGTCTCAT GTCAGTATTA 002560
002561 GCCTTGGTGT TTGGAAGAGA AGCAGCATAA ATGATAGGAT TTAGTATGGC CCACTGGGAA TTTTCTAATA TTTATCTTCT 002640
002641 ACCTCCTAGA GTATGAAATC ATGACAGAAA ATATAAGCAG TTGTCTGACA TTCAATTACT AGCTGAAAAA GGAAAGCAGA 002720
002721 AACTAATGCT CTCTAATCAT ACTTTATATC ACAAATGTGA GATGattata aaacatgcaa catccatttt ctctatctgc 002800
002801 atgggcagtc tacagaactt taacaatagt atgtctcctt ctttttttct ttacctaagt ccaagtgctt caaatgtcct 002880
002881 taccctccag ttcataaatg tccTTACCCC CCAGTTCATA AATGTCCATA ATTTCTACAT CTTCTTCATC TCTATTAATA 002960
002961 ATTTTTCTAT CTTAATAACC TTGATTTTAT TGAAAAAGTG ATGAACTCAT TTTTGGTGCA TTGCCTAAGA CACAGGCTGT 003040
003041 CAAATCTTTC TTTCCAGGTA TTTTGTCCCA TTAAAATCAG TGAAAAGATT CTGTATTTTT TTTTTCCTCG TGCTCTGTGG 003120
003121 TTTTGTTCTG TTTTATTTGG AGCTCCAAAA CATACAGTTA ACCTTTATGA AAACTATTTC TTttcttctt tttttttttt 003200
003201 ttttttactt ctgctcaaag gccttccaat caatggaatc agtcccaccc agattatcta tgataatctc ttttaaagtc 003280
003281 aactgtagat gttaatcata cctacaaact actttcagag caacaccaag tttttggaac tgggtaacag gcagaggctg 003360
003361 gaacactttg gagggctgaa aa
[back to top]

Predicted Small Protein

Name NONHSAT059014_smProtein_1493:1735
Length 81
Molecular weight 9576.4568
Aromaticity 0.1375
Instability index 43.551375
Isoelectric point 8.77691650391
Runs 13
Runs residual 0.0236842105263
Runs probability 0.0418450124333
Amino acid sequence MKREWKQQDENERIMQRNRSLAKYRICGFYFVHDLCTITFFNVLSCKLLCCILLVCLSGV
CVWIETNMLLLFSFLKPITF
Secondary structure LLLLHHHHHHHHHHHHHHHHHHHHHEEEEEEEELLEEEEEHHHLLLHHHHEEEEEELLLL
EEEEHHHHHHHHHHHLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHH
HHHHHHHLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTT
TTTTTTTooooooooooooo