NONHSAT044062

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT044062

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2512 nt

Genomic location

chr15+:57592563..57599967

Exon number

3

Exons

57592563..57593122,57597517..57599266,57599762..57599967

Genome context

Sequence
000001 ATTGCAACCT CCGCCTCCTG GGTTCAAGCA ATTCTCCTGC CTCAGCCTCC TGAGTAGCTG GGATTACAGG CGCCCGCCAC 000080
000081 CACTCCCAGC TAATTTTTGT ATTTTTAGTA GGGACGAGGT TTCACCATGT TGGTCAGGCT GGTCTCAAAC TCCTGACCTC 000160
000161 GTGATCTGCC CTCCTTGGCC TTCCAAAGTG CTGGGATTAC AGGCGTGATG GAGATGATAC TCCCTAAATC ACAAGGGTGT 000240
000241 GGTGTGAAGA TGAAATGGCT GCACGTGACA GAGGAGTTGG AGAACAGGAA GCAAGTGGAT TACTTGGATG AACTGTGATG 000320
000321 GGCCCAAGAA GAACTAAGCC AATCACTCCA GGGGGCGTTT TCCCAGGATC CAGGCAGAGA GCTGATTCCA GCATCCACGT 000400
000401 TGTATCCCCC TCCCGCTGTG GCCGGGGTCT CCAAGAGCCG GGCACCCACT GCAGAGCTTC CAAGCCCAGG CCCATTTGAA 000480
000481 CACAGGAGGA CTTTTGAGGT TCATTTAACC CCAAAATCAG TGATTCTACA TCTCAAGCTC ATATTTTGAA GTGTTTACGG 000560
000561 AGAGGGAGCA GGTCTAGAGA AGAACCAGAG GAGCTCAGCT GAGATATGGT GTATGGATTG GATTTTGGTA GAAGATGGGA 000640
000641 AGAACCAAAC ACCTGAGAAA CCACTTTGAA GATCGGGGTC AGAGTAAGGC CTAACACATA GTTGGCTCCC AGTAATTATT 000720
000721 GGTTGATTGA ACAGCTCAAA GAGCAACTCG ACCAAGAACA CTGGACTGGG AGTCCAGTTA CTTGGATCTT GCATTCCTGA 000800
000801 TTTATTTTTA TTTTATATGT ATTTTTTCTA TTTTTTTGAG ACGAAGTCTC ACTCACTCTG TCGCCCAGGC TGGACTACAA 000880
000881 TGGCACGATC TCGGCTCACT GCAAACTCTG CCTCCCAGGT TCAAGCGATT CTCCTGCCTC AGCCTCTCGA GTAGCTAGGA 000960
000961 TTACAGGCAT GCACCACCAC GCTGGCTAAT TTTTGTATTT TTAGTAGAGA CGGGGTTTTG CCATGTTGGC CATGCTGGTG 001040
001041 TCCACCTCCT GACCTCAGTT GATCTTCCTG CCTCAGCCTT CCAAAATGTT GGGATTACAG GCGTGAGCCA CCGTGCCTGG 001120
001121 CCGTGATTTA TTTTTTTTGT GTATGTTTGT TTTTGTCAAC TTGCTGTGTG ACCTTAAGCA AGTTACTTAA CTTCTCTGGG 001200
001201 CTTCACTTTC CATGGATGAA CATTGTAAAG AGGCTGGAGA GAGATGAGGA CTAGGTACAG GCTTTAGAGG AGAGCCACCG 001280
001281 CCCCGGACTT CTCCCTCTGT CACCCCGCTT TCCATGACCC TCCTTGCCTG ACTTTGTGAC TCCTTGCCTC GCTATCAAAA 001360
001361 CAAGTGCTGC AATCTCAGTG CTTTCCAAGA GCCCTGCATT GTTAGAAACT TCCCAGCACG CAGCAAAGGC TGCTGCAATA 001440
001441 CTCGCTCTGC CTGCCTTTGC CCTGCGCTTC CTACTTACCC TCCTTTTGTT TCTCCCAAAC ATCTGTCCCT GACTATGCTC 001520
001521 ATCTCATGTT TGTCCTCAGC TGCTGAAAGG GCCACGTTTG TTTTCATTAC AAATAAGACC ACCGAGTGGG CTCCTGGCGT 001600
001601 GGGGGCGGGA CCAGCCGCGC GCAGTCTTCA GAGGCAGCCC CCCAGGCTGT CTCTGGAGGG TGTGTCTCTG CTTCCCTTTC 001680
001681 CCCGTGTTTA TTTTCAGACG AAGCCAAGTG GCCCGGGGGG ACCCTCCGGA CTCCCAGCCT TCAGAGAGGA GGGCAGCTCG 001760
001761 GGCTTTCGCC GCAGTGCTTC CTGCCCGTCA CGTGTGTGCT CCTAGCCGGG GTCGGGGGAG CTGGTATCTT GGCCCTTCTG 001840
001841 GGAGGACGCG CACAGCCCGA GGAGGCAGAG CCCCAGACGG GAATGGGCTT TTCAGAGGTG GGGTGCGGGC GAGGGGACGA 001920
001921 TGCATTATTT TTAATATTTG ATTTATTTTT CCAACTGGAC TTCTTCCCGG GGCTCTTTCT GGGCCCAGCT GCCTTTGTGA 002000
002001 TCCCGCGCCC CGGTCCTCGG CCTCTCACCT CCAGCGCCGG GGCGCCCCCT GCTGTCGGAA GCGGCTGTGA CCGGGCAGAG 002080
002081 GTGCTATCTG GGACTCTGGG TTCTCAGCCC GGGGACAGCG AACCGAGGGG CAGATGATCC ATCAGAAAAG AGCCGGCACT 002160
002161 GCCCAGCCCC GCGCCCCTGC CCCTGCCTTT TTCCGGGAGC GCGCCGCGCC GCACCCGCTA CGGCCGCTTG ACCCCATCTT 002240
002241 TGAGCCCGGC CCCAAGCTCT GGGACCGTCG TGCCCCTCAT CAAGGAAGAG CCAAGGACCC CAAGGAGAAG AAATGCCTGG 002320
002321 GGCCCACGAA CATCCCAGTG TGGCCCTGGA CGGGACATCA TGCTGGGCAA CACAGCTAAA ATGCGGGTGA AGACCAGATT 002400
002401 TCTTGCACAT GGCGGTGACG GGATGCTCCC TAGAGAGCTT CAAGTGGATT CTTTGCTTTT TATTTTCTCT CTTAATAAAA 002480
002481 ATGTATGATG TTTACATTGT CAGAGAACAA AC
[back to top]

Predicted Small Protein

Name NONHSAT044062_smProtein_1514:1804
Length 97
Molecular weight 10127.456
Aromaticity 0.0833333333333
Instability index 72.1489583333
Isoelectric point 10.286315918
Runs 17
Runs residual 0.0428479381443
Runs probability 0.0334902376425
Amino acid sequence MLISCLSSAAERATFVFITNKTTEWAPGVGAGPAARSLQRQPPRLSLEGVSLLPFPRVYF
QTKPSGPGGPSGLPAFREEGSSGFRRSASCPSRVCS
Secondary structure LEEEELLLHHHLLEEEEEELLLLLLLLLLLLLHHHHHHHLLLLLEEELLEEELLLLEEEE
EELLLLLLLLLLLLLLLLLLLLLEEELLLLLLLLLL
PRMN -
PiMo -