NONHSAT101244

From LncRNAWiki

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT101244

Source

NONCODE4.0

Classification

intergenic

Length

2682 nt

Genomic location

chr5-:42985708..42993563

Exon number

2

Exons

42985708..42985766,42990941..42993563
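
The exon coordinates can be cross-checked against the reported transcript length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state the convention); `exon_lengths` is a hypothetical helper name used only for illustration.

```python
def exon_lengths(exons: str) -> list[int]:
    """Return the length of each exon in a 'start..end,start..end' string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    lengths = []
    for block in exons.split(","):
        start, end = (int(x) for x in block.split(".."))
        lengths.append(end - start + 1)
    return lengths


exons = "42985708..42985766,42990941..42993563"
lengths = exon_lengths(exons)
total = sum(lengths)
print(lengths, total)
```

Under that assumption the two exons are 59 nt and 2623 nt long, which sum to the 2682 nt listed under Length.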

Genome context

Sequence
000001 AAAGAAACTT GAGGCAGGGC TAACTCTGCG TTTGTTATGA TTAGGCTACA GGAAACCCGG AGTTGCTGGG AAATGAGTCA 000080
000081 CAGGGTCACT CAGCCTTTCT GTGGCGAAAC ATCGACTTCC TACGACCTAG AAATCGCCGC CAGGGCTAGG CTGGTGTTTG 000160
000161 TCCTAAGTGG GTACTTGCTT GGGTTGAGAA ATCACATTTT GGCTATTCCC CGACTTGAAC CAGGGGATAG AAATCGTAGT 000240
000241 CACCGGCTGG GCGCAGCGGC TCATGCTGTA ATCCTGGCAG TTTGGGAGGC CGAGGCGGGC AGATTATTGA GGTTAGGAGT 000320
000321 TCGAGACCAG CCTGGTCAAT ATGATGAAAC ACCGCCTCTA GTAAAAATAC AAAAATTATC CGGGCGTGGT GGCGAGCGCC 000400
000401 TGTAATCCCA GCTACTCAGG AGGCTGAGGC AGTAGAATCC CTTGAACCTG GGGTGCGGAG GTTGCAGTGA GCCAAGATCG 000480
000481 CGCCACTGCA CTCCAGGCTG GGCGACACGG CGAGACTCTG TCTCAAAACA AAACAAAAGG AAAAGAAGAG AAAAGAAAAA 000560
000561 GAAACGCAGG CACCCTGAAA TCGAAACTTG TTAGGACCAA CAAGCCAACG ACAGACGAAT CTGCGGGGTC CTGCGACAGA 000640
000641 CACGCCAGCC CCGCCGCCAC AGCACAGGGT TCTGGAGGTC GAGCTGTTCT GCGGGTTCTG CGGCGGCGCT GGGAAGAGAC 000720
000721 GGCGCCCGGG CAGCTCCCCT GTCACCGGCT TGGAGGAGCG GCGGGCTCCC CTAGCCCAGC GCCGCCGCCG CCGCCCGGGA 000800
000801 ACGCCAGGCT CACCCCAGCT GGGGAAGCTC CGAATCCCTC AGATCGGCGA CCTGAGTGCT GTCGCCCCGA GAAAATGGAG 000880
000881 GCATCGAGCA ACGAGCTGTG CTGAGACCGA GAGACCTCAG TTCTGCGAAA TGTCGGTGCC TGGAGCTTGC GAATGACTGC 000960
000961 GGCCGGCGGG TTGTCAAGGA CAACATTCCT TATGGCGCAA CCAACAGCGC TGTCACCAAG AAACTGGACT CTGAGAAAAA 001040
001041 AGAGGGTTTC GGCCACCGAG AAACTCCGTG CCACATGTGC CGTGACAGCA AAACCGCCCG CTCCGCGCCG CTGGTGAAAG 001120
001121 AGCTAACGAA CGCCAGGTGC TGCGACAGTG ACACCGGGAC TAGAAGGCCT CGGATGGAGA ATGGGACGCC CAGAGCAAGC 001200
001201 AGGAAAACGT TTTGTTTGGC TCAACTAGCG CTGTCGCAAC CGGAAAACTG TCAGGCCAGT GCTGCTGCGG ATGTGAGAAA 001280
001281 CCGGTGATCC GAAGGCGGAG TGGCGCTGGG CGCTGGACTC GCTGGTGTGA AAATCTGGCT GCTCTAGGAC CCGGCAAGCG 001360
001361 CGGGCACCAG GCGAACGCCA GCGACTTTGC CAGACAAATG CATGTCTTTT TTGGTCGCCA TTCAAAGATG CAGCCGGCAG 001440
001441 TCAGCCCCGG CTCTGAGAAA CAAGTGGAGG TGTCAGCCAT AAACCCTCCG TTCAGGGAAA AGAGAGTCGC TGCGACGGAT 001520
001521 CGACTGGCCG TTGGGTGGAA CTTTCTTTGG TGGCCTAGTG ACTAGAAAAT AAAAACGAAA ACGAAATAAA AAACTGCTAC 001600
001601 TGACATTACT CCCACGCTCC CTCCTCAAGT CAGTGGCTTC AGGCTTAAGA AAACCTGTAT TTTTCTTGGC TGATCCGTGA 001680
001681 TCCCTAATTA CTGAGGGCCC CGCGACCCAC CAAGTGCGAT TCGCAGAGGG AAAGGAGCTG GAATAAGGAT CCAGAATGGG 001760
001761 ACGGATTTGG CTCCTCCAGG AAGAAAGCAA AACAAATGCA GGTCGCGAAC ACTTTACATT GCAGGAAGCT GCTCTTCCAG 001840
001841 TGGCCGCGGC TGGGTCCTTC CACACCACCT CTCTGTCCTA GGCCTTCGTC CCGGTTCAGC CCTCACGGAT TCTAGGCTAG 001920
001921 CGGTCAACAA CCGCATCCCT CCGGAACCTG GAAACTCTAG GTCCGAGATC TCGGTGAAGA ATGAAAGGCA CAGCGCCGCT 002000
002001 CAACCCAGGC TTCAGGCCTG CCCCGCTGGT CCCGTCCCTT CCCTGTGGGA TGGAACTCTG TGGGAATCGC TCCTTATTCT 002080
002081 GATGGTTCAC TCATTCAGAT GTCCTTGATA TTTGGGGATA TTTCTAAAAC AGCTGGGGAG AAAAATGGAC TGCGAGGCTC 002160
002161 CGGTGTTGGA CGTGAAATTG GCAGTGAGAC ATCCGCCTCT AAGCCCAGGG AAAAGCTTAA ATTGTGGTTC TCTGTCTCTA 002240
002241 GCACCGTGAT CTTTGAAATG CGTTATTGAT TTTCTTCTTT CTTCTGGGAA ATGCAATGCA GGTGAACTTG TAAGTGTTGA 002320
002321 ATCCTCCCTC TCCTCACTGT TCAATCAGAC TGAGGCCTAC TTCCTTTCTG ATAAGAAATA TGGAACTGCC ATTGCTCAGT 002400
002401 AGGAAAATGT CCTTTCTACA AGCATTCATA TTTCAATAAA AATCTGCACT TTGTGAGTTG AGCCCCCTCA AAGAGACCTA 002480
002481 CAGACCCTGC CATGAGCATC TCCATCCACA CTCATCATTT CCCATTTTCA CCACTTCTGC TCCTGTGCAA ACCAGTACTA 002560
002561 TCCTTTTCTG GGGGATCCTA GGCAGAAGCG ATGGCAGGAC AAAGGCAGAG AAGGCACTGC CAGAGGTCCT GCATGCCAGT 002640
002641 AAAACTACTC ATGTCAGCTG TTCTGCAGGA TCCCATGAGA AG
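
The listing above interleaves six-digit position counters and spaces with the nucleotides. A minimal sketch of how it could be flattened into a plain sequence string, assuming every line follows the layout shown here; `parse_numbered_sequence` is a hypothetical helper, and only the first two lines of the record are used as sample input.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Strip position counters and spaces from a numbered sequence listing."""
    bases = []
    for line in text.splitlines():
        # Keep only runs of nucleotide letters; the numeric counters are dropped.
        bases.extend(re.findall(r"[ACGTN]+", line.upper()))
    return "".join(bases)


sample = (
    "000001 AAAGAAACTT GAGGCAGGGC TAACTCTGCG TTTGTTATGA "
    "TTAGGCTACA GGAAACCCGG AGTTGCTGGG AAATGAGTCA 000080\n"
    "000081 CAGGGTCACT CAGCCTTTCT GTGGCGAAAC ATCGACTTCC "
    "TACGACCTAG AAATCGCCGC CAGGGCTAGG CTGGTGTTTG 000160\n"
)
sequence = parse_numbered_sequence(sample)
print(len(sequence), sequence[:10])
```

Applied to the full listing, the same function should yield a string whose length can be compared with the 2682 nt reported under Length.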

Predicted Small Protein

Name NONHSAT101244_smProtein_1397:1564
Length 56
Molecular weight 6296.1195
Aromaticity 0.127272727273
Instability index 68.7672727273
Isoelectric point 9.68792724609
Runs 9
Runs residual 0.0121212121212
Runs probability 0.0485779897545
Amino acid sequence MHVFFGRHSKMQPAVSPGSEKQVEVSAINPPFREKRVAATDRLAVGWNFLWWPSD
Secondary structure LEEEELLLLLLLLLLLLLLLEEEEEEELLLLLLHHHHHHHHHHHLLLLEEEELLL
PRMN -
PiMo -
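
Some of the listed properties can be recomputed from the amino acid sequence. The sketch below assumes that aromaticity means the fraction of Phe, Trp and Tyr residues, which is a common definition, although the tool that produced the values on this page is not named; `aromaticity` is a hypothetical helper.

```python
def aromaticity(protein: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in a protein sequence."""
    aromatic = sum(protein.count(aa) for aa in "FWY")
    return aromatic / len(protein)


seq = "MHVFFGRHSKMQPAVSPGSEKQVEVSAINPPFREKRVAATDRLAVGWNFLWWPSD"
print(len(seq), aromaticity(seq))
```

The sequence has 55 residues, 7 of them aromatic, giving 7/55 ≈ 0.1273, in line with the listed Aromaticity. The listed Length of 56 is one more than the residue count; this would be consistent with counting the stop codon, though the page does not say so.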