NONHSAT126940

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT126940

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3240 nt

Genomic location

chr8+:64378407..64388040

Exon number

4

Exons

64378407..64378555,64379700..64382504,64384928..64385060,64387906..64388040

Genome context

Sequence
000001 AACTCTGGAG GTGGAGGTTG CAGTGAGCTG AGATCATGCC ACTGCACTCC AGCCTGGGTG ACAGAGTGAG ACTATGTCTC 000080
000081 AAAAAAAAGA AAGAAAGAAA GAAAGAAAAA TTTGAAGTAT AGAGGGGAAT CACAAAAATT CTAAATGGTA ACTCATGTCA 000160
000161 GGAACCTGAA AGTCAAGTCT TCACTTCAGC ATTGGGAAGT ATATGAAACT GTTTTCGGAC CACACAACTC TGTGAGCTCA 000240
000241 TCGCTAAGAA TTGGTATGCA AGGTCTTCAG AAGGGTAGGC AATTTGTAAA CTCCTTTTGA CTTAGCTGGT AAGACAGAAA 000320
000321 GTTCTTTTTT CATGGCTAAA TGATACTTGA GAAAAGAAAA AATGTGGTGA AGCTGAATGT ATCTAGTTGA AATAAAAATG 000400
000401 CCTACCTTTC TCCTGCATAG GCAAAATATA CTACTTACAA GAAAAAAAGG AATTTATGAA TAGAAATAGA ATCAGGTCAC 000480
000481 AGTGCATTTC CTAATTCTCC ACTGGATTTT ATTACTCAAG CACTGTATTT TTACCTTTTA TCATAGCTGT CATTCGTCCT 000560
000561 TGTACTTTAG ATAGCAGCTT ATCAACCCTT GTTATATCCT TACCTTCTTT AGGGTGATGA TCTTATCTTT ATTTTTGACC 000640
000641 ACACTTCCTG TGCCTGCCCC TTTTCTTACA TCAATTTCTG TTCCACTTGG GATTTTGTAC TCACAGATGT GACATGAATA 000720
000721 TTTATGAGAT TAAAGTGAAT GGTCCAACAT CAATTCAGAA AAATTTTGAT CCATTCGTCC AAATTATTTA AAATATTCCC 000800
000801 TTTCTAATTG AGGCAGCAAG GCCTCTAGCT TAGTGCTCAT TTTCCCTCTC TAAGCATATT CATGCCATTA GGGTATTAAT 000880
000881 TTCTTCTAAA CTAAAGCTCT TATCTCCTCA TAATTCATTA TGTCTGTTTT TGATTCGCAC TATCATTGTG GTCTCTCAAC 000960
000961 TTACTCCCGT TAACACAGTC ATTTTGGTGG CTTTTATAAT ACCTCCTCTG TTGTTCTTCA AATTTTGCCT AAATAGGCAA 001040
001041 TAACTTAAAT GAAAAAGTGT TCATGAGTGA GCCAGTGAGC TCAGAAATTT GGAACCTTTT TGCCATGGAG GGTGATGGTG 001120
001121 AGCTTAGTGA TAACAAGAAT TTCCCAATTA TGGCATCTAC CTTTCTAATC ATCTCTCCTT CCATGTGATA CTGTTTTTAT 001200
001201 ATAGTTTGTT TTGCCTGCCT CCCTCCCTTC CCATCTCTCT TCTTTCCTCC CTTCCTTCTC TCCTTTCCTT CCACTCTTCT 001280
001281 TTCCTTTATG GGAAAAACCA TTCTAAATCC TTCAAATTGT GGTTTGAGTT TACAGTCTTT TATTGAAATT ACATGTTGGT 001360
001361 ATGCTAGGAG TATATGAAAG CTTTTTTTCC AATTTGCAAA GAATGTTTAT ATGATTTTAT TATTTTTATA TGATTTTTGT 001440
001441 ATTAAAATCA TCCCAGATAT CCCCAAAATA GAATGCAAAG TTTACATTTT AAATTTACTT TGAAATTTTA AATAAATTTT 001520
001521 CTTCTAAGTT GTCTTTGGAT TACATATCCA GACAGATTGG AGGCATCAAA ATACTTCCCA CTCCTTTTTC AATGTTGTTA 001600
001601 GGGATATTGC TATGACATTT TTGAGAAGTA TTCGAAACAA CTCTATTAAG ATAAGCTCCT TTTCTTTATT GTCTTCATCT 001680
001681 TATATCTCTG GCTGTCAATG AAATTCTATC ATTTCCCACT CAAAGCCCAA TTGATCTCTG GAAGGAGGTG GGGTGTGCCC 001760
001761 TTGTATGAGC TCTTAAGAAT TTATTTAGCA TTTTTTTCTA GCTCTTCCAT CAGTGAAATC AAGATGGTAG CTGAAATTTA 001840
001841 CCACGGTGGA GGATATTTAC ATCACAGAAA TTGGCAAAGG CTACAAATAA GTTTAATTTT TTTTTTTTTA GAACTCGTTA 001920
001921 AGTATGTACC AGCACACCCT GTAAGGTTGA TTGATGCTTG ATCTAGGTGT CTTCCGTATG CTTTGTGTGT TACTCCTGTG 002000
002001 GCTTTACCAG CTATTATGCA TTAGTCTCTG GGGATAAGGG AAGTAATAAA CCTTTGTCCC AGGTCTCATA AATAGCTTGG 002080
002081 TTATTAATCA AATGATCATT TGGTCTGGCT TCCTTTGTTC GGTCCTATGC TTGTAAGTAT GGATGTGTTA GCTCTAGACA 002160
002161 GAACCATAGA TAAGGGAGCT AAAGCCCAGA GTGATTAATG ACTTACAAGG TCACAATTCT AGTTAGAGGG AAAGATGCTG 002240
002241 GGACAAGAAT CTGGGTTTTC TGAGTCCCAG GTAGGTTTTG TTGCTGTTAT ACAACTCTCT AATAATTTTC AAAAAATCCT 002320
002321 AAATCATAAA TATTCCATTG TAATAAGCTT CAGAAAGCAT TGCAAGCCAC AGATTTAGCT CTAATACTAT TAATCTTTGC 002400
002401 TTTTTTTCTA ATCTAAAAAC TAGGATACAT TAAGAAAAGT AGAAAATGTA AAAATATTGG TAAAAAACCA AAACTCCTAA 002480
002481 CAAATTAATA TATATATGTT AAGAGAGACA TACGTTTAAA CAATTTACTA TCATCTGTCA ACTTTTAAAG GAATGCCTGG 002560
002561 ATCTAGCTCT TATGACTTTC AATCGAAGTG ACACAATTTT TCCAGTAATT TTAACAGTTT TCTCCCAGCT TGCTTGTTAA 002640
002641 AGAAATCTGT AACATCTTTA GCAAGAAACA GAAAGACAGA GGGCAGCATG GGACCAGAGT GCAGCTGTCC TTCAGAAGAA 002720
002721 CCTGACTGTA CAACTTGCAC ATCTGAAGAG GTAGCCAAGA AAATAGAGAA AAGAATGGGA AGTGGATCTA CCCACATCCT 002800
002801 CTCCCATCCC CTCCAGAAAG AAATGCAGCA TCTCTTTCTC ATTGTTTAAG AAGAAAAGGG GGATCTCGGA GACAAAGCCA 002880
002881 ATGAAGGATA AATGAGTGCT CTGCCAGTTC AAAGTCAATG TGAAGATTGG ATGGGGAAGC GTGCGGAGCT CAGAAGCATT 002960
002961 TTTTTTTTTC CAGAGCAAGA GAAAATCCAG TGACTGAAAA AGAATAAGAA GAAAAACACT GACCAAGGCA ATAATTCCAA 003040
003041 GAAGTCAAAA ATTAAGGCCT GAGAGGAAGA AGGCAGACTA AAGAAAGTAG GAAAGATGCA AGGAGAAAGA CAGCCATCTG 003120
003121 CATACCAGGA AGAGGGCCCT CACCAGACAC TGGATCTGCT GCATCCTTGA TCTTGGGCTT CCAACCTCTA GAACCATGTG 003200
003201 AAATACATTT CTGTTGTTTA AACCAAAAAA AAAAAAAAAA
[back to top]

Predicted Small Protein

Name NONHSAT126940_smProtein_1697:1960
Length 88
Molecular weight 10325.0713
Aromaticity 0.172413793103
Instability index 57.7518390805
Isoelectric point 9.81719970703
Runs 13
Runs residual 0.0111658456486
Runs probability 0.0570162923105
Amino acid sequence MKFYHFPLKAQLISGRRWGVPLYELLRIYLAFFSSSSISEIKMVAEIYHGGGYLHHRNWQ
RLQISLIFFFLELVKYVPAHPVRLIDA
Secondary structure LEEELLLLLEELLLLLLLLLHHHHHHHHHHHHHLLLLHHHHHHHHHHHLLLLLEELLHHH
HHHHHHHHHHHHHHHHLLLLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLL
HHHHHHHHHHHHHHHHHHLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTToooooooooooooooooooo
TTTTTTTTTTTTTTTTTTiiiiiiiii