NONHSAT100939

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100939

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2006 nt

Genomic location

chr5+:32947549..32962573

Exon number

6

Exons

32947549..32947927,32949049..32949206,32950222..32950389,32958720..32958803,32961247..32961365,32961474..32962573

Genome context

Sequence
000001 GCGAGACTCC ATCTGAATAA ACAAAAAAAA ACCCCAAAAA ACAAAAAAAC AATGTCATGG TATTAGTTAA GGCAAGTGAT 000080
000081 ATTCGCTACC ATAACAAAAA TCTCTAAAAA TCTTATGGCT TAATATAATT AGAATGTACT ATTTGCCCCG GAGAATTCAG 000160
000161 ACAGATGTTT GTTTCTGCTC TAGTCACTTT CCATATAGTC ATTCCAAGTC CAGTCTTCTT CCATCCTGTG GCTCCGACTT 000240
000241 CTTCCAAGAC CTTAAAATCC TTTCTCCTTC CAGCAGATGC AGAAGGAGAG AATGTGTGGA GGATTATGTA GAAGGATTTC 000320
000321 ACGAGCCTGG CCTGGAGAGG ACACATCACT TCCAACTCCA TTCCATTACC AGAGCTTAGT CTATTTCCAG AAGATTATAA 000400
000401 GCTTCATGGG GGTAGATGCT TTACTTCATT CATCACTGCA TCTCCAACGC CTAGCACAGA GTCCTGAATT ACAGGCTCAA 000480
000481 TAACTATTTA CTGAATGTGA AAACAGAGCT GCTTGAAGAT ATATGAGGCT AATTGAGCTT GTGCATTTGT TGGCTATTGC 000560
000561 TGCTACAGTG GATGCTGTTG TTCAGTATTC AGATATCAAG AAACAACTAG CCAAAGTAAG GAGGAACGAG GGGCCCATGA 000640
000641 AGCTCAGTAG AGCTTGTGAT AATATTTGCA AGAGAAGACT ACCTGCCAAA CTAACTTCAG AAAAGGTGTT TCTGGTTGAA 000720
000721 ATCGTCGTGC CCAACATTCC ACCAGGTTCC TCACATGTTG AATACATCAA AGCCTACTCC ACCCATCTGG AATGGACAAG 000800
000801 CAGAAATGAC TCACTGTGTT TTCCAATTGC ACTCAAGACC TGTCTCCATA TGCCCTCCTC CTCTGCTTGA GCAGAAGGTG 000880
000881 AAAGAGGTTG TGTTGACTGT GGAATTTGGC CATTGAATCT TGAAGACAAA TGCTCTTTGA CTTCACTACA ACATCAGAGG 000960
000961 GCACTTGGCC AAGGAAGACT AGTTAGAACC AGCCTTTTTG CAAACACAGC TGAGGTTTTT ATCCTGTCTT TTCTTCATCA 001040
001041 CAAAAGTAGC ATGGAGAATA ATGGACAATA GAGTTCAAGT GGAGCAAGAA CTGTCCAGAA TTTGGGGAAG AAGTCACATA 001120
001121 TCAGTCCTGG CCATGATAAC TGGGGCACAT GTACTATATC CAAGTTGATA TCCTTAATTT GGGGAAAGTT TTCTTGCCTT 001200
001201 TCCCACAGCC TCCCAAGCCA TTGACAATTC CCACATTCAA AACATTCTCA TTCACATTTC TGATTCACTT TGGGCTACAG 001280
001281 GAACAACAGA TGTTGAGTAC TTCTAGGGAG CAAATCCATT ATAGATGGAG GTAGGTCTTG CAAACAGAGA GGTAGAGAGC 001360
001361 TTTCTCGAGA CCTTTGGACC AAGCAGATAA CATCACAAGG TGTACCTTAT GCTAAGTGCC TGATCTCTAA GACCCTCCAC 001440
001441 CAGCGTCAGA GAGACAGCAT GCTCACAACA TGACAGCTGG TCATACAGGT GAAGAAGAAC ATGAAGACCA GCCCACCCCT 001520
001521 GAAAATGACA GATCAAACTC AGCTTCACAA TGTCTCTTAT TCCATCGCTA ATGGTGTTTA AAAACACACT TTGTCTTAAG 001600
001601 GAACAAGGGG AACTGACCAC GTGTTCATGA CGGTTCATTT CTTACCCTCT GCTTTGGCTT GACCAAACTT TAGTCAGGTA 001680
001681 TCTCTCCTCC ACAAAGATTC CTAGACTTTG GCTGTCCCCC AAGTTTAAGC AAGCACTGAA ACACTAAGGA GCAGAACATC 001760
001761 TGTTAACAGC TCATCCTGAA AATCAACTGA CCATAGGAAA AAACACTCCC TATTGAGATA TCCTGGTTTT GCCACCTGCT 001840
001841 CGCCCACACT CTATGCCCTT CTCCTCCACA AAGGTCTGGC TGGTTCTTCT CCCTCCATAC AAAGGAAAAG CTTATTTCTG 001920
001921 TTTGATTTTG AGACATTTGC AGATATCTGA GGTTGGCATG TTCTTCCTAT TGCAATAGTC TTTTTGAGTA AAGTCTCTCT 002000
002001 TTATCT
[back to top]

Predicted Small Protein

Name NONHSAT100939_smProtein_1148:1333
Length 62
Molecular weight 7389.5976
Aromaticity 0.180327868852
Instability index 70.5540983607
Isoelectric point 9.69488525391
Runs 12
Runs residual 0.0439042773695
Runs probability 0.0244435612083
Amino acid sequence MYYIQVDILNLGKVFLPFPQPPKPLTIPTFKTFSFTFLIHFGLQEQQMLSTSREQIHYRW
R
Secondary structure LEEEEEELLLLLLEELLLLLLLLLLLLLLLLLEEEEEEEELLLLHHHHHHLLHHHHEEEE
L
PRMN -
PiMo -