NONHSAT101354

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT101354

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3155 nt

Genomic location

chr5+:44826678..44836603

Exon number

4

Exons

44826678..44828418,44830990..44831818,44832987..44833074,44836107..44836603

Genome context

Sequence
000001 TTTTTTCTTT TCCTATTGGA GGTTTGACAC CTATATTTTT CTCTGACATT TTGGAAATTT GCTTCCCAAA GTCAAAAACA 000080
000081 TAATGCCTAA CTTTTCCCGA TTTGGCCAAC ATCAACTTTA AGATTCTGTG GTTATCTTTT TACCAAGAGA TTTGTCAGTT 000160
000161 TTACTGCCCC AGTCATTTCC TTTTTACTGA TTAGAATTAG GCCTTTGTGT AGCTGTTCAC CTAATTACTT CCTCTACCTT 000240
000241 GTGAAAAAGA GCAGGGACGC TAACATTTGC TGGGTACCTA CTACATGCCA AAAGTACTTA ATCGTATGCA GTAATGTCAT 000320
000321 TTAATTTTCA AAACAACTCT CCAAAGGAAG TGTCACTCTC CTTGTTTGAT TTCTTGCTTC CCAAATGGAA GGTCCTTTGT 000400
000401 ATTAATAGTA TCTGTCAAAT TTCATACAAT ACTTAAATAT GTTCTTGATA CGTACTTCAA TATAACCAAC AGAATGAGGC 000480
000481 AGTATTCGAT AAAGTAAAAA ATAAGATGAA TATTTCTAGT GATTTATAAG TTACTTGAGG AAATCATGCT GTTTATTCTT 000560
000561 TTCCTTTTTA GGTATTCTTT GTGAATTTGC TATTTGTGTG TGAATATATC TGTTGCGATA ATGAATAACC ATATGAAATG 000640
000641 GATAATTGTA TGAAAATTCA TTTGTAATTC AATAGATTGC CAGGGATTTT AGGTTGGTTA TGAAGGTTTG TTTTTTTTTT 000720
000721 TTTTTTCTTT TGGAGAGTGG GGAGATTGGC ATACAGTTTC AAATTGTTTA TGTGGAAGTT GGAAGTGTGA CTAAGCTCGA 000800
000801 AGAAAGGAAG AGAGGGACAA AGAAAGGGAG GAGGTACCCC TAAGTGGGAA CCTACCAGGA CATTCAAAGC AAGAGCAGTA 000880
000881 AGTTCTGAAT GTTCTGGGAC AACCTGGGTG ATATGCATGG ATATGGGCTG TGGAGGCTGA GCATTTTAAT GATAACTTAG 000960
000961 GGAAACGAGG CATGGCCATG GTGTAAAACT CTCAAATCCC AAGCCCTAAT CCAACCTTAA AATCCGAGTC TTCTAAAGGG 001040
001041 CTGTTTTAAC CATGAAAGGA CCATAAGAAA GGCAATTCAC AGAAAATGAA GCCATGTGGC CAAGAAATAT AAGAAAAACA 001120
001121 GTAAAAGCCC TTAATCTCAA TAGCAATAGA GTGGATGCAA ATGAATATAA TGAGTTGCCA TGTCGTTCTT ACTGGATTGG 001200
001201 AAAAGAAATT AGAATGTCTA AATAACATGT ATCATCCAGG ATGTGGGGAA ATGGGAGCTC TTGTACCCTG CAGGCAGGCA 001280
001281 TTTAAATTGG TGCAACCGCT TTGGATTGCT GCTTTGTAGT ATCTGGTGCA ACTGAAGATG AACATGCCCT GTGACACAGC 001360
001361 AACCGCACTT CTAGGTCAAT ACCCTAATTA TATTCTTACT GTGGTTCACA AGAAGGTATG TAAGAGGTCA TTGCCTGAGC 001440
001441 ACTGTTTAGA ATAGGGGCAA ACTGGAAATC CTCTAAATGT CTGTCAATGA AGGAATAGAT AAATTGTAAT ATGTTCATAT 001520
001521 AAAATGCTGC ATAAATAAGT GAAATTTATA AATATACTAA CGAATGAATC TTGAAAACAG AGTTGGGAGA TAAAAGCAAG 001600
001601 CTGTTGAAGA ACATGGTCAG TATCCTCTCA CTTATGTAAG TTAAAAACTC CAAAGAACAT TATCTATATT GGTAATGGCA 001680
001681 TAGACATGTG TGGTAAAATA TAAAAATATT AACTAAAAGT TCTATACGCT TCAGGATATT GTATTTTTGC TTATATGTAT 001760
001761 ACCAATGTCT TCATGCTTGG CAGTAGGAGA AGAGAAGGGA AGTTGGGGCG GAAAAGAGTA GTGATAAGAG AAGATATAAA 001840
001841 GCTATTTTTA GTTTTAAATT AAATCCACAA AGACTAAATT TGATGAAGGT GATGGGGTGT TTGGAAACAT CCAAGACATA 001920
001921 ATGGTTTTTC AGAGGATGGT TTTCATATTC TTTATATTCA AGTTTACTTT TTCCTAATAA TAAAAAACAT AAAGGAGTCA 002000
002001 AATATTTTTA GATGCCTGTG GATTTTGAGA ATCTTACCTC ACTCCAGGTT TATTTCTGAG CTCTATTATT CTTTTCATGT 002080
002081 ATATGAGGCA AATACAATTA AAATTTAAAC ATAAAATACA GTGTTGCTGC TTTTGCTGTA ACACATAAAG TAAGCATAAA 002160
002161 TGAGTCTCAG TTACACAAAC ATAAAAATAA TCATCCATCC AGGAAAGTAA TTTAAAACAA ATTATAGTCA GGGAACTGAA 002240
002241 ACTGATACTT TTAACTGTTG ATATAAAATG TGAATTAGAT TGTAAAAACT CAAGAAAATA ATTATTATAA ACTAGGTATA 002320
002321 TATGTGTGTA TCTGGAATGA AAATTTGTAG AAATAGATAA AAAGCCACAA AATCCTTTTA GTCTTAGAAA TAATAATCTT 002400
002401 AAAATGCTTT TAGTAAATTA CAAAGCTTTT GTTGTCAAAA AGTACTACAG TCTGATTAGG AATTACATTT TCTGTGATCT 002480
002481 ATGCCATAGA CCTCTCCACA TGTGCTAACT TTTCCTGAAA TGTAAATGAG CAATGTTTAT GAAAATAGCT TATTTTATTA 002560
002561 ATATTTGAAG TTGAGACTCT CCTCCTGTTT ATCCCATCAT ATACTCACTG TATTGTACTT CTTTGTTCCT AGGTTCCCTG 002640
002641 AGAGTATGCA TTGTGTTTGG AAGCTAGGCC TTAGATGAGA AGATGGTCCT GGAGGTTGCT ACTGGAGAGG AGATTGGAGC 002720
002721 GTGGTTAGAG ATGAAGGGTC AGAGTCCCCA GAGTCTGTGG CATGCAAATA CTGTTGAAAC AAACAGTGGA TCCACTGGAC 002800
002801 CAGTGCATGT CTTGTGCCTT TGCACACTTT TTTACATGCA CGTAACCAAG GATGGCTTCA AGATCATGGC CCAATTATTA 002880
002881 ATCGAGAAAA AGGGATACAT AAAGTAAGAG CATGATGGTG CTTATCCTAG TGGAGACTTC TCCCTTGGTT GCCATAGTCA 002960
002961 TTGACTAAGC ACTTCGTGGG AATAAATGTT CCAATAGCAG AGACTTCCAC CATTAGCAGA ACCATCAGCT CATATTTATT 003040
003041 TCACTGCATA TTTCACTCGA TTACTTGAAA AGACATATTT TTAGAGATAA ACTACACTGG GACAGATATC ATAACACCGT 003120
003121 AGTTTTTTTT GCTAATTCAC ATATATTATT TCTGG
[back to top]

Predicted Small Protein

Name NONHSAT101354_smProtein_1154:1318
Length 55
Molecular weight 6032.286
Aromaticity 0.0555555555556
Instability index 57.8277777778
Isoelectric point 7.69378662109
Runs 9
Runs residual 0.00397470641373
Runs probability 0.0281752340575
Amino acid sequence MQMNIMSCHVVLTGLEKKLECLNNMYHPGCGEMGALVPCRQAFKLVQPLWIAAL
Secondary structure LLLEEEELEEELLLHHHHHHHHHLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHL
PRMN -
PiMo -