NONHSAT144755

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT144755

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2677 nt

Genomic location

chr17+:655573..658576

Exon number

1

Exons

655573..658576

Genome context

Sequence
000001 CAGCGTTCCG GATCCTAGAC GCTTTTTGAC GGCTGGTCCT CCCCAGAAAA TGCGCCGTGT GAGCCCAAAC ATGGTGGGGT 000080
000081 TTACGCTGGA CCCCGAGCTT GCAATCTGTG CTTGACAAAA CACAAACATC GCGTGCCTGT CTCCTCCCCC TGAGATCGAG 000160
000161 TAGTAACAGC CACTCCAACT CTCCACCTCC AGCTTCTAGC ACCAGGGACC GCCTCCACCA CCCCATGTGC CAAGTGGAGT 000240
000241 TCGAGCTGCG CGGCCCTCAA GCAGCTGAAG GGTCCCGTGA GCGATCAGGA GAAGCTGCTG GTCTACGGCT TGTACAAACA 000320
000321 GGCCACCCAG GGCGACTGCG ACATCCCCGG CCCTCCGGCC TCAGACGTGA GAGCCAGGGC CAAGTGGGAG GCTTGGAGCG 000400
000401 CGAAAAAAGG GGCGTCCAAG ATGGACGCCA TGAGGGGCTA CGCGGCCAAA GTGGAGGAGC TGACGAAGAA GGAAGTGGGG 000480
000481 GGCGTGGAGC GCGAACAAAG GGGCGTGCAA GATGGACGCC ATGAGGGGCT ACGCGGCCAA AGTGGAGGAG CTGACGAAGA 000560
000561 AGGAAGTGGG GGGCGTGGAG CGCGAACAAA GGGGCGTCCA AGATGGACGC CATGAGGGGC TACGCGGCCA AAGTGGAGGA 000640
000641 GCTGACGAAG AAGGAAGTGG GGGGCGTGGA GCGCGAACAA AGGGGCGTCC AAGATGGACG CCATGAGGGG CTACGCGGCC 000720
000721 AAAGTGGAGG AGCTGACGAA GAAGGAAGTG GGGGGCGTGG AGCGCGAACA AAGGGGCGTC CAAGATGGAC GCCATGAGGG 000800
000801 GCTACGCGGC CAGAGTGAGG AGATGAGGAA GAAGGAGGCT GGCTGAGGGC TCCTCGGGAA TGGAAAGGGC TTCTTAGACC 000880
000881 TTGATGGCTG AAATGTCCTG AAACTGTCGC AAGCTTAGCC GAGACATCAA TAAATCACTT AAACTGCATG AGAGTGACTG 000960
000961 CTGTTGGAAA GGAGAAACTC GGTGACTGAG TGCTGGGGAG GCGTCCAAAT ACACAGACTA ACTTTTTGTA TGGTTTTGGG 001040
001041 TGGAGTTCCC AGCTCTCCTA TTACAAGAGC GTATGTAGGT ATTTGGAAGG ACGAGGGGGA CAGTGTGCTG TGGAAATGCC 001120
001121 CAAGATCTGA GGGCAGGAGA CCATTCTCCA GTGACAGCCT CTTGGGAGGT TGAGGCCCCA GCCCCTTATC TCTTCAAGGC 001200
001201 TGATTTACTA ATACCCCAGC ACCACAGAGG GGAGAGGGAC TCTGTCCTCC TAAACACTAA AGTCAAGTCC AAAAGCCCCC 001280
001281 ACTGGAACAC AGGGCCTACC CGGTTCATGT TTTGAGGGAG ACCAAGCTTT CCGTGATGAT AATGTGATGA AACAATTGAT 001360
001361 GCGCCAGAAC ACAGCAGTTT AGAAACAATG GAGCAGATGG TGTCAAAATC CCAGCCCCAG AAAGGCTCAC CTAAGGCCGG 001440
001441 GCAACCAAAA TGTTCCCGGG ACTATTCTGT ACTTAACCCT GGTTTCTGCT TAGGGGACGG TGGGACAATA TGCATATGGA 001520
001521 TGTCTATGTG GGATTTGTTG CAAGAGCACA ACACACCCCT TGACCTATGC AAGTGCCAGA GTGTCTGAGC CCAGCTTCTA 001600
001601 CTCCTGTTAA TGATCACAGG AGCCTCATAC ACAACCATTG TGCATGGCGG CTGGAGTATG GAGTAGTAAA TGGTATCCAG 001680
001681 AAAGGCAGGT CTACGGTGGC ATATGGGATT TCCTGGTACT TCTTCTTTAA TAGAAACTAT CTTCACAGAT CATGACATCA 001760
001761 GGTCAGCTGC AGTTCTCACT GGGACCAATA AACAATAGAA GCAACAACCA TAAAGACTTA AGAGGACAAA GGAAACCAAA 001840
001841 CCTCACTGTG TGTCTTCAAG ATCTGTTCAC ATAGTTGCAG GAAGTTGCTG TAGACAAAGT ACATTACTAA AGAATGAAGC 001920
001921 CCACCCAGGA CATCACAGCA ACGAAAGAAC TATTTCTGGT AACTGTGGAC CAGGTGAGAG CCACACTGTA GAACAGCAGT 002000
002001 GCTGCCCTCT CTTCTTTGGC GAATGTTATG AAGACATTCT GATGCACTTA GGATCATGAT TCATGTTAAA TGTAAATTAA 002080
002081 TTGTAAAAAT AATACATCTT GCTCCAAAAT GTGGTGACAA TTTAGAAGCT GTCCACATAG CCAGACCACT CTGTAAATCA 002160
002161 CAGAACCTAG AAAGTAAAAA TGTGCCATAA TTAGGACAGC AGAAAGTAAC AGTAAAATAA AGTTTTAAAA AAAAAAACAG 002240
002241 AAAATAAGGA TTTATCTATT TGTATTTGCT TAAGAAGAGG TTACCTATAG GATGCAGGGG GAATGGGATT GATGGGCATG 002320
002321 GGGTATACTG TGAAAGGAAA ATAAAATCTT GGGACCAAAT TCACTAAGCG AAAAGGAAAA GTTAAGCTTG GAAATTGAGT 002400
002401 CATGGAATAT GCAAAAAACT TCCTTTTGTT CCTAAACAGA TAAGCTACAA GACAGAAGGC CACATATCTC CCCAGGTGGT 002480
002481 CTCCCTCACC TGACAATATA AATTAACAGC TTATCTTCAT AGGTAAGGGA CAAAGACAAC ACCAGAAACC ATCCCTCACC 002560
002561 AGCCGAGACA AAACATAGAT GTACTGAGCT GAATGCATAA CTGTTCCTCT GCCTGCTGCT TTCACTGTAA CATATAGATT 002640
002641 TGGTGAGCAG TAAATGTAGA TTTACTGAGC TGAATGC
[back to top]

Predicted Small Protein

Name NONHSAT144755_smProtein_224:523
Length 100
Molecular weight 10651.5918
Aromaticity 0.020202020202
Instability index 53.4252525253
Isoelectric point 11.1576538086
Runs 15
Runs residual 0.017825311943
Runs probability 0.0379202732144
Amino acid sequence MCQVEFELRGPQAAEGSRERSGEAAGLRLVQTGHPGRLRHPRPSGLRRESQGQVGGLERE
KRGVQDGRHEGLRGQSGGADEEGSGGRGARTKGRARWTP
Secondary structure LLEEEEEELLLLLLLLLLLLLHHHLLEEEEELLLLLLLLLLLLLLLEEELLLLLLLEEEE
ELLEELLLLLLLLLLLLLLLLLLLLLLLLLLLLLEELLL
PRMN -
PiMo -