NONHSAT029172

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT029172

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3167 nt

Genomic location

chr12+:65277174..65371302

Exon number

3

Exons

65277174..65277645,65294357..65294478,65368351..65371302
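
The Genomic location and Exons fields above can be turned into per-exon lengths with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the entry does not state its convention); the function names `parse_location` and `parse_intervals` are hypothetical helpers introduced only for illustration.

```python
# Field values copied from this entry.
LOCATION = "chr12+:65277174..65371302"
EXONS = "65277174..65277645,65294357..65294478,65368351..65371302"


def parse_location(loc):
    """Split 'chr12+:start..end' into (chrom, strand, start, end)."""
    chrom_strand, _, span = loc.partition(":")
    chrom, strand = chrom_strand[:-1], chrom_strand[-1]
    start, end = (int(x) for x in span.split(".."))
    return chrom, strand, start, end


def parse_intervals(field):
    """Split 'a..b,c..d,...' into a list of (start, end) integer pairs."""
    return [tuple(int(x) for x in part.split("..")) for part in field.split(",")]


chrom, strand, locus_start, locus_end = parse_location(LOCATION)
exons = parse_intervals(EXONS)

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
lengths = [end - start + 1 for start, end in exons]
span = locus_end - locus_start + 1

# The first exon start and last exon end should coincide with the locus bounds.
bounds_match = (exons[0][0], exons[-1][1]) == (locus_start, locus_end)

print(chrom, strand, span, lengths, sum(lengths), bounds_match)
```

The summed exon length printed here can then be compared with the Length field; the outcome of that comparison depends on the coordinate convention assumed above.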

Genome context

Sequence
000001 AATAAAATCT GGGAGCCCAA GCAGCATATT TGTGGGATTT GTCTGACAGA ACGCAATGGA AACAGAGCGT GGCATTCGCT 000080
000081 CCATCTCCCC CAGGTCACCT GTGGAGAAAG AAACATCATC TCTCAACAGA GGATTGTTCC TCCGTTTGAT GATACGTGCA 000160
000161 TCCTGCCATC TGGAATTCAA TTGACTGTTC TGTCTTGGAG CCAAGGATCT AGAAACTCTC CAATCCTGAA GTCAAAATGG 000240
000241 AATGCATCTC TGCTCTTCCT CTGGGCATCA TCTAATTTTT TTGTTCTTGT TTTCCTTGTG CTGGAATCTA CAACACCCTC 000320
000321 TCCTAGCCCC TCAACCAGCC TTTCTTCTGC AGAGCAACTT AGTTCTAATT AGCAACTACA AACCTATTCA CCTACATCCA 000400
000401 AGAGTTCAGT AGTGTCTTTT AAGTATGTAA CAGGCAATTC TTTTACATTT TATTGAGATG ACTACTTTAA AAAAATTGGT 000480
000481 TGCTACCCAG AACTGTTTAG AACATTGCTT GCTTTAGGTA AGTGAGTGAA GCAAGATGTG ATAACATAGG GCTGGGGGAC 000560
000561 TAGAGTTGGG TAATATGGTA GCACATGATT TTGTTACTTG AGATTGCCTG TAATTGTAGA CATCAGTGAG CAGGGTGATA 000640
000641 TGAACCTCAC AGGGGCTCTT GAAGTTCTGC TTCTTCCCTA TTCCTTAATG CAATACAATA TTTAGGTTCA TCTTTTCAAT 000720
000721 TTTCCTGGAG AAATTAAATA CTGTGCCATA ATAAAAAAAT AAGTAATTTG AAAGTAAAGG AAGGGCATGA ATGGAGAGAA 000800
000801 GGCAGGAAAA CATTCTAGGT TGGAAAAAAT GGCAGAAGTA ATGTTGGGAC ATTGGAGTGA TGGTGGAGGT GGGAATAGAG 000880
000881 TCTAGGATCT CTTCATCCCA TACTATGCAT CTATCTTCAC AAAATTAGTA TGGCCAAATA TTAAATCAGC TCATTTCTGG 000960
000961 TTGGGAAAAG TAGGGTGAAT GTGCCTTTCT AGCTGTCCTT GAGATGTAAA TCTCCTGTAA CATAGCTTCC CTAGTGGTCA 001040
001041 CCTCAAAGGC TTTGTTCTCT CCAGGATAGA TGCTACAGAG TGGGACAGTA CTTGGTGTGT GTGTATCTCA ATTTTTTTTT 001120
001121 TTTTAACATG ACAAATTGAA CTAGGGGCAA GAGAAGGATA GACAGAGGAG CCTATGATGG GGTGGTGGAT ACTAAATATT 001200
001201 TTCCAGGTGT AAGGAGCAGA GGAGGAAAAG GGAGACTGGG TAACCCTTTT TCAGACCATC ATTTAGTCCA TTTTATGTTG 001280
001281 CAAATATTAG CAAACCTACT CCAGCTTTCT TAAACAGTAG GGTTTTCATT ACAACATAGT TAAGAGAGTA GGCTTAGGAG 001360
001361 TAAGATTGCT TGATCCTGAG TAAGTTACTT AATCTTGGTG TCTCAGTTTC TTCATTCCAG TTAAGTAAAT ATATTTAAAT 001440
001441 TAAATTATAT TAGTTAGTAA ATGTAAAGCA CACAGAAAAG TGTTTGGCTT CTGGTGAGCT TTCAATAAGT GTTAACTATT 001520
001521 ATTTGTTATT ATTATTTTAA CAAAACCAAC CTCCAAAACC TATGAAGTCA GTTTACAGAA AGCCACCAGG AAAAGGGAAA 001600
001601 TAAGAAGATT CTTCACTTTG CTGTCATTTG ATACACTTAG CATTTTAAAT GACTATATGT ACAATATTAA CCATGCCTTT 001680
001681 CCACAAAACT ATGTGTGTGC ATATGAATTA CAGGGCTTTC ATTTGAAACT GAACTATCTG TGTTTGTAGT CACTTTAATC 001760
001761 ATACAACAAA CCTACAAAAT ATTTAAGCAT AAATTATCAT GATGACTAAA TATGGGAAAG CTTCTGTTAC CAAAACCTGC 001840
001841 ACCCAAATTG ACTGCACAGG ATGGGTGATT GGATGGTGGT GTAAAGAATG CTCATGAGGA CTAGGCTGCT GCTAATTAGA 001920
001921 TCTTCTATGC TATTTAAATG TTCTCCAAAC TTCAGGGAGT GCCCTGAAGG CAGTTGGACA TTAGCCTTGT GGTTTTTGAG 002000
002001 GGTAACCTAC AGTTTGTCAG CACAGGGGTG GAGGTGAGCA CCATGGGACT TTTCCCAGGC TGCTGGTTAA CATACTGTGG 002080
002081 TTTCAATCCT TTAAGCCAAT AAATGGAGGA GAGAGAAAGA AGGCAGACTG AAATTAGAGG AAAAGAGACA AAGAAAGGAA 002160
002161 AAGTGGAAGT AAGAATAATA TTCATGAAAT GTCTTCTTTG GGCCAGATAT TTTTTTAGGA ACTTTCACAT ATTGTGCTTA 002240
002241 CATCAACTCT GTGGGAAATG TATGATTATA CTATCCTCAT TTGTAACAGA CATTGTTGGG TGCCTATATA ACATATAACA 002320
002321 GCTATTTTTC TTCATTTTTT TCTTGCTGAT GGCGCCTTTT TTTTTTTAGT GTTTACTTCC CATACAAAAC TTATGTCTCC 002400
002401 ATGTCTCCAT CCTTCAGCAA TAGGGGTAAT TCTGAAAGGT CATTGTTTTG CAATTGCCCT TGAAAGGAAA TTCTTGGACA 002480
002481 TAACAACAGA TAAGATTCTA GCCAATGAAA GGTGATGAGA AGGTTGCTGA GAAGTTTCTG AGAAAGGTTT CTTTATTCTT 002560
002561 TAACAAATAC CCTGGAAAGA GACAGGCCTT CTGCTGCTGT TCATGGTTGT ACCTGCACAG GATGCCTGAA AAGGAGTAAC 002640
002641 CGTGGGGTCA GAAGAGGAGC CAGCGTGAGG ACAAAGCAGA TACACTGAAG TACCACCTGT CACTGAATTT TTGTGCAAGA 002720
002721 GTATAAACAT TCTCATTGTT TAATCTTTCT GTGGGAGACG CTTTACTATG TGTAGCAAAG TCATTCTAAC TGACATACCA 002800
002801 TTTTCCATTG TATGAGAAAA CTGATGTACA GAAAGGTTAA GAAACTTGCC TGCATTTGCA TGCATAATGA GGAGTGGAGA 002880
002881 GAACAGGTTG TGCCTTCTAA TATACAGGCA AGATATGTCT CTTTTAAACA GCACATACAG ACACACTCCA AAATGACAAT 002960
002961 TTCCATCATT TTTTAAAAAT TAAATGAGAC AAAGACTCTA AAACTATTTA AATAACATGT ACATTACTGT CTTGGGATTT 003040
003041 ATTTTTGTTT CAGCTCTATT GAGATATAGT TGACAAATAA AATGGTATGA ATTTAAGGTG TACAATGTGA TGATTTGATG 003120
003121 TATGTATATA TTGTGAAAAG ACCAAAATAA AGTCAGTTAA CATATCT
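
The sequence above is displayed in rows of 80 nt, split into 10-nt blocks, with a leading position counter and, on full rows, a trailing one. A minimal sketch of how such rows might be joined back into one contiguous string is shown below, assuming the counters are 1-based positions; `read_numbered_rows` is a hypothetical helper, and only the last two rows of the entry are used to keep the example short.

```python
# Last two rows of the sequence block, copied as displayed.
TAIL_ROWS = """\
003041 ATTTTTGTTT CAGCTCTATT GAGATATAGT TGACAAATAA AATGGTATGA ATTTAAGGTG TACAATGTGA TGATTTGATG 003120
003121 TATGTATATA TTGTGAAAAG ACCAAAATAA AGTCAGTTAA CATATCT
"""


def read_numbered_rows(text):
    """Join position-numbered rows into (start_position, sequence).

    Assumes 1-based counters: the leading number is the position of the
    first base on the row, the optional trailing number that of the last.
    """
    start = None
    pos = None  # position of the last base read so far
    chunks = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        lead = int(tokens[0])
        if start is None:
            start, pos = lead, lead - 1
        elif lead != pos + 1:
            raise ValueError(f"row starts at {lead}, expected {pos + 1}")
        body = tokens[1:]
        trail = int(body.pop()) if body and body[-1].isdigit() else None
        row = "".join(body)
        if set(row) - set("ACGTN"):
            raise ValueError(f"unexpected characters in row starting at {lead}")
        pos += len(row)
        if trail is not None and trail != pos:
            raise ValueError(f"row ends at {pos}, counter says {trail}")
        chunks.append(row)
    return start, "".join(chunks)


start, tail = read_numbered_rows(TAIL_ROWS)
last_position = start + len(tail) - 1
print(start, len(tail), last_position)
```

Under this reading, the position of the final base can be checked against the Length field of the entry.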

Predicted Small Protein

Name NONHSAT029172_smProtein_2102:2317
Length 72
Molecular weight 8557.2278
Aromaticity 0.112676056338
Instability index 52.9169014085
Isoelectric point 9.36334228516
Runs 12
Runs residual 0.0204595997035
Runs probability 0.0555555555555
Amino acid sequence MEERERRQTEIRGKETKKGKVEVRIIFMKCLLWARYFFRNFHILCLHQLCGKCMIILSSF
VTDIVGCLYNI
Secondary structure LLHHHHHHHHHLLEELLLLLEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHLLLEEEELLL
LHHHHHHHLLL
PRMN LLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLHHHHHHHHHHHHHHHHHHH
HLLLLLLLLLL
PiMo oooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiTTTTTTTTTTTTTTTTTTT
Toooooooooo
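
The predicted small protein can be related back to the transcript sequence by translating the interval given in its name. The sketch below rests on two assumptions that the entry does not state: that `2102:2317` is a 0-based, inclusive interval on the transcript, and that the standard genetic code applies. Only the three sequence rows covering that interval are embedded; `parse_rows` and `translate` are hypothetical helpers. The aromaticity is computed as the fraction of F, W and Y residues, which is the usual definition but is likewise an assumption about how the listed value was obtained.

```python
# The three sequence rows that span the predicted ORF, copied as displayed.
ROWS = """\
002081 TTTCAATCCT TTAAGCCAAT AAATGGAGGA GAGAGAAAGA AGGCAGACTG AAATTAGAGG AAAAGAGACA AAGAAAGGAA 002160
002161 AAGTGGAAGT AAGAATAATA TTCATGAAAT GTCTTCTTTG GGCCAGATAT TTTTTTAGGA ACTTTCACAT ATTGTGCTTA 002240
002241 CATCAACTCT GTGGGAAATG TATGATTATA CTATCCTCAT TTGTAACAGA CATTGTTGGG TGCCTATATA ACATATAACA 002320
"""


def parse_rows(text):
    """Return (first 1-based position, sequence) from position-numbered rows."""
    first = None
    parts = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if first is None:
            first = int(tokens[0])
        parts.extend(t for t in tokens if not t.isdigit())
    return first, "".join(parts)


# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt):
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt) - 2, 3))


first, seq = parse_rows(ROWS)
offset = first - 1  # 0-based transcript index of the first base in ROWS

# Assumption: 0-based, inclusive interval taken from the entry name.
start0, end0 = 2102, 2317
orf = seq[start0 - offset:end0 - offset + 1]

protein = translate(orf)       # includes the terminal stop as '*'
peptide = protein.rstrip("*")  # residues only
aromaticity = sum(peptide.count(x) for x in "FWY") / len(peptide)

print(len(orf) // 3, peptide, aromaticity)
```

With these assumptions, the interval covers 72 codons including the stop codon, and the translated residues can be compared directly with the Amino acid sequence field.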