NONHSAT054253

From LncRNAWiki
Revision as of 07:14, 13 October 2014 by 73.162.128.239 (talk)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT054253

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2636 nt

Genomic location

chr17-:45127626..45177612

Exon number

6

Exons

45127626..45129033,45131239..45131582,45133826..45134123,45144780..45144863,45149837..45149925,45177201..45177612

Genome context

Sequence
000001 CTGGGGGAGC AGCCGTTGGC TCTGGTGCCC CTGGGGGTGG GAGCACGAGT GGGCAAGGAT GATGGTGAGG CAGAGGAGGC 000080
000081 GGCCCCGAGG GAGCGCCCGG CTGAGAGGCG GGGGGCCGAG GGGCCCGGGG AGCGGGCGTC ACCGAGGGCC CTGGAAGCGG 000160
000161 CAGGGCTGGG GGAGAGGAGA CGCGTGTGTG GAGCATGGGG ACCCCAGCCC AGCTCCCACT TGCAGACCTG GGCCGCGGGC 000240
000241 TGCCGGCCGA GTGGCCGGGG GGCCTGGCTG CCCAGGAGGC GGGCCGGGGC CGCGGCCGGG GGCGCGGAGC GGAGTTAGGG 000320
000321 TCGCCAGGCC GAGCCGAGGT GGGACGGACC GACGCAGAGA GGAAGGGAAG CCACCTGGCC TGAAGATCCT GTCTCTAAGA 000400
000401 TCCCATTCTG AGGCTTCGAG GGTCTGGCTT GAGCAGCACA GACAGGGACG TGTCTAACCG GGATGACCAT GAGGTCTCAT 000480
000481 CGCAGTTTGC CACTAACCAG GGCAGCCGTT AGACAGCACA ACCGAACCCA ACACCGGGGC CCAACGGCAG GGGCTTGGGC 000560
000561 AGGAGCAAGG TCCGAGGGCT GCCAGTTAGG TCCACGCTGC TTTTATGAGC TGTAACACTC ACCGCGAAGA TCTGCAGCTT 000640
000641 CACTCCTGAG CCCAGCGAGA CCACGAGCCC ACCGGGAGGA ACAAACAACT CCAGACGCGC TGCCTTAAGA GCTGTAACAC 000720
000721 TTGCCGTGAA GGTCTGCAGC TTCACTCCTG AGCCAGCGAG ACCACGAACC CACCAGAAGG AAGAAACTCC GAACACATCT 000800
000801 CAACATCAGA ACGGACAGAC TCCAGACGCA CCACCTTAAG AGCTGTAACA CTCACCGCGA GGTTCCGCGG CTTCATTCTT 000880
000881 GAAACAGCAA TGAATCCTCC AATGTACCTG ACTCTCCCTT TGCGAAGAGC ATCTCCGTGG CAGAAATCTG AAAATGCCCC 000960
000961 TGGGGAGACA CATGCACAAG ACAGTGAGTG ATGCAGCCGT TTCCCACGTA TCTCACAATG TACTTCTCTG GTCTTATTAG 001040
001041 AACTAAATGA GTATCTCAGT CCATAATCAC AGGGAGAAGA ACCACCACAG ACCACATACC TGGGGTCTTG AAAATAATTC 001120
001121 CATGCATGTG GGACTTTCAG AAGCTCTCCA TGTCTGTCCA GAAGGGCCCC ACAATATACT GGGGGGACTT TGTATGTGGC 001200
001201 TCAGCATGGA GCAGGGGCAG GATGTTCAAT TTTGCTATTA TATCGACTTC TTAAAATAGT CTAGTGGGAT TAACTTGGTT 001280
001281 TCAATTCACA GAGATCTGGA AGCGAGGATC TTTTAAAAAT CCTGAAATAT ACACTGCAAT AAAAGAACAA AGCATACACC 001360
001361 TCAGCCTTAA ATGACTGAAG AAGTATGTCA AGTAGCAGCA GGTGGGAAAG TGGCTTTGGT TTTCAGTTTG TGAGCTCTGA 001440
001441 ATCCACACAG AAACAGGACT GCATTCTGAC AACCTGAATT AATTATTGTC CTTACCACAA TGAGGCAGAA AAGTATAATA 001520
001521 AAAATCATTA GTATTTCAGT CACAATTAAT GCCAAGATGA GTTTGTCAGT ATAGCCATAT CCTGGAACTT ATTTTGTGAG 001600
001601 CTAAAAAAAA CAAAAAAACA AAACAAAAAA AAACACACCA GAATGAGAGC TAACTATTCA AAACCCCAGT ATTCCAGGTG 001680
001681 AGTAGCTTAC AGGTTCTTTT TTATTTTTTT GAAAGAGGGT CTCACTCTGT TACCCAGGCT GGGGTACAGT GGTGCAATCA 001760
001761 CCGTTCACTA GACTCGACCT CCCTGGGCTC AGGTGACCCT CCCACCTCAG CCTCCCAAGT AGCTGGGACT ACAGGCACGT 001840
001841 GTCATCAACC CAGCTAATTT TTTTATTTTT TGTGGAGACA GGCTTTCACT ATGTTGGCCA AGCTGGTCTC AAACTCCTGA 001920
001921 CTTCAAGTAA TCCACCCACC TTGGCCTCCC AAAGTGCTGA GATTACAGGC ATGAGCTACC ACCCCTGGCC TACAGTTCAT 002000
002001 CTTGTGCCCT AATCTATATT TCACTCTCTA CATGAGCAAA GTGGGAGATC ACTGTCATGA CCAAAGTTAC ATGGCCAAGA 002080
002081 TAAGCTATGG CCTGGGAGTC CCAGATTCTT CTGTGTGGGC ACTTTCCTGG GATATGCTAA ATGATGGGAA ATCTGGGTCT 002160
002161 CATGTTTCTG TGTGGTCCTC ACCTCAAGCG ACTTCTCTTT CTGTTCACTC TGGGCTTCCG TGCTCTCATT AATGTAGTTC 002240
002241 TGAGTCTTCC ATTGGTCCGT ATCCCATTCT ATCTCAGATG CCTTTACTTC CTGCTGCCCA CTGAGAAGCT TCATCAGGTG 002320
002321 GCCTGTCCTG GAGATGAGCT TGGCACAGGT CACTTGCACA TGGGCCCCAG AGCAGTCCAT CTTCAAGGTC CGGATAACAT 002400
002401 GAGAAATGAG CCTTCTCACA TTGTTGGGGA TAAGGGACTG TAGCTGCTGG GTTAGCTGAA TTTCAAACTG AGCAATGGGT 002480
002481 AATTGAAGCT TTTGGGCTCG GGGGACAGGT CAGTGCCCAC GTTGTTGTAT TCCCATTTTG TCTCAGTTTG TTTAACAGTT 002560
002561 GGCCCTAAGT TGAATGCAGT CCCAGCGGAA TCTGCCTCAG GAGGATGATT GTAGTTTGTG TTTTCAGAGA TGGTGA
[back to top]

Predicted Small Protein

Name NONHSAT054253_smProtein_1046:1258
Length 71
Molecular weight 8111.1393
Aromaticity 0.0857142857143
Instability index 57.0114285714
Isoelectric point 6.19366455078
Runs 13
Runs residual 0.0408769448373
Runs probability 0.0371842430665
Amino acid sequence MSISVHNHREKNHHRPHTWGLENNSMHVGLSEALHVCPEGPHNILGGLCMWLSMEQGQDV
QFCYYIDFLK
Secondary structure LLEEEELLLLLLLLLLLLLLLLLLLEEELLLHHHEELLLLLLLHHHHHHHHEELLLLLLE
EEEEEEEELL
PRMN -
PiMo -