NONHSAT144717

From LncRNAWiki
Revision as of 08:03, 13 October 2014 by 73.162.128.239 (talk)

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT144717

Source

NONCODE4.0

Same with

,

Classification

sense

Length

3444 nt

Genomic location

chr17-:414737..418180

Exon number

1

Exons

414737..418180

Genome context

Sequence
000001 TCTGATCTCC AAACCCAGCT CAGGGATTGA AGCTAACTAC CGCCACTCCT TCCTCTCTCG AAGAACAAAG AGCCAGTACC 000080
000081 CAGGGCTCTG CCCACCCCCG AGAAGGGGTG CCTGATTTCT TATGTCTAGG GTGGCTCCTC TGTGCAGATC AGACAGGACT 000160
000161 GAGAGCAGGC AAGCTGAGAC CAGCATGACC AGTACAGGGC ACGCAGAGCG TGGGGGCAGG TCCGAGGCTG CTGCTCGATG 000240
000241 CAGGGTATGA CACCAACCCT TCCTGAATTG CCATCGGATG AGGAGACCTG GTTCCCTGTT GAAAGAATGC CAGGGGAAAT 000320
000321 GCTGCATGTG CACAGCTGGA AATGGGGTTC CTTCTCTAGG AAGAGCACTC GGAATAACCT GCTGGAAATG GGGTTCCTTC 000400
000401 CCTAGGAAGA GCACTCAGAA TAACCTGCTG GAAATGGGAT TCCTTGGCTA GGAAGAGCAC TCAGAATAAC CTGCTGGAAA 000480
000481 TGGGATTCCT TGGCTAGGAA GAGCACTCAG AATAACCTGC TGGAAATGGG ATTCCTTACC TAGGAAGAGC ACTCGGAATA 000560
000561 ACCTGCTGGA AATGGGATTC CTTACCTAGG AAGAGCACTC GGAATAACCT GCTGGAAATG GGGTTCCTTC CCTAGGAAGA 000640
000641 GCACTCGGAA TAACCTGCTG GAAATGGGGT TCCTTCCCTA GGAAGAGCAC TCGGAATAAC CTGCTGGAAA TGGGGTTCCT 000720
000721 TCGCTAGGAA GAGCACTCGG AATAACCTGC TGGAAATGGG ATTCCTTCGC TAGGAAGAGC ACTCGGAATA ACCTGCTGGA 000800
000801 AATGGGATTC CTTCGCTAGG AAGAGCACTC GGAATAACCT GCTGGAAATG AGATTCCTTC CCTAGGAAGA GCACTCGGAA 000880
000881 TAACCTGCTG GAAATGGGAT TCCTTCGCTA GGAAGAGCAC TCGGAATAAC CTGCTGGAAA TGAGATTCCT TCGCTAGGAA 000960
000961 GAGCACTCGG AATAACCTGC TGGAAATGGG ATTCCTTCGC TAGGAAGAGC ACTCGGAATA ACCTGCTGGA AATGAGATTC 001040
001041 CTTCGCTAGG AAGAGCACTC GGAATAACCT GCTGGAAATG GGATTCCTTC GCTAGGAAGA GCACTCGGAA TAACCTGCTG 001120
001121 GAAATGGGAT TCCTTCGCTA GGAAGAGCAC TCGGAATAAC CTGCTGGAAA TGGGATTCCT TCCCTAGGAA GAGCACTCGG 001200
001201 AATAACCTGC TGGAAATGAG ATTCCTTCCC TAGGAAGAGC ACTCGGAATA ACCTGCTGGA AATGGGATTC CTTCGCTAGG 001280
001281 AAGAGCACTC GGAATAACCT GCTGGAAATG GGATTCCTTC GCTAGGAAGA GCACTCAGAA CAACCTGCCT GCTTGAGGGT 001360
001361 GACAGCAAGT GTCTCACACT TCTCGTCTAC TCCAAGTCAG ATTCCTAAAA ATAAAATAGA GCTCCAATGG GCAATTTAAG 001440
001441 CAAAAGCCTT AGAAACATGG ACACCTGAAA TGCTGGCCTA CATGAAGGGT ACGATGACAT TTTATATATT AAATAAAAAT 001520
001521 TCCCAGTCCT GTTTTAGTAT TCATCAGAAT AGCCTCTTGG ACCCTCGAAG TTCTACACCT TTGGGAGATA TAAATTTCCT 001600
001601 TAAAATAAAG GGATTGCTTC TTTGTGTAAA TAGCTTTCAT TTTCTCCATC TGGAAGTGAT TTCTGCCTGC ATGTTGGAGC 001680
001681 ACAAAATACT GCAGTTAATT AAAGCATTAC TGGATGTCAT TTGCTAGAAT GTATTTCCTT ACTGCTGACA GAAAACAGAA 001760
001761 CAGACTCTAG GAAAGAAGTA GCAAGTCCGT GACCCTGGCC GCTCCTGGTG GCTGCCCTTC TCCCTTCCCT GACAGTCTAA 001840
001841 GGAAGCAACG CGTGGGAGCA GAATGTGGCC GACTGAGCCC AGTCCTGACC TGCCCAGTTG CAGTGGAGCC TGTCTGTAAT 001920
001921 GTTGTGTAAA GTGTTTTGCT TCTGACTTGG AACCCTACTT GCCCTGTGAA TTTCTATCCT GTGAACCTTC CCTTCTCTGT 002000
002001 CCTTACCGTT CCCATCACAG CCTGCTCTTC CAGTGACGGC CAAGTGATTT CTCTCCTGCA CGCATGAGTC CATCCTGTTT 002080
002081 CCTTAGTAGC TGGATGGACG TTTTGCCATG AAAACTCTTT GGGAAGTCTT GTGTCCTTTA GTGATTTCTC TCCTGCACGC 002160
002161 ATGAGTCCAT CCTGTTTCCT TAGTAGCTGG TTGGACGTTT TTGCCATGAA AACTCTTCGG GAAGTCCTGT GTCCTTTCAA 002240
002241 CAGGGGAAAA CTGGCTGCGG CCCTTCCATT AGGCTCCAGG ATCATTACCT CGTTTCCATT CTTTCTTTTC AACAGCTCCC 002320
002321 CACTTTACTA GGGAAGCCAA GGGATCAACC TAATGAGAAA AGAACCTTCC CTGTAGCCCA CCTGGAAGGG CATTGCCTGC 002400
002401 CAGCTCCTTC CTTCTCACCT ATGTGAACAG GAGGGAAGAT GCCGGCGATC CATGGATCTG TGGGACATGA AGGACGTGTC 002480
002481 CATCAGAAAT GAAAGCAGCC TGGGATTGGA AGCATCCACA GAAGACTTGA TCTGCAGGTC TTGATTTGGA CCTGGCACTT 002560
002561 TGACACCAGG CGAGTCACTG ATCCCCTGAG CATCCATTTT CAATATTGGT GAGAGGGGAT TATAATGAAT ATTGGCTGTG 002640
002641 AAGGTCACAC AAGACCGGGC TATATGCACT TGCTCTGCCA TAAGGCAGTG TGAAGTGTGT TCGTGATCAC TTGTGGGCGG 002720
002721 GAGCCTGTGG CTGTCTTGCC TGGACTGGCT TTTTTAAACA TGGATGAGTG GTGACAAATC CTGCGGCCGT GTGCAGGCTA 002800
002801 CATCCGCAGC GAGCTCCGCA TATCTGGTCT TTGCCTTTTT AGGTCATGAC TGTAATATCA AGCTTACACG AGAGTTTTAA 002880
002881 AAGACTGAGC AGTATCCCCT TCCCCTGGCA GCTGCTGCTT CCGTAGCATT TAGGGATAAA CCTGCAGACA CTGAGAAGAG 002960
002961 AGCTTCCTCC CTCCTGGGGC TGGTGGCGAG GCTGCCTGTT GTGGAAAATA GGGCATGTCT CCTGTGAATG CGAATTGCGA 003040
003041 GGGTTGCCAA AGCCTCCCTT CCAAGAAACT TGCTTAGGTG TCTGGTGTCC ACAGACCAGA CTCTTCTCCC GACAGGGACA 003120
003121 AAAATTCTTG CCTGCCTTTC TTTTTCTTTT TTTTCCTTTT ACTTATTCTT GCCCTTTTTC TTCTTCTTCT TCTTTAGGTA 003200
003201 TGATGCCCCC CAAAAAACTA AGCAAGATTA ACTTGGGACA GGCAGTTCTA ACTGGGAAGC CTGAGCTCAC AGGCTGTTTG 003280
003281 GATTACTTTC CAATTGTGTT TCAAAGATGT TTAGACTTTT CCCAAGGCCA GCTGAATCCT TATAATCAGC CCAAAAGTTC 003360
003361 CTTTCTGTTT CTGTCTGTCC CCTTTGAGGG TGTTGACTCA GAGAACCTGG GGCAATATGA GATGTTAAAA TAAAAATAGC 003440
003441 GTCC
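
The sequence block above carries a position counter at the start and end of each row and splits the bases into groups of ten. The following is a minimal sketch, not part of the original record, of how such a block could be flattened into a plain string and how the listed length can be checked against the genomic coordinates. The function names `parse_numbered_sequence` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive.

```python
def parse_numbered_sequence(block: str) -> str:
    """Drop position counters and spacing from a numbered sequence block."""
    groups = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # position counter at the start or end of a row
            groups.append(token.upper())
    return "".join(groups)


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


# Short made-up rows in the same layout as the record (not real data).
sample = "000001 ACGTACGTAC GTTTGACCAA 000020\n000021 GTCC\n"
print(parse_numbered_sequence(sample))   # ACGTACGTACGTTTGACCAAGTCC

# Coordinates taken from the Genomic location field: chr17-:414737..418180
print(span_length(414737, 418180))       # 3444
```

Under that assumption the coordinate span equals the listed Length of 3444 nt, which is what a single-exon transcript would be expected to give.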

Predicted Small Protein

Name NONHSAT144717_smProtein_1469:1726
Length 86
Molecular weight 9641.4773
Aromaticity 0.0823529411765
Instability index 49.4612941176
Isoelectric point 8.42620849609
Runs 13
Runs residual 0.0163992869875
Runs probability 0.0422051010286
Amino acid sequence MLAYMKGTMTFYILNKNSQSCFSIHQNSLLDPRSSTPLGDINFLKIKGLLLCVNSFHFLH
LEVISACMLEHKILQLIKALLDVIC
Secondary structure LHHHHLLEEEEEEELLLLHHHEEHHHHHLLLLLLLLLLLLHHHHHHHLHHHHHHHHHHHH
HHHHHHHHHHHHHHHHHHHHHHHHL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHH
HHHHHHHHLLLLLLLLLLLLLLLLL
PiMo oooooooooooooooooooooooooooooooooooooooooooooooTTTTTTTTTTTTT
TTTTTTTTiiiiiiiiiiiiiiiii
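
As a rough cross-check of the values listed above, the sketch below recomputes two of them from the printed amino acid sequence. It is not the pipeline that produced this record. It assumes that aromaticity is the fraction of F, W and Y residues, and that the `1469:1726` suffix in the name is an inclusive nucleotide range on the transcript. The helper names `aromaticity` and `codon_count` are hypothetical.

```python
# Amino acid sequence copied from the record (two wrapped lines joined).
PROTEIN = (
    "MLAYMKGTMTFYILNKNSQSCFSIHQNSLLDPRSSTPLGDINFLKIKGLLLCVNSFHFLH"
    "LEVISACMLEHKILQLIKALLDVIC"
)


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in a protein sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def codon_count(start: int, end: int) -> int:
    """Number of codons in an inclusive nucleotide range (assumed reading)."""
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return nucleotides // 3


print(len(PROTEIN))                      # 85
print(round(aromaticity(PROTEIN), 6))    # 0.082353
print(codon_count(1469, 1726))           # 86
```

The printed sequence has 85 residues and the assumed range spans 86 codons, so the listed Length of 86 is consistent with counting the stop codon. The recomputed aromaticity (7/85) agrees with the listed value.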