NONHSAT131336

From LncRNAWiki
Revision as of 07:17, 17 October 2014 by 124.16.129.48 (talk)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT131336

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4111 nt

Genomic location

chr9-:43027664..43031774

Exon number

1

Exons

43027664..43031774

Genome context

Sequence
000001 TGGAGGATCA CAGGGCAGGT GGGGCAATCC AGAGCTGTGG GGGCTTCCAT GGGAATTGGG AGGTCCCAAG GCAGAGGTAG 000080
000081 GGGTTCCACA GGAGGAGTCA CAGAGCCACC AAGGGCTCTC CTGGCCCAGG GAGCAGTCAA CACCATGGAC TGAACACCCA 000160
000161 CTGGGCTAAG CCCTGGGCCA GGCTGGGGCA TGTGGGGCCA GGAGGCAGCT CAGAGTGGGA GACAGAGACA AGTGTGCTCA 000240
000241 GAGGGCACCC ATATCTGCAT ATAACGTGGT CCTGAATTTC TGGCTGGGAA GTGCTTCCAG GGTTTCATAT GTGTTATGGA 000320
000321 GATGCTTCCT CTCTCCAACC TCACCGTGCA GGAATCCCAG TGAATATATT GCCACCATCT TGGAGCTCAG TGCCCTCATA 000400
000401 GTGTAACAGC ACCAGCAGAT CTGCCTGTGC ACAGACTTCC TGTACTACCT CACTCCTGAG GGGAGATGCT TCTGCAGGGC 000480
000481 CTGCGACCTG GTGCACAACT TTAGACACCA TCATCCTGGA GCGGCACTGC ACCCTCACTA GCCAGGGTGT TGATGACTTC 000560
000561 CTCAATGCCA AGGCCACGTT CAAGATTTTC GACTTCAGTG ATGCGTTTGT GCTGAGCAAG GTGGGCTTCT CCGGGATCTT 000640
000641 AATTCAGGAG GTAGAATGCA GCTTGAGATC TAGTGTCTGA TCAAAGAACT TGAACTTGAC CTGGAGGGCT CTGGGGAGCC 000720
000721 ATGGAAGGTG CTGGATAAAG GAAGGGACAG TCATATATAT TTTAGAGATG ACTGTGGAAG GCTGCCTGGA AGGAGTGAAC 000800
000801 AAGAGCCAGG AGACCAGGGA GGGAGTTTGT GGGGCAGGTC TGCAGATGGC AAGGGAGGGA TCCTGCTTGG ATGAAAGGTC 000880
000881 TTCAGGGACT GTCTCAGGTT ACACTCAGGT GCCCTCAGAG CTACTGTGTT CAGGGTTCTT GTCTCCAGGA TGAAAATAAG 000960
000961 GAGGAGTTGT CAGACAAGGA CATATACATG GAGGCTGGCA TCTTCATGAG TGCCAATCGT GGTCCTGGTG TGGACTACTG 001040
001041 TGGGAGCAGG GGTCTCTCCA TCCAGGGACA TGGTGGATGG ACCCTACATC ACTCCATTCT GCCCTTCCTT TCCCTCCCAT 001120
001121 TCTCCTGAGG GACTCAATGC ATGGGCACTG TCCAACCTCT GGTGCTGAAG CAGCCAAGAG ACCCAAGCCT GCCTTGCTGC 001200
001201 CACTTAGGAT ATGACAGCAC AGCCAGTGGC CTCTACTGGA TCCTGGTACC CCTCAGAAGA CACCCAGACA CTGGGAGTGC 001280
001281 TGCCACCTCG TGGTGCAAGA GTTCTGAGGG ACGGCAATTC TGAAGACATT GAATGGTGGG TGCTGGGCCT CATGGCTGTT 001360
001361 CCCCAGCCCC TCTCATTGGC TCTGCTCCAG GTGGAGAAGG GGGATGATGT CTCTGTCAGT TCTGCTGTTT TAGCCTAGAA 001440
001441 GGAAAAGAAG CAGAGCCCAG AAGCAGGGCC TGGTACCCAG CCTGCCTAAC AAGGGAGAAT TTGTAGGCTT TGTGGACAGA 001520
001521 AAGATCTGGG ACTCCATGTC ACCCACTAAC TTGCTGAGAC ATTAGTAAAA TCAGTTTTCT TTTCTGAACT ATGTTTCTGT 001600
001601 CATCTGTACA TTGAGAGGAA TTTCTTTTAC TCCACGAGGC TGCTTGGAGA ATTAGTGACA GTGTGTGTAG AGCATGTGCC 001680
001681 ACCCAGCAGG CATTTGGTGT CGAGACCACA CCTCCTCCCC CTTCATTTTC AGCCTAAATT TGCATTTTGT TCTTAAGACT 001760
001761 TTCACTCGCC TTAATTTTAC TCTTTCCTCT GATTCCCACC TTATCGTCTA TCCCATGGAG TCACTAGGAT CTAAGTGGGT 001840
001841 AACAGTCATG TATGCATGTA TGTGTATGTA CGTATATACT TTGTTGGTGT TGGAGTGTGG TGTGTGAATG TGTGTGTGTG 001920
001921 TGTGTGTGTT GGAGTTACTG GGTGACTGAA ACTGTACACA TCAGGCTGTG GTTCTGCCCA TTGCTGGAAG CGCTGTCAGG 002000
002001 GGTCCTGCCC TCAACCCCAG GTCTGACCCT TGCAGCGCAG GCAGGACATT CTGGAGGAAT CATGCCCTTG GGAGGATCCC 002080
002081 TGAGGAGTGA CTGGTGGGTA TTGGTGGATA AATACCCCTG GTCCCTTGCT CTGGGTATGA TGACTCTGAA GCACATGTTC 002160
002161 TGTGCTGTCT CTCAGAGGTA CCTGGCAGGG CTGAGTCCTG GCTGCCACAG TGGAAACTTT CTTGATGAAG GTCCCTTTAA 002240
002241 CTGCTGCATT CCTTTCCTGT CTCAGTTCCC CACTCCTCCA CTGATGTTTC CTGGGATTAG CACCCTAAGG AAGAACTGGC 002320
002321 AGTCGAATTA ATATCCTAGC GTTATCTCCA AACAAAATTT TTGTATTTGA ATCTTTGCCT CAGGATCTAC TTCCAGGAAA 002400
002401 TTCAGACTAA GACACACATT TTTCTTTTGG CTCCTTGAAT CCCCATAGGC CTGACATTTT GCTGTTTTTA TCAAAAAGGA 002480
002481 ACATGAGGAT CAGAGAGGGA AAGTCACTTG CCCAAAGTCA CCCAGCTGAA CAGTGGTAGA GTTCAACTTT GATCATGAGA 002560
002561 TGTCTGGCCC CCAGGTGGAG GCTTGCTCCT CTCCCATGAG ACTCCTTCCT TATCAGGGTC AAATGAATGA ATGGAGGATG 002640
002641 TTAAAAGTGG GGTCTCTGAT GCCTTTGCCA GATAAACCCC AGGCTCATGG CTGGCGCCTG TTTTCTCATT CTTACCTCAT 002720
002721 TAAGAGTAGT AATGAAAAAC ATGCTCAGTG CTGACCGTGT GCCTGGGGGT GTTGTAGGCA CTCCACTTAC TTTAATTCAT 002800
002801 TTAATTTTCA CAATAACCTT GTTTTTACTT CTAGTTGTTA TATGAAAAAA CTGAGGCAAA GAGCAATACA GAGAGTTGCA 002880
002881 AAAATTCATA CCGCTGGTCC AGGTTTGAAC CAAACAGTCT GCACCTGGAG TCCTTGTTTG TAACCATGGC ACCCTGTCTT 002960
002961 CACACATATC TCATCGTGGA GTTCCATCTT GTGTTAGGCA TGGCACTGAG CAGCTTCTTT TAAGAACATA ATTTGTAGCC 003040
003041 AGGCGCAGTG GCTTATGCCT GTAATCCCAG CACTTTGGGA GGCCGAGGCG GGCAGATCAC GAGGTCAGGA GATCCAGACC 003120
003121 ATCCTGGCTA ACTTGGTGAA ACCCCGTCTC TAATAAAAAC ACAAAAAATT AGCTGGGCAT GATGATTGGC GCCTATAGTC 003200
003201 CCAGCTACTT GGGAGGCTGA GGCAGGAGAC TGGCATGAAC CCGGGAGGTG GAACTTGCAG TGAGCCGAGA TGGCGCGACT 003280
003281 GCACTCCAGC CTGGGCGAGA GAGCAAGACT CCATCTAAAA AAAAAGAAAA AAACATGATT TGTAATTATG TAAATTACTA 003360
003361 ATTCTACTTC AAAGTGCCAC ACAGCCTTCA TGTGATAAAA TGAAGCAATT GGTAAGTCTA AGCATTGAGA AAAAACATTA 003440
003441 TTTTTCCCAG CTCCATTGCA ACAGTTGGGA CAGTGTTTTC TCTGTGCCTA TAGAAACCTC AGCTAGTGTG CCGAGGAGTC 003520
003521 TGGTCCCTTT GGGGAATGTG GCAGTCAGGT TCTGGCAGGG ACCTCGAAGT GGCTGGTAAT GTCTTTCATT ACCACCACCA 003600
003601 CGTGACCTGG TCTTACGACC TGTTAGCTTC CTTCATCAGG CATGAGCACC AGGATGGCAG GGGCCTCATC TGTCCTGTTC 003680
003681 CTCCTGTGGC CTGGGTCCTA GCACCATGTC TGGTACAGTG TAGGTGCTCA AGGGAAGTTT ACTTTATAGA ACTGTCTACC 003760
003761 TGGGAGATGT TGCTGTTAGT CTAACCTGTA CCATTTTGTA AACCTGCAGC CGTTTTGCAC ACCCTGGTCA GAATGAAACA 003840
003841 TTCCTTGGGA ACTCGGGCCG TGAGAAGCAT CCTTCCTGAT CACCTGACTG TAGAAACATC CTTATCGCAC CCTCCCGGGC 003920
003921 AAAGGCCCAA CAGCCTGACT GCAGGAACAT CCTTGCCATA TCCTGCCGGG CAGCAAGCTC TACCGCCCAC ACCCCTCCTT 004000
004001 CCCAGTCCCA TGATCACCCC AGCCTGTGAG AGGCAGTTGG TGCTGGCAGT AAGCTGGTTT CCTCCTCTGC AGGGTTTTGC 004080
004081 TAGTAATAAA GGTGTTGCTG TTGAAGCCGT C
[back to top]

Predicted Small Protein

Name NONHSAT131336_smProtein_1673:1906
Length 78
Molecular weight 8857.4355
Aromaticity 0.142857142857
Instability index 52.9246753247
Isoelectric point 6.87042236328
Runs 14
Runs residual 0.0457368718238
Runs probability 0.0687094657683
Amino acid sequence MCHPAGIWCRDHTSSPFIFSLNLHFVLKTFTRLNFTLSSDSHLIVYPMESLGSKWVTVMY
ACMCMYVYTLLVLECGV
Secondary structure LLLLLLEEEELLLLLLLEEEEEHHHHHHHHHHHLEEELLLLEEEEEELLLLLLLEEEEEE
HHHHHEEEEEEEEEELL
PRMN LLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLHHHHHH
HHHHHHHHHHHHLLLLL
PiMo iiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTToooooooooooooooooooooTTTTTT
TTTTTTTTTTTTiiiii