NONHSAT083375

From LncRNAWiki
Revision as of 00:13, 17 October 2014 by 124.16.129.48 (talk)

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT083375

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2459 nt

Genomic location

chr22:19025350..19044566 (- strand)

Exon number

4

Exons

19025350..19026634,19028571..19029472,19035953..19036156,19044499..19044566
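The exon coordinates can be cross-checked against the Length and Exon number fields. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive at both ends (the page does not state the convention). The function name `exon_lengths` is hypothetical and used only for illustration.

```python
def exon_lengths(exons: str) -> list[int]:
    """Return the length of each exon in a 'start..end,start..end' string.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    lengths = []
    for block in exons.split(","):
        start, end = (int(x) for x in block.split(".."))
        lengths.append(end - start + 1)
    return lengths


# Exon string copied from the Exons field of this entry.
exons = "19025350..19026634,19028571..19029472,19035953..19036156,19044499..19044566"
lengths = exon_lengths(exons)
print(len(lengths), lengths, sum(lengths))
# → 4 [1285, 902, 204, 68] 2459
```

Under that assumption the four exon lengths sum to 2459, which agrees with the listed transcript length of 2459 nt.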

Genome context

Sequence
000001 GCACCACGAC CTCCACAGCT GGCACGCCGA GAGCTGCTAC GAGAAGTCTT CATTTCTGTG TAAAAGAAGT CAAACATGTG 000080
000081 TTGACATCAA GGACAACGTG GTGGATGAAG GGTTCTACTT CACCCCTAAG GGGGACGACC CATGCCTGAG CTGCACCTGC 000160
000161 CATGGAGGGG AGCCTGAGAT GTGTGTGGCT GCTCTCTGTG AGAGGCCCCA GGGCTGCCAA CAGTACCGCA AGGACCCCAA 000240
000241 AGAGTGCTGC AAGTTCATGT GTCTGGACCC AGATGGCAAC AGTCTGTTTG ACTCCATGGC CAGCGGGATG CGCCTGGTCG 000320
000321 TCAGCTGCAT CTCCTCCTTC CTCATCCTGT CACTGCTGCT CTTCATGGTC CACCGGCTGC GCCAGCGGCG CCGGGAGCGC 000400
000401 ATCGAGTCCC TGATTGGAGC AAACTGTAAG TGCCCTCTGA GGGCCACAAC CCTGGTCCCC ATCCCCACCC CAGGTCACCC 000480
000481 CATGTCTCCA TGAGCTCCCC AGCCCTGGAC AGAGAGGGGC AGAGGCAGGT TTGAAGGCCA CATGTTGTGG GGTCTGTGTC 000560
000561 CTCACCTGTG GGCCAGGAAA GTTCTTGCCA AGACCCTCCT TATTCCCATT GAGCCACAGG GTGGGCAGAA CTCGGAGGCC 000640
000641 TCTGAGGGTC AGAGGCCCGG ATCACCCTTC CAGCTGGAGG GGTGTGGTCA GCCAGAGGCT GGGTCTGACA GAAGTGTCAG 000720
000721 CCCAGAGGGA CAGGTTGGTG TGGTGAGGGT GAGACATGCC ACGGAGCACA CCAGGGGTCT CATGGGGATG CCGAATGTGG 000800
000801 CCGAGGGGTA GGCAGCGGGG CACAGGTGTG TGCACTCAGG GACGGCAGGC CCAGCCTGGG GGTGGGGGGC TCAGGAGCCA 000880
000881 GGCAGCCTGC CCCCAGTGAC TTGTCTCTGT TCCCGTTGCA CACTTTTGAT GGCCCAGTGC ACCACTTCAA CCTCGGCCGC 000960
000961 AGGATCCCTG GCTTTGATTA CGGCCCAGAC GGGTTTGGCA CGGGCCTCAC GCCGCTGCAT CTTTCTGACG ACGGAGAGGG 001040
001041 TGGGACTTTC CATTTCCACG ACCCTCCACC TCCCTACACG GCATACAAGT ACCCGGACAT CGGCCAGCCC GACGACCCTC 001120
001121 CGCCGCCCTA CGAGGCCTCC ATCCACCCGG ACAGTGTGTT CTATGACCCT GCAGACGATG ATGCTTTTGA GCCTGTGGAG 001200
001201 GTCAGCCTGC CAGCCCCTGG GGATGGTGGG AGTGAAGGTG CATTACTCCG GCGCCTGGAG CAGCCTCTGC CCACTGCGGG 001280
001281 GGCCTCTCTG GCAGACCTGG AAGACTCTGC CGACAGCAGC AGCGCCCTGC TCGTGCCCCC TGACCCTGCC CAGAGCGGGA 001360
001361 GCACCCCAGC TGCAGAGGCA CTGCCAGGGG GTGGCCGCCA CAGCCGCAGC TCCCTCAATA CTGTGGTGTA GACGGCCTGG 001440
001441 CCTGTACCCC AACGGTCTGG GAGCACCTGT CTGTTGTAGA AAACACCGGT CCCTGGGGAG ACTTGAAAGG CCCCTGTCCC 001520
001521 AGCCTGGACG CCACGCACTG CCGCACGTCA CTGGCGGGCT CGCGTGTGTA CATAGAGACC ACAGCCCGCC TTCTGCCAAA 001600
001601 AGAAGTGATG GCCTGCACCG AGCTTCCTTG AGGGCTTCAG AAACATGCAT AGCTTTGGAT CACTGTCTTC TCCTTTATAA 001680
001681 ATGGCAGAAG AGTGACAAAA TTCATTCAGA CCGCACATGT TAGAGGCAGG GAATGAAGAA GGTACTGTGG GCCATGGCCA 001760
001761 CACCTGATGC GTTTTTGGTG GGCTTACTTG GTGCAGTGTG CTGTCCAGAG AGACCTGCTG ACCCAGTCTG GGACAGGCAC 001840
001841 AGTGGGAGCT GCCACAGTGC CCCTTGCTGG CCGCCCTCAG GAGGGGGCCT CTGGACCGTC AGTGTGGCGT AGGCAGTGGG 001920
001921 TCTGCTTCAG GGAGGCAGCC TCTTGACTTT GTCACAACGG TTGCACTGAA GATGGCCCCC ACAAGCCCAG TTGTGAATAT 002000
002001 CAAGGTGACC CTGCCCCTGG CTGGGAGCTC CCCTGGGGCT CTGGAACCTG AAGCCCTGAG AAGGAGAGCT TGGAAGGAGG 002080
002081 TTGAGCTCTT CACTGTGTCT TTCCATCTGG GCTCTGCAGC CCAGCTCTGT GGCAGGAGGC CTGACCCCAC CCCATCAGTC 002160
002161 CCTCTCCCAG CATTGCTGTG CATGGCTCCC TCAGGAAGAA GCTCTGGAGT GGGGCCGAGG CCCCAGATGC TCTGCTGGGG 002240
002241 TCTGGGGACT GAGCTGCCTC CTGTCTCTCC ACTCTGGAGC CCTGGGCTCT TGCCTCCTGT TGAATTGCCC CTGGGCCTGC 002320
002321 CCCCGGCCCC ATTTGTGCCA TAAAGGGTTG CTTCATTGCA GGAGGGGTGG CTGAAACCAC CATCCTGGGC TGCATCTATC 002400
002401 TCCTTAAAGT CCACTCCTTA CATCACCGCC ACTACTGCAG CTCAGTGCCC AGCGGCCGC
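The sequence is displayed in blocks of 10 nt with a position counter at the start and end of each line. To recover a plain nucleotide string for downstream tools, the counters and spaces have to be removed. The sketch below shows one way to do this in Python; the function name `parse_numbered_sequence` is hypothetical, and only the first two lines of the sequence are included to keep the example short.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Strip position counters and spaces from a numbered sequence block."""
    bases = []
    for line in text.splitlines():
        # Keep only tokens made of nucleotide letters; this drops
        # the numeric counters such as 000001 and 000080.
        bases.extend(tok for tok in line.split() if re.fullmatch(r"[ACGTN]+", tok))
    return "".join(bases)


# First two lines of the Sequence field of this entry.
block = """\
000001 GCACCACGAC CTCCACAGCT GGCACGCCGA GAGCTGCTAC GAGAAGTCTT CATTTCTGTG TAAAAGAAGT CAAACATGTG 000080
000081 TTGACATCAA GGACAACGTG GTGGATGAAG GGTTCTACTT CACCCCTAAG GGGGACGACC CATGCCTGAG CTGCACCTGC 000160
"""
seq = parse_numbered_sequence(block)
print(len(seq))
```

Applied to the full block, the resulting string should be 2459 nt long if the display is consistent with the Length field.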

Predicted Small Protein

Name NONHSAT083375_smProtein_161:412
Length 84
Molecular weight 8354.3736
Aromaticity 0.0602409638554
Instability index 51.8891566265
Isoelectric point 8.76898193359
Runs 12
Runs residual 0.0045515394913
Runs probability 0.0455823470529
Amino acid sequence MEGSLRCVWLLSVRGPRAANSTARTPKSAASSCVWTQMATVCLTPWPAGCAWSSAASPPS
SSCHCCSSWSTGCASGAGSASSP
Secondary structure LLLEEEEEEEEEELLLLLLLLLLLLLHHHHHLLHHHHEEEEEELLLLLLLLLLLLLLLLL
LLLEEELLLLLLEELLLLLLLLL
PRMN -
PiMo -
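Some of the values in this table can be recomputed from the amino acid sequence. The sketch below is a minimal Python example, assuming aromaticity is defined in the usual way as the fraction of Phe, Trp and Tyr residues; the page does not say which tool produced the table, so this is a consistency check rather than a description of the original pipeline. The function name `aromaticity` is chosen for illustration.

```python
# Amino acid sequence copied from the table above (two display lines joined).
protein = (
    "MEGSLRCVWLLSVRGPRAANSTARTPKSAASSCVWTQMATVCLTPWPAGCAWSSAASPPS"
    "SSCHCCSSWSTGCASGAGSASSP"
)


def aromaticity(seq: str) -> float:
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


print(len(protein))          # number of residues in the displayed sequence
print(aromaticity(protein))  # compare with the Aromaticity field
```

The displayed sequence contains 83 residues, five of which are Trp, giving 5/83 ≈ 0.06024, in agreement with the listed aromaticity. The Length field reads 84; one possible explanation is that the span 161:412 in the name covers 252 nt, that is 84 codons including the stop codon, but the page does not confirm this.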