NONHSAT031886

From LncRNAWiki
Revision as of 09:33, 13 October 2014 by 73.162.128.239 (talk)

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT031886

Source

NONCODE4.0

Same as

,

Classification

intergenic

Length

2129 nt

Genomic location

chr12+:128399955..128436097

Exon number

5

Exons

128399955..128400227,128418835..128418894,128429263..128429344,128433369..128433503,128434519..128436097
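
As a quick consistency check, the exon coordinates above can be summed and compared with the Length field. The following is a minimal Python sketch; it assumes the coordinates are 1-based and inclusive (this page does not state the convention), and the helper name `exon_lengths` is hypothetical.

```python
# Exon coordinates copied from the "Exons" field above.
EXONS = (
    "128399955..128400227,128418835..128418894,128429263..128429344,"
    "128433369..128433503,128434519..128436097"
)

def exon_lengths(exons: str) -> list[int]:
    """Return the length of each exon, assuming 1-based inclusive coordinates."""
    lengths = []
    for span in exons.split(","):
        start, end = (int(x) for x in span.split(".."))
        lengths.append(end - start + 1)  # inclusive span (assumption)
    return lengths

lengths = exon_lengths(EXONS)
total = sum(lengths)
print(lengths, total)
```

Under that assumption the five exons sum to 2129 nt, which agrees with the Length field and with the Exon number of 5.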

Genome context

Sequence
000001 ACATCAGCTC CTGGAAACAA ATGCACTTAA CCAGAAAAAC AGGGACATCA ACCAGCAAAG ATGCATCCTC ATTTAATAGA 000080
000081 GCTTGAAATT ACTATAATCT GTAAAGTACA GCGAGAAGAA GAAGTTTCAC CTGCCTGCAC ATCTGTCGTT GACTCTCCAT 000160
000161 CTGACTTGGA GGGACTCTGA GGGACCAACC TGAGCCTGAG AAGAGGCAAG ATTCCCCTTC AAGGACACAC TGGGAACTTA 000240
000241 CGGACCTCTT TCTCCATGGT GCAGAGCGCA GAGGCTACCT GAGAATATCC AGAGACTACA GAGGTCATCA AGTTGACGCA 000320
000321 TCAATGTTTA CATgtgaaca aacagacgcA TAATAGCTCT TGAGCTTGCC ATTGGGAAGT GAAAATCAAC TGCTTGAATA 000400
000401 AATGTAAGGA CTCAGAACAC CAGACATGGG TGCTCCAATC CACGTAGTGT TGAGGCAGTG AAGTTTGTGT TCCAGGAACT 000480
000481 TCAGAACTCC TGAATGTATT GGTGTTGATA AAAGGTGATG AGATGAagta gaatgaccat aaacttagag GTAGACATCA 000560
000561 CCTTGTGACT CAACAGCAGC CAATAAGATG GAAATAAAGG CCATGAAGTT ATCGGATGGC ATCTACAAGG AATACTTAGA 000640
000641 TGAGGAGGTG ACTCTGGTAG CAATTGTTCT TTTGTCTTTC TCACCTTCCT TCTTGTCCTG TCTGGGATGC TGAAGTGATG 000720
000721 GCTGGTATAC AGATGCCCTG TTGTGATGTT GAGATAAACA TGAGAATAAA AGTCGGCGAT TACAGATGGC AGAGAAGAGA 000800
000801 GAGAAAGATA GGATGTAGGT GGATATTGAC GGAACTAAGT TTCTATTGCA CAAGTTCTAC CACCAACCTC TGGACTTCAT 000880
000881 GATATGTGAG GGAGAAATAA ACTCCTATTT TAGAAGAGAT CTATGTTACC TGCAACTGAA CGGAATGCTT AATTGGCACA 000960
000961 TCCTATGTGC TCCAAACCTT CCTTAATTTC TGTGTGTAAG GCATACTTAG TCTTTACCCA TTATTCTGTT CACATTCATT 001040
001041 CACTTGATCA GCCTCCCAAC TCTATGGTAA AGAGTTTACT ACAGTTTACT CTATTAGTAA GGGAGATTCT GGAGCAAGCC 001120
001121 CCACAAATCA TAGATTGAAT ACCCATTCAT TAAGAGAGAG ATTTAAAGAT ACTTCTATCC AAACACATTT TAAATTTCCA 001200
001201 GCACCTTTGA GGTGGTGGAG GTTCATGGTG GTTGATAGGG TTTAAATGAC TTGTGATTCA AGCTCCATAT TACAATGTAA 001280
001281 CAGTGCAAAA GTTCAttttt ttaaatttat tttactttaa gttctggtat acatgtgtag aacatgcggg tttgttacat 001360
001361 aggtatacat gtgccatggt ggtttgctgc acctatcaaa ccgtcaccta ggttttaaga accacatgca ttaggtattt 001440
001441 gacctaatgc tctccctcat gttgcccccc aactccctga cagggcccag tgtgtgatgt tcccctccct gtgtccatgt 001520
001521 gttctcattg ttcaactccc acttatgagt gagaacatgt ggtgtttgat tttctgttcc tgtgttagtt tgctgagaat 001600
001601 gatggcttcc agcttcatcc atgtccctgc aaaggacacg aactcattct tttttatggc ttcataatat tccatgAGTC 001680
001681 TGGATTTGGT TACAGATAGA GCTGGACAAG CCTTAGCTAT ATCTAATGGC TTTTGACATG TTATATCAAT TTCTTACTGC 001760
001761 TTTTGTAAAT GAATATATTA AGAAGAAAAA ATTACAAATG ACCCTTTCTT CACAAACTTT TAGGGTAATC AAGTTCAGTT 001840
001841 ATTAAAAAAT GGATGTTTGC AACATTCATA GTAATTATTT TCAGaaaatt aaaaatgatt aaaaattata ttttCATCCC 001920
001921 ATTCAGATAT TATATATTTA ACTGTGAGAT AATTCTCAGA GATAAATGTC AACTTTTAAA AAATGCTAAA TAAATTAAAT 002000
002001 CAAAATATCC CCTCTAAAAT AAATAAATTA GCTCCGATTA TTTCTTCTTT TTAAATTTTT CCTTATAGTT GTAAAAATAA 002080
002081 AGTGAACGTT TTGGCTCATT TCCTCATTAA AGTTTGAATT TTCCAGTCA
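
The sequence above is laid out in numbered rows of ten-nucleotide blocks. To recover a plain sequence string, the position labels can be stripped with a sketch like the one below. It assumes every row follows the layout shown (a six-digit start label, space-separated blocks, and an optional six-digit end label); the function name `strip_positions` is hypothetical, and only the first row is included as sample input.

```python
import re

def strip_positions(rows: str) -> str:
    """Join numbered sequence rows into one string, dropping position labels."""
    pieces = []
    for row in rows.splitlines():
        for token in row.split():
            # Keep nucleotide blocks only; numeric position labels are skipped.
            if re.fullmatch(r"[ACGTNacgtn]+", token):
                pieces.append(token)
    return "".join(pieces)

# First row of the Sequence field, used here as sample input.
sample = (
    "000001 ACATCAGCTC CTGGAAACAA ATGCACTTAA CCAGAAAAAC AGGGACATCA "
    "ACCAGCAAAG ATGCATCCTC ATTTAATAGA 000080"
)
seq = strip_positions(sample)
print(len(seq), seq[:10])
```

Letter case is preserved, since the page mixes upper- and lower-case nucleotides. Applied to all rows, the result can be checked against the stated length of 2129 nt.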

Predicted Small Protein

Name NONHSAT031886_smProtein_1496:1666
Length 57
Molecular weight 6339.7084
Aromaticity 0.125
Instability index 53.0553571429
Isoelectric point 6.48785400391
Runs 9
Runs residual 0.00741608118657
Runs probability 0.0652711535064
Amino acid sequence MFPSLCPCVLIVQLPLMSENMWCLIFCSCVSLLRMMASSFIHVPAKDTNSFFFMAS
Secondary structure LLLLLLLEEEEEELLLLLLLHHHHHHHHHHHHHHHHHHHLEELLLLLLLLEEEEEL
PRMN<
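
Some of the values in the Predicted Small Protein table can be cross-checked from the listed sequence. The sketch below is illustrative only. It assumes that Aromaticity is the fraction of F, W and Y residues, and that the `1496:1666` suffix in the Name field gives inclusive transcript coordinates; neither is stated on this page. The function name `aromaticity` is hypothetical.

```python
# Values copied from the Predicted Small Protein table above.
protein = "MFPSLCPCVLIVQLPLMSENMWCLIFCSCVSLLRMMASSFIHVPAKDTNSFFFMAS"
structure = "LLLLLLLEEEEEELLLLLLLHHHHHHHHHHHHHHHHHHHLEELLLLLLLLEEEEEL"

def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); an assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)

# Span taken from the Name field, treated as inclusive (assumption).
orf_start, orf_end = 1496, 1666
orf_nt = orf_end - orf_start + 1
codons, remainder = divmod(orf_nt, 3)

print(len(protein), len(structure), aromaticity(protein))
print(orf_nt, codons, remainder)
```

With these assumptions the listed sequence has 56 residues, one secondary-structure symbol per residue, and an aromatic fraction of 7/56 = 0.125, matching the Aromaticity field. The span covers 171 nt, or 57 codons, so the Length of 57 would be consistent with counting a stop codon after the 56 listed residues; that reading is an inference, not something the page confirms.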