NONHSAT141131

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT141131

Source

NONCODE4.0

Same with

,

Classification

sense

Length

3405 nt

Genomic location

chr16+:23677789..23681193

Exon number

1

Exons

23677789..23681193

Genome context

Sequence
000001 TACTTCTCTC AGAAAACTTG GAAAACACTG AAAAGCAGAA GGAAGGAGAA AACCTCACAT TCCCTTAGCC CTACCCCAAG 000080
000081 ACAGTATCTT CTTCTCCATG TTGTTTTACA CAGCTGAAAT CATGTAGCAT ATACAGAGGC ACGTCATAAA TTCACAGATG 000160
000161 GAAAATAATA TGAACAGAGA GATTTGACAG TATATGATAC CTACCACTGA GTGGTTTAAT TGTTTTTCCA ATTAAAAAAT 000240
000241 AAATCTCATC TCTCAGATCA TTGAATCTGA GTTTCTAAGA TGAACAAAAT CATCACTCAG ATTCTTCGGG GAGGCATTTG 000320
000321 GCCATTCTAC CGTGTCATGC ATCTCTGCTT TTGCAGAGGA GGAAGGAGAG ACTTTTGTTT AGTAATTTCT CCATATTGGG 000400
000401 GTCCTGCTGT GAAAAAGTTT AGCTGTTCTT AGCAAGCACT GGACCAGAAC AGCCTCAGCG ATTATTTAAG TGATTGTCAG 000480
000481 ACATGCATCT GATTGAGGTG AGAAGGATAT TGCCAGAGAA ATATCTTAAC TTCTTGTAAC TTCTTCAAGC TCCTTAGAGC 000560
000561 TGGGTCTTTC TTTCCCCAGG ACTCTTCTCA GGGGAGCTCC CGGAGTGCAC TCAGGAGCTG ATGATTGACG TCACCAAGAG 000640
000641 CTACTACCAG AAGTTTTTGC CCCTGACGCA AGTCTAGCAT CTCTGCCTCA TGTCTTGAAT CTGCTTGAGC TCTAAGATGA 000720
000721 ACCTGGGGAC AAAGTGAGCC AGTCAGCACC TACAAAGAGC TTTTGTGTCT TTGACATCTA CCACCCTCCT CCTTTTAAAA 000800
000801 AATTTCTTTA GAATTTCTCA ATCTTCAAGG CTCTAAGTGC TTAAGAATTC ACTAACAGAC AGACCATCTG GAGGAGCTGT 000880
000881 CTTCAAATGC TGTGCTTACA CCTTATCTAT GAACAGTCAC TTTGTACCAT TATCTGTGGA ACACAGAATC ATCTGTTCCC 000960
000961 AACACTCCAG CCCCTTGGTC CTGTGGATGG CTGGATCCCG CCTGAAACGG ACCTGCAGAG CAGCAGCACC CTTCCGGTGT 001040
001041 GGAGGCTATG TAGCTGGTGC GCTGCTCACG GCCATTCACT GCCCATGCTG AGCGCCTCTC ACACAGGTAA TGCCCAGCTT 001120
001121 TTCTGCTGCT AACACATTTG GCCAGTTGTT GCAGTTGCTC ATCATCTTGG GAAAGGTGTT TGTGACTTTT CAGAGCCCAG 001200
001201 ATTCCTGTTG TCTATTAAAA CTTGAAGGGA GGGGTGAATA GTGTTTCTCT CTTCTTCCCA AAATGACCTT AGCTGTCCTA 001280
001281 GGATAGTTAG TAAAAGACTT TTTAGCATTT TGACCTAGGG CCTTTGGCTT TCACTAAAAG TGGGGACCTC AGTATCCCAG 001360
001361 ATTGTAATTT TGCCAAGTGT TAGATTTGAG TCTCTCATGT GGATGCATTA GTCAGGTGGT TACTCCTTGC TTCAAGGTAC 001440
001441 TTACCTTATT TCATTGAAGA CACCGCATTT GTGAACTCTT GCTTCCTGGC CTAGAACCAT TCAGCCTACC CTGTATTTGC 001520
001521 CATAAACTCC ACAATTCACA CCAAAATGTC TGTACTTAGA GCTAATTCGC ATATATACAG GAAGGGCTCT TAGAATCAGT 001600
001601 TTGTGGGCAC AGAGCCTCAG GAGTAAATGA AGTTACTAGG GCTGTTCTTA CCATCTCCTT CTGGCCAAAT AGCACAACAT 001680
001681 TTCCTCGTTC TGCTCTGACC TCTTAGCTTA GAAGGAAGAT TCAGAAGTGA GGGGCTAAGA AGGTTGTCCT TGCCTAATGC 001760
001761 TCTGATCTGT AAGTGAATAG GGCAGAACAG TTCAGCCTTG AGGTTAGAAT TTAGCAGGAG CTATCCTGAC TTAATATCCA 001840
001841 GTTGTGGGGT TTGCAAAACA AAACAGCTGT ATGTAATCAT TGCCACTAGT TCCATCTAGA ACTCCTTTCT AGTTTGTTAT 001920
001921 TTTTAAAATG TTTATacata aaaccaccaa aatacatagc ttcgacaaga tggaagttta tttctctctc ccataacagt 002000
002001 gcagtgatag tcagctggtc caggccaggc aaggggctgg tccatgatgt catcaggcac ccaggttcct actgtcttgc 002080
002081 catgtggcca cagttagcaa caaaggaggc tgtaaattta gtttctactt gggcagccaa aactctgagg aaggagattc 002160
002161 tgctagtaaa aaggagtggg ggaagaatgg ccattgggag acaacaagca gactcaacca GGCCTCTTTG TTGGCTTCCT 002240
002241 TTCCTCCTGC TGCACATGAG CCTTCGCCGT GCATTTGGAG CCatgacagc tgatagctcc agacctgcat cctcctagct 002320
002321 tgggggctct gaatgaaagg tttcttccct tccagttcga atttggaaac tcccaaagtt ctcaatggtt tgttgtgagt 002400
002401 tccatgtcct cttggatcag tcactgtggc caTGCATGTT TGGCCACATG ATTAATCCAG TCTGGGTCAT GACCTTTTCT 002480
002481 TCATCCAAAA CAAGGTGATG GGAAGACAAA AACAATAGCT ACTACAAACA ATAGGAGTTT ATAATTATGT GCTGATGTAT 002560
002561 TCGAAGATGT GTTGACAGTC GTGAGTGTGT ATCCTAGGAA AGGCGAGCTG GACTCTGTCT CCATGGTGGC TCTCACCCCA 002640
002641 GGGACCTAGG AACAGCCTGT CACCACACAA TTACTTTTAT AACCCTGGAG ATGAAAATCT CCTTGTCCTC AAAATACTTC 002720
002721 CAGAAGAACA ACCAGATGGG AAGGACCTTG GTTGGGACTC TTTCCAGTTC ACTTGGGGCa gagggaattt aatggctcac 002800
002801 gtagctgaaa aggatgggct agattgggct tcaggctgca tcccaggact ccaaacaggg atctgtctct ttggctctca 002880
002881 gctctgcttt catttgagtt ggctttattc ttgggcTTCA CAGTGTGGCC CCACAGCACC AGTTATTGAT aaaaagagct 002960
002961 cccctttgct gacagaactg ctggatttgg ttctcattgg tccagacgag gaaggtatcc agcctcaagt catcattgtg 003040
003041 gccaggaaga tggaatacac caaatggaca ggcctggcat gtacccacag agactgagag ttggtgctgg tggttgtggt 003120
003121 ggcagatgat attacctgaa gaagggacga atgggtgctg ggcaggacaa agcatcagct gtccaGTTCA GGCCTCTCCT 003200
003201 CTTTCCCTGG TGTCTTCATT TTCCTCCGTC TCCCTGCTGT CCCTTACCCT CTGCCCAATC TCTCATTACT CCTGGTCTTG 003280
003281 GGAGTTGCCT TCTGAGGATA CTCCACTGGG GGTACCTGAG CCTGGATTAG AGGGCAGGGG GAGGATATTG CCTAGCCAAA 003360
003361 GTGGGTGTTC AATAAAGAAC CATTTGGAGA TGGTCTTCTG TCTGG
[back to top]

Predicted Small Protein

Name NONHSAT141131_smProtein_2081:2317
Length 79
Molecular weight 8683.1513
Aromaticity 0.0641025641026
Instability index 74.5628205128
Isoelectric point 11.5545043945
Runs 7
Runs residual 0.0439337085679
Runs probability 0.0318091200445
Amino acid sequence MWPQLATKEAVNLVSTWAAKTLRKEILLVKRSGGRMAIGRQQADSTRPLCWLPFLLLHMS
LRRAFGAMTADSSRPASS
Secondary structure LLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLEEEELLLLLLLLLLLHHHHHHHHHH
HHHHHHHHLLLLLLLLLL
PRMN -
PiMo -