NONHSAT053890

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT053890

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2616 nt

Genomic location

chr17+:42023509..42027711

Exon number

3

Exons

42023509..42024015,42024022..42025090,42026672..42027711

Genome context

Sequence
000001 GTTTTCAGTA ACATTCCCAC AGTGCTTGCC ACAGAGCCCT GCTTAAAGAA AGCACCAGTT CCTGGCCTGC AGACCCACTA 000080
000081 AATGAAGATG ATAGTGGTAC CTCTTCCCCT GCCCAACACC ACGCCCTCTG GCACAAGGTG GATGGCCACT GGCAGTGATG 000160
000161 GGGATGGAGT AAGTGCTATG AGACTAGGAG AGGAACACCC CAGGAACACC TGAGGTCACA GAGTGAATGA AGGCAGGTAA 000240
000241 AGGAGCAAAG GGAATGGGGG CCCCCACATA CGTGACCCCA GGGCAGCAGC TTGGCCTGCA CCGAGTCCTC CATCACACAC 000320
000321 ACTCCCCCGC CCCACCCCCG CATCCTGGCT GTCTAACCAC AATCCCTGCT GCCTTCAGTT AATCAACCAA TGAGTGGTCA 000400
000401 TTCATTGCAC ATCTGTTCTG GGCCTCATGG GGGCAGCGGG AGGTGATGAA GAAGCAAGGG TTCTACAAAG AGGTCAAAGC 000480
000481 TACGTGAGGA GCCGGGGAAT GGCTTGCTTT TTTTTTTTTT TTTTTTTAGA CGGGCTCTCA CTCTGTCACC CAGGCTGGAG 000560
000561 TGCAGTGGCA TGATCATTGC TCACTGCAGC GATGTCTCAT GGGCTCAAGG GATCCTGCCA TCTCAGCTTC CCAAGGGGCT 000640
000641 GGGACCAAAA GTGTGTGCCA CCATGGCCTG CTATTTTTTT TTTTATTTCT TGGAGAGATG GGCTCAAGCA TCTACACACC 000720
000721 TCAGCCTCCT AAAGTGCTGG GATTATAGCT GTGAGCCGTT GCACCTGGCC AGGATGGCTT TTTTTGAAAG TGAGGACACG 000800
000801 AAAGTTCAGA AAGGGGAAGA AACTGGCTGT GGTCCTGCAG CAAGCTGGCA GCCGTCCCTC ACCTAGCCCT GCGCTTCAGA 000880
000881 ACCTTACTCA CAGATCATCT TACAAATGCC TCTTTGCTAC TTTGTGAGGG AGGCAGGAAT TACTCTTCAC CCTTATCAGA 000960
000961 TGAGAAAACC AAGGCCCAGT GAGGTAGCTG AGGCTGTCTA AGGTTACATG GGAGTCAGGA AGGGTCAAGA CTAAACCCAG 001040
001041 GTCCACGTGG TTCCAGCCTG GCTTCATCCC TGTTTGCAGA CCCCCCAATC CCAACCCGCC ACTGGCCAGA GCCTGGCCCA 001120
001121 GCTCTTAGGC CAATGTCTTA TTCCAAAATT AAATGGCAGC AGCAGAGGAA GGGATAGATT GGGGGGTAGC TGCCCCCAGA 001200
001201 TGCCAGGACC TATGCCCTGG AGAAACCCTT GGGCTGGGGA TAGCTGTGAC ATGTACACAC ATACCTTGGG CCCCTTGTCT 001280
001281 CACCCACTCT GAGCCAACTC AGGACCCAGT GGGGGCTCAC TCCACTAATT TGGAGCCTGG GCTCTGCCTC CACTTCTCTC 001360
001361 TGCCCTGGGA ACTCCATCTG CCTCCAGGGC CTCCTGAAGA CACGCAGGGG AGAGTTCAGT ATTTCTCCCT TCCTGTCTCC 001440
001441 AGTGTCTGCA ACAGCAAATC CTATCATTGC CGCTCCAAAT TTTTCTCAAA GAGGAAGCCT TCTTTCTTGC TGAAATAGAA 001520
001521 ACTAAACCAA GCTACATAAT ACAGTTTGAG AACAGCCTGG CCAACGTGAT GAAACCCCAT CTGTACTAAA AATACAAAAA 001600
001601 TTAGCCGGGT GTGGTGGCGC GTGCCTGTAG TCTCAGCTAC TCCAGAGCCT GAGGCAGGAG AATCGCTTGA ACCCGCGAGG 001680
001681 CGGAGGTTGT CGTGAGCCTA GATCATGCCA CTGCACTCCA GCCTGGGCGA CAGAGCGAGA CTCCATCTCA AAAAAAAAAA 001760
001761 AAAAAATGAT GAGGTCATAC TACAGTGTGT GAACCCCTAA TCCTATAGGA CTGTTCGCCT TTGAAAAGGG GAAATTTAGA 001840
001841 CACAGAAACA GCCATGCAAA GAGGGAAGAT GATGTGAAGA GGCACAGCTC AGACACCCAT CTTCAAGGCA AGGAGAGAGG 001920
001921 CCTGGAACAG ATCAGAAGGA ATCAACCTGC CAACTCCTTC ATTTCAGACT GCTAGCCACA GGAACTGTTG TTTAAACCAC 002000
002001 CCAGTCTGTG ATTCTTTGTT ATGGCAGCCC TAGCAAACTA ACCAGCAATG CCATCTTCAT CTGATTCTCT TTAAATACTC 002080
002081 CAAGCAGGAG ACCAGTTACC ATTTAAGTCC AACTACCCCA AGAATATCAG ACTGGGGAGG CCACATGTAG GTACAATGGT 002160
002161 CAACAGCCCC GGCTGAACTC AACAGCCAGC ACTAGCATGC GAATGAGCCT TCTCAGGCAT CAGCCCAGTG GATCGTTAAC 002240
002241 CACAACCCTG CCAACCTCTG ATAGTATCCG CATGTGAGAC TCAAAGCAAG AACTGCCAGG GGGAGCACTT CCTGATCCTG 002320
002321 ACCCACAAAA TCCTGAGTGA TGTACAATGG TTGCTTGAAG CTGCCAAGTT TTAGGGTAAC TTATTATACA ACATAGTAAC 002400
002401 CAGAGTGCTT ACCTTCCCCT TCCCTCCCTT GCCACTGGCC GGAACACAGA TGAGGCCATG GTGGGCTGTC TTCAGTCCTG 002480
002481 TAAATGACAT AGATGAGGGC ATGGGGAAGC CACAAACGAG GAGCCTGGGC TCTTGGATGC CCTCATGGAG CACAACTGCC 002560
002561 AACTCCCCAC GTGCCACTCA CATTTCATGC AAGAGAAATA AATTTCCATC TTGTTT
[back to top]

Predicted Small Protein

Name NONHSAT053890_smProtein_569:865
Length 99
Molecular weight 10966.5407
Aromaticity 0.132653061224
Instability index 46.4235714286
Isoelectric point 5.82305908203
Runs 11
Runs residual 0.0267976349418
Runs probability 0.0335308570603
Amino acid sequence MIIAHCSDVSWAQGILPSQLPKGLGPKVCATMACYFFFYFLERWAQASTHLSLLKCWDYS
CEPLHLARMAFFESEDTKVQKGEETGCGPAASWQPSLT
Secondary structure LEEEEELLLLHHHLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLL
LLHHHHHHHHHHHLLLLEELLLLLLLLLLLLLLLLLLL
PRMN -
PiMo -