NONHSAT138334

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT138334

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2491 nt

Genomic location

chrX+:118827531..118830022

Exon number

1

Exons

118827531..118830022

Genome context

Sequence
000001 ATACTCTGCT CAGCGGCCGA GCTCTGCAGT TCTCCTCCCG GCCCCCGTTA CCCCCCTTCC TTTCCACCGA GATCCCAAGG 000080
000081 TGCTTAGATG GGTGCGCCTT GGCCGCTGGA GTCTCGATGC CCCACTGTCG TGTCTCCCTC CCTCCATGGC CCGGGAGCGC 000160
000161 AGCTGCGGCC GAGGTACCGG GTAGGCCTGC GGCACCTGCC GCTCCACAGT AGAGCTTAGA AAGGGGGCCC GTGGGGGGAG 000240
000241 CCGCACTGCA GCCCGGCAAC TGAGCTGAGC TGTGGCATGT GGGGTCCGAG GCTAGAGGTC TTGAGACGCT GCTTCTGCCC 000320
000321 TGGGGTTTCC AAGCCTTGGG TGCTGTGGGC TGTAATTTCG TGGAGCCAAG CTCACCACCT TTGGGTACCT AGTACAGACG 000400
000401 GAACCCACGC CTCCTCTCTC CGACCCAGCC CTAGACGCTC AGCAAGACCA CCCACCCCTT CTAGTACCAG AGGAAGGGCT 000480
000481 GGAGATGAAG GTTCTCAGCT CTATGGAGCG AAATGGAGGG GCGGGTGACG GGGAGCGGCC TCCAGTCACT CTGTCCCGCC 000560
000561 CCCTGATTTC TCCTTGACCC TCATTCAGCC CGGGACAAGG CGGCCAATAG GGCGCAGAGG AGAGAACTTC CGAAGGCAGG 000640
000641 GGTGCGCCCG CCCGCCCTCC TTATCCCCGT GGCCCACCTC CGCGGGTCCT CTCTCGTGGG CACCTCTCCA CTTCTCCGGC 000720
000721 CTTGGCGCCC CCCCTCTCTC TTGGGACTGG CCTCTTGCTA GCTCCCTGTC ACTGGAGCAC GCTGGCAGAT TTGGCATGCC 000800
000801 CGAGGCCCCA GCGGGAGGGA CAGTACATTC CACTCCTCAA AAGGGCAGGT TCCCTGGTGT GGGAGAGAGA CCTGGGCCCG 000880
000881 CCAGAGGGAC CCCAGATGGG GGAGGGGATG ACAAGGCTAT GGAGGAAACC CGGGCCCAAG AGGGGGCCCC AATGGAGCAT 000960
000961 GGAAGATGAC CCCGTAAAGG CCCAAGGCTG GAGCAGCTGA GGAGGAGGCA GGAGAGAGGG AAAAAGAAAA GAAGGAACCA 001040
001041 GGGAGAGAGA GGAAGAAGGG ACTCGATCCA GAGGCCCTGG TCTCGGAGAA GTCAGGGAAG ACAAGAAAAT GGAGTAGCCA 001120
001121 AAGAAGAGCA TGCCTGGCAC TTGGTTCCAC CTGCTGGATG GAGCTTCTCA CACTCTCTCG GGAAGCCCTT TATGTATGTA 001200
001201 CCCTGTGCGC CTGGAGAACA AGGACTCAGC AAACGCATCC CAGGGCCGAG TGCAGGGCCT CGCAGTCAGG AGGCACTCTA 001280
001281 TTAAGCTGAG GGCAGAGAAC TTGGATGCGT GCGAGGGGGG CGGGGAGAAG AAAGAAGAGA CACAGTGAGG GAATAGCAGT 001360
001361 AAGGACAGTC ATGAAGAGCC AAGTATTCAG GGAGCGCCAC TGTGCGCCTT CATTGTGCTG GAGACCCAGG AGCGGCCTAA 001440
001441 AGAGAAGAGC CTGGCAAAAT CAGGGATCAG CGCAGAGGAG CTCCGCGAGA CAGAAGGCGC AGGCCATAGG GCGGATGGCG 001520
001521 GTGGCAAGGA AGAGGGGCAG CTGGAGTCTT TCAGACTCTC TGGAGCTTTG GGGCAGGGCC TGAGCCCTGG ACTGGGTTGG 001600
001601 CAGTGACACG TGGGACCAGT GCTATATCTG AGGAACCACG GTGAGGCTTG GGCCCCAACC GCATGGGGGC GGGGAGTCAG 001680
001681 AGGTGACTGG CGGCTTCCAG GCTGGGTGTC TGGGACAGGG CTCTTCCCTG GCAAGGGCTA GGGCGTGGTG GACTGAAAGG 001760
001761 CCCCGAGGTC ACTAGCCCCG ATCTGGTCTT GGGGTTTGGG TGGTGGTGGG ACAAAAGGGG AGAATTTGGA AGCCAGAGCA 001840
001841 GAAAGCTGCT GCCGGAACGG GAGGGGACTG GAGGGCTGCG GGTCTCCCGG GAGAATCCCC CACACACTGC CCCGCCCGCC 001920
001921 TTAGCACTCC TCGCGGACTG AAGCTCCGTA CTTACCGGCC CAGCGCTGGG GGCACCTGAG CACGGAATCC TTGGTGTCTC 002000
002001 ATCCATATGC CCCATTCGGC CCTCCTGTCC CTAGAGACTA CCAGAACCTC ACTGCGTCCT CCTCCTCCAA AACGCTGAAC 002080
002081 CTCGAAGACC TGAACTAAGT GCTTAATTCA AGTCTCCTGG GACCCTTTGG TTCCTCCAAT ACCTGCCCAC CTGCGGCTAC 002160
002161 AAGTCCCTCC TGGGGCCTGG GCCTCCAGCC TCCTTCGGTG GGGTGTGGGT CAGTGTGTGT GTTGGTCTTG ATGTGTGAGT 002240
002241 AGATGTGTCG GTGTGTTCGT GTGTATTGTC AGTGTGTGCT AGCTGTGTGT CTGTATATCT GTGTGTCAAT GTGTGTGTTG 002320
002321 GCATGTGTTG GTGTGTCAGT GTATTTCAAT GCATGGTGAG TGTATGTGTG TCAATATGGA GAGAGGTGTT AGCTCACTAG 002400
002401 CTTTACATAC CTTATCTCAC TCAGTGCAGT GGGCCGACGA CTGAGACGTG TGAATCAGAA CATTCATTTC AATAAAGAGG 002480
002481 TTTGGAAACT C
[back to top]

Predicted Small Protein

Name NONHSAT138334_smProtein_1304:1582
Length 93
Molecular weight 10501.0217
Aromaticity 0.054347826087
Instability index 60.9304347826
Isoelectric point 12.2002563477
Runs 12
Runs residual 0.00477009696591
Runs probability 0.0491152917624
Amino acid sequence MRARGAGRRKKRHSEGIAVRTVMKSQVFRERHCAPSLCWRPRSGLKRRAWQNQGSAQRSS
ARQKAQAIGRMAVARKRGSWSLSDSLELWGRA
Secondary structure LLLLLLLLLLLLLLLLLEEEEEEHHHLEEEELLLLLLLLLLLLLHHHHHHHLLLLLLLLH
HHHHHHHHHHHHHHHHLLLLLHHHHHHHHLLL
PRMN -
PiMo -