NONHSAT138909

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT138909

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2644 nt

Genomic location

chrX-:146990949..147003676

Exon number

3

Exons

146990949..146992513,146992896..146993746,147003449..147003676

Genome context

Sequence
000001 CAGTCATTCC ATTAGTTTAA TACTGAGTTG TTCAGAAACT TCTCAGTCAG TAACATGAAG ATCCATTCCT GAAAAGCACT 000080
000081 CAAACTGGAC TTGAATTTTT AAAAAATGAA GAATAGAATT AGCATAAAAT ATTAAAGAAC CTCATCATTA AAATTATATA 000160
000161 ACGAGACACT TACTTGTTTT CAAATGCAAC TGTTATTGAA TCTTCATGAA CATCCTTTAC AAATGCCTTG TAGAAAGCGC 000240
000241 CATTGGAGCC CCGCACTTCC ACCACCAGCT CCTCCATCTT CTCTTCAGCC CTGCTAGCGC CGGGAGCCCG CCCCCGAGAG 000320
000321 GTGGGCTGCG GGCGCTCGAG GCCCAGCCGC CGCCGCCGCC GCCGCCGCCG CCGCCTCCGC CGCCGCCGCC GCCGCCGCCG 000400
000401 CCGCCGCGCT GCCGCACGCC CCCTGGCAGC GGCGCCTCCG TCACCGCCGC CGCCCGCGCT CGCCGTCGGC CCGCCGCCCG 000480
000481 CTCAGAGGCG GCCCTCCACC GGAAGTGAAA CCGAAACGGA GCTGAGCGCC TGACTGAGGC CGAACCCCCG GCCCGCTGCG 000560
000561 GGTGTAAACA CTGAAACCAC GTCACGTGAT CAACGCTGTT CCCTCCCCCC GCGGGCTCAG CCCCTCGGCC CCGCCCTCTC 000640
000641 TCTTCAAGTG GCCTGGGAGC GCGCGCATGC GCGCTGCTGG GAACCGGCCG GGGTGCCGGG TCGAAAGACA GACGCGCGGG 000720
000721 CCGGGCGTGC GCGGGCTTGG TGGAGGGCGG GAAGGCTGAA GGGCGGTGAC AGGTCGCACT GCCTCGCGAG GGCCAGAACG 000800
000801 CCCATTTCTG CAGAGGTGCA CTCAGTGGCG TGGGAAATCA AATGCATCCG GTTATCCCAG TTCGGCCTCT CTGGGATTCC 000880
000881 GCGGGAGGGG GTGTCTGGTC TGGTTTGGTT TGGTTTGGTT TGGTTTGGTT TGGTCCGGTT CAAAGTAGCG CAGTCTGACT 000960
000961 GAGCGGGAGG TGGAGTGGAA GGCGAGAATA GGGGTGAAGG ATTAGACAGA AGAAGACTTT GAACTAGGAA CAGTGGCAAC 001040
001041 CAGGGTGACC CAGGCTTTTG TGACCCGTAG AGGCAGAAGC TGCCTTAATC CATGTTCCCT TCGGGATGCT GGTATCCAAC 001120
001121 TGAGAAGTTG ACGGAGCATC TATCGTGTGC CAGACACTGT GCTAAGTGCT AACGAGAAAT CGGTGAGCAA AACAGAAAAG 001200
001201 AAAAAAAAAG AAGAAACGAC TGCCAGCATT GAACTTAGTA CCTAGCAAAG TAGGTGGACA TAAATCAAAT TGTCAGACAA 001280
001281 GTAAATGAGT AGTTGCAGCT GTGATAAGGG GTATGAAGAA TAGGCGCTTC CATGATGGCG GAACATGACC TAGTCTGGGG 001360
001361 TGGAGAAGGG GAGATGATTC CAGCTGGAAT ATGGCAGGTG AGAGTTAACT AATAGCACTG AGTTGGCAGA GGCAGGATGA 001440
001441 CCTGCCCCAG GCAGGTGCCC CAGAATGAGA GGATGTTGCT GCTGGTGGAA CTCCAGCTTT AAAGCGGTGG ATCTAGGGCC 001520
001521 ACATTCCTCA AGGCCATAGC AAACAGGAAA GACCTTTTGG ACTAGGGTCT TTGGAGTAGT CACATGAGGC CAAGAACCCA 001600
001601 ATAGGGCTGA AAGAGAATCT CATCATAGGC TGAATGGGAG CAGAGCATTA GCTGCAACTC CAATGTATTA TAGTAAAAGG 001680
001681 GGACACATTG ATTAGAATGA TTATACCCCT CCAAGTGTAA GCTGTTGTTC AATTTACCCA GGGCTTAAAA ATATGTCAAC 001760
001761 TCTTCACAAC CATTTTTACT ACAAGCCACA CTCAACCTGT GTTGTTCCCT GAACCTGTGA TGCACTTTCA TCCGTTCATG 001840
001841 TCTCTACACA TCCTTCCTGG GCCCCATAGA GGCCCTGCAC CTTAAGGTTT TCAGCTGGAT TCCTATTAGC TAGTTATTTT 001920
001921 ATCTCAGGTA CAGAGAGTTA ACTTTGCTCC TCAATTTTCA GCACATTGTC ATATCAGTCC CTACAGGGTC CTTGTTTACA 002000
002001 TTATTTTATA GTGTTTATAT TATTTTTTCC TCATCTTTCT CGATCCCCAC TCTCATTCTT CTAGACACAA GCTGCAATAT 002080
002081 GTTCGATAGA GATCCTTGAT AATTTAACAC AGATAATGTT TTGTTTATGC CTCTTGCATT TACAGAGATG GTATTGGACT 002160
002161 GTAAATCTCT ATTTCCCACT GTTTTACTCA ACAGTCTGTT TTTGAGCTAT ACCCACATTG CTGTGTGTAC ACCTAGTTTG 002240
002241 CTACCGAGTA TGCCAGAAAT TTTATTCATA CATATCCCTA CTGATCGGGC ATCTAGGTTA CTGCCAACTC CCTGAAGTCA 002320
002321 TACTGTGGTG AAAACCCTTG TGAATGTCTT TACAAACCTT TGCCAGAACT TATTTTGTGC TCTTGTAGAA CTTTGTGCAT 002400
002401 ACCCTCTGCT TCCTCATTAC ACTGCTTGGC ATATAGCAGA GACTCAACAA ATGTTGAATA CCTCTACTAT GATGGTTAGT 002480
002481 GTACTAGAAT AATTTATTTA TATTTCTGTT TTCCTTATTG GATTATTAGT ACCTTCCAGA CAGGGCAAAT GTCTTTTCTA 002560
002561 TCTTTGTGTC CCTGGCACAT TAAAAGTGGT TGATTTCAAG GTATGTTCAA TGAATGAATA AATACTTATC TTCACTGCCA 002640
002641 GTAC
[back to top]

Predicted Small Protein

Name NONHSAT138909_smProtein_2126:2314
Length 63
Molecular weight 6914.2423
Aromaticity 0.0806451612903
Instability index 73.1467741935
Isoelectric point 5.97674560547
Runs 11
Runs residual 0.0237788018433
Runs probability 0.0867177337767
Amino acid sequence MPLAFTEMVLDCKSLFPTVLLNSLFLSYTHIAVCTPSLLPSMPEILFIHIPTDRASRLLP
TP
Secondary structure LLLHHHHHHHLLLLLLLHHHHHHHHLLLLEEEEELLLLLLLLLLEEEEELLLLLHHHLLL
LL
PRMN LLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLL
LL
PiMo oooooooooooooooooooTTTTTTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiii
ii