NONHSAT138743

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT138743

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3041 nt

Genomic location

chrX-:135998786..136079662

Exon number

14

Exons

135998786..135999041,136004767..136005239,136005653..136005728,136005751..136005802,136005810..136006076,136007406..136007538,136026375..136026806,136039392..136039564,136057615..136057678,136067205..136067229,136067436..136068038,136075645..136075821,136077059..136077154,136079448..136079662

Genome context

Sequence
000001 GTAGAAACAT GCAGTTATCA TTTATTTCAA TTCTTTTCTA AACACAGACT TCCAATTCCA ATAATGCAAG TATAGCAAAT 000080
000081 GCATTAGTTT CTTATGGCTT TACCTCAGCA CATAAATGAA AATAATACAG CAAAACTGTA ACTTCTGAGC AAGTAATCAC 000160
000161 AAGAATTATA AAGACTAAGA AAAGGAAGCC AAACATGTAG TACAGCTAAT GAGACCAAAT ACTGTTCAGA ATGAAAAAGA 000240
000241 GTTGAATAAA AATACAGCCG AAGGGGAAAA TGCCACCAAT TATGATGCCA AGAAGTGGTT TTGTAAAGAA GGTATTTTAA 000320
000321 GAGTTTACCT TTTCTTTATT TCCAAAATAT GCACCAAAAA ATGTTAGTGG TACTAAAATT CTAAACCAAA TTGTAAGGAT 000400
000401 ATCTAGCAGA GTAGCAAAGG AGACGGCAGC TGATGACCCT TCCATCCAGA GAATTTTCAT CATAATAAAG AGATCAGCAA 000480
000481 AGACAATTTC CAGTTTTTAA ATGGGGATTT TAATAGACAT ACTTCCAAAG AAGACATGCA AATGGTCAAT AAGCACATGA 000560
000561 AAAGATGCTC AATATCATTA GTCATCAGGG AAATGCAAAT TAAAACTATA AAATAGGCGG TGCACCCTGG GATAGGGAGC 000640
000641 GATCTCCAAG CGAGGCGGCA AGATGGACTC AGGATTCTTC CACCGATCAA GTGCAGAACA GTATAATCGG TTCAGCAACA 000720
000721 AACAGAAGAA ACTACTGAAG CAGCTGAAAT TTGCAGAATG CCTAGAAAAA AAGGTGGACA CGAGCAAAGT AAATTTGGAA 000800
000801 GTTATAAAGC CTTGGATAAC AAAAAGAGTA ACAAAAATCC TTGGGTTTGA AGATGATGTT GTGATTGAGT TTATATTCAA 000880
000881 CCAGCTGTAA GTGAAGAATC CAGACTCCAA AATGATGCAA ATCAACCTGA ATGGATTTTC GAATGGAAAA AATGCTCGAG 000960
000961 AATTTACGGG AGAACTATGG CCCCTGCTGC TAAGTGCACA AGAAAACATC TCGGGAATCT CTTCTGCTTT CCTAGAACTG 001040
001041 AAGAAAGAAG AAATAAAACA AAGACAGATT GAATAAGAAA AATTGGCGTC TAAAAGAAGG AAAAAACTCC AGAGCTTTTT 001120
001121 TTTTTTTTAA ACTAAAGCTA TATAAAGCTT GTGGATTAAA CAGAATAAAT TTCTAAATTT TGGTCTTAAA TTCCTGGGTG 001200
001201 CAGCAATTTG TCTACCTTAG CCTCCCAAAG TGCTGGGATT ACAGGCACAA GTTATCACGC CCTGCCCATT TCTTATATTT 001280
001281 TACATTTTTT GTATGTTAAG TATTTTGGTG GTGACTGCCT GTTGTCCTAG ACACCCAGGA AGCTGAGGTG GGGAGAGGAG 001360
001361 TGATGCAGCT AGCAAATGCC TGGTCAGGGA TGTGGAGACT TACTACTCAA AATGTAAAGA GGAATAAAGG AAAAATAGGA 001440
001441 GAAAAATAAA AAGGAGAAAC AGTGATGCTT CCCTTTTGAT TTACTTACTC AGCATACAAT TTATCCCTTG GCTTCTGTGT 001520
001521 AATGAATAAT AGGTAATCTT TATGGAATTG TCTATACATG TGTATGCATA ATAAAACCAT TCACTCACAG TTGGGAAAAA 001600
001601 TTACTGATGT GCATCTCTTT CTATTCATTT TTGTAACCCT AGTGCCTGGT ACATAATATG CCCTTTTAAG TGTTTCTAGA 001680
001681 ATGAATAAAT TCTATTTAAA GCTGAAAAAC TTTTTATTTT ATTGTTTTAT TTTATTTTAA GACAGAGTCT CTTATTTTGA 001760
001761 GATGGAGTCT CACTCTTTCA CCCACTCTAC AAACTGGCAC TAAAGCAGGT TTGTCAGTAG AGGGCGCTGG AGAGACATCG 001840
001841 CAAGGGAAAG GGTTTATTTC CTGGTTGATC TTGGCTCACT CCACAGGCTT CTGTTTTCTT TCAGCATGCT GCAGCCCAAT 001920
001921 CGTTTAAGCC CAGGAGTTGG AGACCAGCCT GAACAACGTT GTAAGACCCC ACCTCTACAA AGTCCGTATC CTTTGACCCA 002000
002001 GCAAATAATT AGTAATTATC CTAAATAATT AAGACTACAT AGATGGCTGT ACAACAATAG CTGTCATTTA TTGAGTGCCT 002080
002081 ACTATGTATT AAACACTGCT CTAGCACTTT ACAAGTATTA TCCTATTTAT CCTTGTAACA ACCCTGTGTA TTACAGTACA 002160
002161 CACGATTCTT TTTTAGAGAC GGAGCTGTTG CCAGGCTAGA ATGCAGTGGC GCAATCTTGG CTCACTGCAA CTTTGGTGGG 002240
002241 ACTACAGGCA CGCACCACCA CGCCCAGCTA ATTTTTATAG TATACGCTAT GATTATTTCC ATTTTACAGA TGTAGATATA 002320
002321 AAATATGTAT TATATATATA TGAACATGGA AAGAAGTCCA TCATAAACTT CTCAGTAAAG ACATAAAAGA TCACATACAT 002400
002401 ACATATATAA AATCTCACAT GAAGTTATAC ACAGTATACA TAAAAAAAAT TTATTAAAAT TGGAGAGGAT TACGAGGAGA 002480
002481 CTCTTGGAAC TCTATACCAA ATAAAGAGTT TTGTAGGAGG TTTTCACAAG CCTATACTAC TTTTGTAATT GTAAAAACCA 002560
002561 TAATGTTTCA TAATAAGGAG GTGTGAGGTG GGGGTCTCAA AGATCCAGAG ACCTACCTGG ATCTCTGGCA TGGTAGTGTG 002640
002641 TCCAGAAATG GTGGGTTCTT GGTCTCACTG ACTTCAAGAA TGAAACCGCG GACCCTCGCG GTGACCGTTA CAGTTCTTAC 002720
002721 AGGCGGTGTG TCTGGAGTTT GTTTCTTCTA GACGTTCGGA TGTGTCCGGA GTTTTTTCCT TCTGGGAAGA GGAAATTCAC 002800
002801 AAGAAAGAGT ATCACTATAT AAAAGAAATA GCAAAAAGTA GCTGCTGGTT TTATTTAAGC TTTGTTGTTG TTTTTTTTTA 002880
002881 GACAGAGTCT CACTCTGTCG CCACGCCTGG ACTGCAGTGG CGCAGTCTGG GCTTACTGCA ACCTCCACCT CCTGGGTTCA 002960
002961 AGTGATTCCC GTGCCTCAGC CTCCCGAGTA GCTGGTACTA CAGGCATGCA CCACCACGCC CGGCTAATTT TTTTTTTTTT 003040
003041 T
[back to top]

Predicted Small Protein

Name NONHSAT138743_smProtein_662:889
Length 76
Molecular weight 8896.2554
Aromaticity 0.12
Instability index 40.1946666667
Isoelectric point 9.37371826172
Runs 11
Runs residual 0.0022695035461
Runs probability 0.0381798911211
Amino acid sequence MDSGFFHRSSAEQYNRFSNKQKKLLKQLKFAECLEKKVDTSKVNLEVIKPWITKRVTKIL
GFEDDVVIEFIFNQL
Secondary structure LLLLEELLLLHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLEEEELHHHHHHHHHHH
LLLLLEEEEEEEELL
PRMN -
PiMo -