NONHSAT119502

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT119502

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2564 nt

Genomic location

chr7-:23140847..23145322

Exon number

3

Exons

23140847..23143197,23143869..23143936,23145178..23145322

Genome context

Sequence
000001 ACACAGTGGC GCGCCGTTCC GAGCAATGCG ACTGCAGGCA CAGGTGCGGC CGGATTCCCT GTCGCTGTCC CGCGCTGAGT 000080
000081 GAATTTTGCC TCGGAGCTGG TCCGAGGTCG CGCCTGCTCT TTTGTGTGCC CAAGGCAGAT TTCCAGAAGT AGAAATCTGT 000160
000161 CTTCTAACTC TAAGGCCTTG GGAGATGAAG GATCCGTGAT TCAAAGGCAG AAATAGATGG AGTCTTGCTG TGTTCTCCAG 000240
000241 GATTGAGTAC AGTGGCTTAT TCACAGATGT GATCATACTA CACTACAGCC TCAAACTCTC GGGGTCAAGC AGTCCTCCTG 000320
000321 CTTCAGCCTC CTGAGGAGCT GGGACTACAG GCCCACATCA CCGTGCCCAC TAGATTCACA GTCATTTTGC TTGTGGAAGG 000400
000401 TCTTGCCTCA ATGTTGATAG GTGCTGACTG ATCAGGGTCA GGTTGCTGAC TGATCAGAAT GGTAGTTTCT GAATGTTGGG 000480
000481 AATGGCTATG GCAATTTCTT AAAATAAGAC AACAATGAAG TTTGCTGCAT TGATTGACTC CCTTTCACAA AGGATTTCTC 000560
000561 CATATCATGC AATGCTGTTT GATAGCATTT TACCCATAGT AGAACTTCTT TCAAAATTGG AGTCAATTCT CTCAAACCTT 000640
000641 ATCTCTGCTT GGTCAACTAA GTTTATGCAC TATTCCACAT CCTTTGTTGT CATTTCAACA ATATTTACAG AGTCTTCACC 000720
000721 ATGAATAGAT TCCTTTTCAA ACAACCACTT TCTTTGCTCA GCCATAAGAA GCAACTCTTC ACCCATTCAA GTTTTATCAT 000800
000801 GAGATTGCAG CAATTCAGTC ACATCAAATT CCACTTCTAA TTCTAATTGT TTTTCTATTT CCACCACATC TGCAGGTACT 000880
000881 TTCTCCACAG AAATTTCAAA CCTCTCAAAG TCATCCGTGA GGGTTGGAAT CAACCCAAAT TTCTGTGAAT GTTGATATTT 000960
000961 TGACCTCCTC CTAATCACAA ATGTTCTTAA TGGCATCTAG AATGGTGAAT CTTTTCCAGA AGGTTTTCTA TGTGCTTTGC 001040
001041 CCAGTTCCAT CAGAAGAATC ACTGTCTGTG GTAGCTCTAG CCTTATTAAA TGTATATTTT AAATAATAAC ACCTAAAAAT 001120
001121 CAAAATGACT CCTTGATCCA TGGGCTGTAG AATAGACATT GTGTTAGCAG TCATGAAAAC AAGATTAATC TCCTCATATA 001200
001201 GCACCATTAT AGCTCTTGGG TGACCACGTG CATTGTCAAT GACCAGTAAT ATTTTGAAAG GACACCTTTT TTTTTGAGCA 001280
001281 GTAGGTCTCA ACAGTGGACT TAAAATATTC AGAAAGCTGT GTTGTAAACA GATGTGCTGC CAGCTAGGTA TCGTTGTTCC 001360
001361 ATTTATAGAG CACTGGTAGA GCAGATTTAG CATAATTTGT CAGGACCCTA GGATTTTTGG AATGGTAAAT GGGCACTGGC 001440
001441 TTCAACTTCA AGTAACCAAC TGCATTAACT CCTAACAAGA AGAGTCAGCC TGTCCTTGAA AGTATGAAAA CAGGCATGGA 001520
001521 CTTCTCCTCT CTAGCTATGA AAGTCTTAGA TGGCATCTTC TTCCAATAGA AGTTGTTTCA TGTACATTGA AAATCTGTTG 001600
001601 TTTGGTGTAG CCACCCTCTT CGATGATCTT AGCTAGATCT TCTGGATAAC TTGCTACAGT TTCTACATCA GCACTTGTTG 001680
001681 CCTTACCTTA CACTTTCATG TTATGGAGAT AGCTTCTTTC CTTAAACCAC AGGAACGAAC CTCTGCTATC TTCTAGATTT 001760
001761 TCTTGGGCAA CTTCCTTACT TCTTTCAGCT TTCATAGAAC TGAATAGTGT TAGGATCTTG CTCTGGGTTA GGTTTTGGCT 001840
001841 TAAGGGAATG TGGTAGCTGG TTTGATTTAT TTAGACCACT AAAGCTTTCT TCATATCAGC ATTAAGGCTG GTTTGCTTTA 001920
001921 TTATCATTTA TGTGTTCACT GGAGTAGCAC TTTAAATTTC CTCCAAGCTT GTTTGCTTTT CACTCACAGT GTGGCTGTTT 002000
002001 GGTGCCAAGA GGCCTAGCTT CCCAGTCTTG GCTTTCAACA TGCCTTCCTC ACTAAGCATA GTCATTTCTA GCTATTTTAT 002080
002081 TTAAAGTTAG AGATGTGCAA CTCTTCTTTC ACTTGAACAC TTAGAGGCTA TTGTAGGGTT ATTACTTGGC TTAATTTCAA 002160
002161 TATTGTTGTG TCTGGAGGAA TAGAGAGGCC TGGGGACACA AAGAGAAATG GGGAACAGCT GGTCACTGGA GCAGTGAGAA 002240
002241 CACACGCAAC ATTTATTGAT TAAATTCGCT GTTTTATGGG CTCGGTTCAT GGTGCTCCAA AACAATTATA ATAGTAACAT 002320
002321 CAAACATCAC TGTTCACAGG TCACCATAAC AGATATAATG ATAATGATAA TAATAATAAT GTTTGAAATA TTGCAAGAAT 002400
002401 TACCAAAATG TGACTCAGAG AAACAATAAA GTGAGCACAT GCTGTTGGAA AAATGGTACT GATAGACTTG ATCCATGCAG 002480
002481 GGTTGCCACA AACGATTAAT TTGCAAAAAA AACAAACAAA AGAACATGCA GTATCTGTGA AGCACAATAA AGCATACTGC 002560
002561 AAAT
[back to top]

Predicted Small Protein

Name NONHSAT119502_smProtein_2288:2428
Length 47
Molecular weight 5421.3797
Aromaticity 0.0434782608696
Instability index 66.8130434783
Isoelectric point 5.37628173828
Runs 8
Runs residual 0.0125517598344
Runs probability 0.0307255785196
Amino acid sequence MVLQNNYNSNIKHHCSQVTITDIMIMIIIIMFEILQELPKCDSEKQ
Secondary structure LEEELLLLLLHHHLLLLEEHHHHHHHHHHHHHHHHHHLLLLLLLLL
PRMN -
PiMo -