NONHSAT104852

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104852

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2590 nt

Genomic location

chr5+:159895253..159914700

Exon number

2

Exons

159895253..159895447,159912306..159914700
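
The exon coordinates can be cross-checked against the Length field with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state; `exon_lengths` is a hypothetical helper name used only for illustration.

```python
def exon_lengths(exons: str) -> list[int]:
    """Return the length of each exon in a 'start..end,start..end' string.

    Assumes 1-based, inclusive coordinates (not stated on the page).
    """
    lengths = []
    for interval in exons.split(","):
        start, end = (int(value) for value in interval.split(".."))
        lengths.append(end - start + 1)
    return lengths


exons = "159895253..159895447,159912306..159914700"
lengths = exon_lengths(exons)
print(lengths)       # [195, 2395]
print(sum(lengths))  # 2590
```

Under that assumption the two exons sum to 2590 nt, which agrees with the Length field.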

Genome context

Sequence
000001 CTCTTTCTCC AAGACGCTTG ACCGCTCTTC CTTTCCTGGA TGGCACCAGC AGGGCCGATT GGAGTGGTAA ACCCTGGGCC 000080
000081 GGAAGGCATG CCAAAGGGTG GACAGGATGG ACAGGAGACA GTAGCACAAC GAGGAGGGGG AGAACAGCGG CTGAATTGGA 000160
000161 AATGATAAAA TAAAATGAAA TTTTAGGAGC TCGCTGGCTG GGACAGGCCT GGACTGCAAG GAGGGGTCTT TGCACCATCT 000240
000241 CTGAAAAGCC GATGTGTATC CTCAGCTTTG AGAACTGAAT TCCATGGGTT GTGTCAGTGT CAGACCTCTG AAATTCAGTT 000320
000321 CTTCAGCTGG GATATCTCTG TCATCGTGGG CTTGAGGACC TGGAGAGAGT AGATCCTGAA GAACTTTTTC AGTCTGCTGA 000400
000401 AGAGCTTGGA AGACTGGAGA CAGAAGGCAG AGTCTCAGGC TCTGAAGGTA TAAGGAGTGT GAGTTCCTGT GAGAAACACT 000480
000481 CATTTGATTG TGAAAAGACT TGAATTCTAT GCTAAGCAGG GTTCCAAGTA GCTAAATGAA TGATCTCAGC AAGTCTCTCT 000560
000561 TGCTGCTGCT GCTACTCGTT TACATTTATT GATTACTTAC GATGATTCAG GTACTGTTGT AAGTGCTTTA CATGCTGTTA 000640
000641 TACGAGACTC TTGGGAGAAA TCACTTTAAT GAAGCTTGAG ACACATGGCA TTGCCATGCA ATGATTTTTC CCCCCTCTTC 000720
000721 ACGGGATCAG AGGGAACTAA TAGAATGTGA CAATGATTCT TTAGCAGGGA CTGCTGAGGC TTCTGGTTCC TTTTTAAGAT 000800
000801 CTGCAGTGAA AGAAGATGAG AAACATGGAT ATGCCCTTCT TTTGGTCCCC CTCTTCCTTT ATTTGATCTC TACTTCCTTC 000880
000881 TATAAATATA TTAGGGCTAC ATTGTCCCTT TGTATTTCAA ACAAGGCAAA AAGAGGTTGT AATTACACTT TACTGCAATC 000960
000961 CTCAGTTTCT CCAGGGAACA GGAATGCAAA GGCTTTGAAG GCCTCTCTAT TTGCTGACAT GGTCAGCTGG GTGCCATGGG 001040
001041 CCAAGTCCTT CTGTTGCCCT CCTCTGTCAC CAAGTAAGCT AGGTCCTTTC TGAGGCTCAG GTTTGCTGTG ATGATGATCA 001120
001121 CTTTTAGGCA GAAGGTTAGA GGCCTCATGA GTGCTATATG GACTTTATTA GGCTTTAGAT TTGATGGGGA ATAAGGGATG 001200
001201 TGATTTGTCT TTTGGGAACT CATCTTTGAT TCATCATTGT CTCTTGGTAT CTTGGAATTT CCATGTCATT ACAGTCTACA 001280
001281 GAATGAAAGA GTAACCTGTC CCAGAGGAGA GGCAGGTGAA AGACTCCACA GCATGCTCAT TCTCATTCTG TCTTCTCAGT 001360
001361 GACACCGAGG TTTACTGAGT GCCCACTATG TGCCAAGCAC TGTGCTCAGG GCTTTCTTTG TATGCATGAT CTCAGTGAAT 001440
001441 CTCACCAAGC CTCATCTGGA AAACGGGGAC AAATTAACAA CAGGATGGCA AATTGAAAAA CACGTAACCA TGTTCTACAG 001520
001521 ATGGAAAGGG GTGCTTGGTT ATTATGAAGG CCCCCTCGCA AGCGTGTGGG ACATGGGTGT GTTCTCTGGG TTGTACTGAT 001600
001601 CAGATCAAGG ACCTCCCCCA CCCTTCTCAC ACTCTGCCCA CTTCCGCCCT TTGCTTATCA GACCCTTAGC CAGTGACTCA 001680
001681 TTCCAGAACC AGAACCTTGG TGAAATCTCA ACCGACACCA GAGATCGGTG TCTTCAGTCC TAGACTGATG GAGAAAATCC 001760
001761 AGAATATATA CTAGAAGCTC CAAATGCTCT GGGTTTCAGC TCCTCTGTGC TGTGGACACT GACTTTGGCT CAGAACTCCG 001840
001841 ATTTAGTACA AAAGGCTCAT TTTTATTTCA GGGGCACTCT TCCTAAAGCA AACCTAATAA ATGAAATATG GAATTCACAG 001920
001921 ATACACACAC ACATTAAAAA ATTAACCTAG TGTATCTGTG AGGAGTAGGC AGAAATTCAC TGTATAAAAG AATGCTTCAT 002000
002001 TTCATAGAGA ATTTGTGTTA AGATTCCATT AGATAGTACA TTTCTCAAAG ATTTTTGAGG TTGTATTTGC TTTACCAAAA 002080
002081 CTTGGTTTAT GTAAGTGGAA AAAGCATGTT GCAAAATAAC TTGGTGTCTA TGATTCAGTT TATGTAAAAT AATAAATGTA 002160
002161 TGTAGGAATA CGTGTGTTGA AAGATGTACA TCAATTTGCT AACAATGGTT ATCTCTGACG TGGTGGGATT TGAGATGTGT 002240
002241 TTTTCTTTTT GGTTGTATTT TTCTCTATTG TTTGACTTAA CACAGAACAT GTTTGGTTAC AACAATAAAG TTATTGAAGA 002320
002321 CAATATAACA CCCAATTGTG ATGAGTCCAG ACAGTACTCA TGCTGTGCAT TTATTAGACA GTATGCCATA GAGGTGGGTC 002400
002401 ATGGAGGAGT GGAGAGAGAG CCCTGATTTG TTGTCCCATG AAGAAGATAG GCACAGGGTA ATATGTAGTG TGATTCTTTT 002480
002481 TTTTTTTGCA CGAGGAGAAG TGTGTGTGTA TCCGTGTAAG TGTATGGATA TCCAGTGGGT AGTGGGTGTT AAGATTGCAA 002560
002561 GTGATTTTTT TCCCCTCTCC CTTAATCATG
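
The listing above interleaves position counters with 10-nt blocks, so it has to be cleaned before use. A minimal sketch of one way to recover the plain sequence follows; `parse_numbered_sequence` is a hypothetical helper, and the sample holds only the first two lines of the listing.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Strip position counters and whitespace from a numbered sequence listing."""
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            # Keep nucleotide blocks; the six-digit counters do not match.
            if re.fullmatch(r"[ACGTN]+", token):
                blocks.append(token)
    return "".join(blocks)


# First two lines of the listing, copied as they appear on the page.
sample = """\
000001 CTCTTTCTCC AAGACGCTTG ACCGCTCTTC CTTTCCTGGA TGGCACCAGC AGGGCCGATT GGAGTGGTAA ACCCTGGGCC 000080
000081 GGAAGGCATG CCAAAGGGTG GACAGGATGG ACAGGAGACA GTAGCACAAC GAGGAGGGGG AGAACAGCGG CTGAATTGGA 000160
"""

seq = parse_numbered_sequence(sample)
print(len(seq))  # 160
print(seq[:10])  # CTCTTTCTCC
```

Applied to the full listing, the same function should return a string whose length can be compared with the Length field.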

Predicted Small Protein

Name NONHSAT104852_smProtein_2183:2425
Length 81
Molecular weight 9338.587
Aromaticity 0.1625
Instability index 52.05875
Isoelectric point 4.31390380859
Runs 11
Runs residual 0.00131578947368
Runs probability 0.043487337605
Amino acid sequence MYINLLTMVISDVVGFEMCFSFWLYFSLLFDLTQNMFGYNNKVIEDNITPNCDESRQYSCCAFIRQYAIEVGHGGVEREP
Secondary structure LEEEEEEEEEELLLLEEHHHHHHHHHHHHHHHHHHHHLLLLEEEELLLLLLLLLLLLLHHHHHHHHHHHEELLLLLLLLL
PRMN -
PiMo -
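
The page does not say how the predicted protein or its properties were derived. As a rough consistency check, the sketch below translates the open reading frame with the standard genetic code and recomputes aromaticity as the fraction of F, W and Y residues. That definition is assumed to be the one used here. The ORF was copied from positions 2184 to 2426 of the Sequence listing as numbered on the page, which would fit the 2183:2425 in the name if those are zero-based offsets (also an assumption). `translate` and `aromaticity` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt) - 2, 3))


def aromaticity(protein: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


# ORF copied from the Sequence listing (page positions 2184 to 2426).
ORF = (
    "ATGTACA TCAATTTGCT AACAATGGTT ATCTCTGACG TGGTGGGATT TGAGATGTGT "
    "TTTTCTTTTT GGTTGTATTT TTCTCTATTG TTTGACTTAA CACAGAACAT GTTTGGTTAC "
    "AACAATAAAG TTATTGAAGA CAATATAACA CCCAATTGTG ATGAGTCCAG ACAGTACTCA "
    "TGCTGTGCAT TTATTAGACA GTATGCCATA GAGGTGGGTC ATGGAGGAGT GGAGAGAGAG "
    "CCCTGA"
).replace(" ", "")

# Amino acid sequence as listed in the table above.
PROTEIN = (
    "MYINLLTMVISDVVGFEMCFSFWLYFSLLFDLTQNMFGYNNKVIEDNITPNCDESRQYSC"
    "CAFIRQYAIEVGHGGVEREP"
)

translated = translate(ORF)
print(len(ORF), len(translated))          # 243 81
print(translated.rstrip("*") == PROTEIN)  # True
print(aromaticity(PROTEIN))               # 0.1625
```

The translation gives 80 residues followed by a stop codon, so the Length of 81 appears to count the stop. The recomputed aromaticity of 0.1625 matches the table.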