NONHSAT066604

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT066604

Source

NONCODE4.0

Same with

,

Classification

sense

Length

2577 nt

Genomic location

chr19+:44084790..44088116

Exon number

3

Exons

44084790..44084821,44085364..44085516,44085715..44088116

Genome context

Sequence
000001 GGGGTGGAAG GGACCACATC CAGCAGGCTG AGAGGGCAAG GAGTTGGTGC ACACCTACAA GGGCTGCATC AGGTCCCAGG 000080
000081 ACTGCTACTC CGGCGTTATA TCCACCACCA TGGGCCCCAA GGACCACATG GTAACCAGCT CCTTCTGCTG CCAGAGCGAC 000160
000161 GGCTGCAACA GTGCCTTTTT GTCTGTTCCC TTGACCAATC TTACTGAGAA TGGCCTGATG TGCCCCGCCT GCACTGCGAG 000240
000241 CTTCAGGGAC AAATGCATGG GGCCCATGAC CCACTGTACT GGAAAGGAAA ACCACTGCGT CTCCTTATCT GGACACGTGC 000320
000321 AGGCTGGTGA GTGGTGCCTG AATCTCTGGA AAAGGAAACA GAACTAGAGG TCCAAACTTC TAGGTTCGAT GGGAGGAGAG 000400
000401 GGTTCCAGAG AAGTGGGTGA GGATGTGTTC TGGGATTATG AGGAAGAAGG GGCTGAGGTC CCTGATTCCA GTCTGAAATC 000480
000481 TCCCTTTCAG GTATTTTCAA ACCCAGATTT GCTATGCGGG GCTGTGCTAC AGAGAGTATG TGCTTTACCA AGCCTGGTGC 000560
000561 TGAAGTACCC ACAGGCACCA ATGTCCTCTT CCTCCATCAT ATAGAGTGCA CTCACTCCCC CTGAAAAGCT ATCTGAACAG 000640
000641 AGGAAGATAA TGTAGTGTGA AGTCCCCATT TGTCCTCAGC CTGTAACTCC CCGTGTGCCT ATAAAGAAGT TAATAGAGCA 000720
000721 AGCCTGAGTC CTTGTGGTGT GATGTATCTT GATGAGATAG AATGCAGGAA ATAGGGGGTC TCAATCCTCA TTTCAAATCC 000800
000801 CCTCCTTTTG AGGAAGAGAG GGAAGGCCCT GCCTTCTTTC TTTCTTTCTT TTTCAGACAG GGTCTCACTC TGCCACCCAG 000880
000881 GCTGAAGTGC AGTGGTGCAT TCAAGGCTCA CTGCAGCCTC GACCTCCCGG GCTCAGGTGA TTCTCTCACG TCAGCCTCCC 000960
000961 TAGTAGCTGG CACTACAGGC ATGCACCACC ATGCTCGGTG ATTCTCCCAC CTCGGCCTCC CTAGTAGCTG GGACTACAAG 001040
001041 CGTGCACCAC CATGACTGGC TAATTTTTTA ATTTTTTTGT AGCAACAGGG TTTCACCACA TTGCCCAGGC TGGTCTTGAA 001120
001121 CTCCTGGGCT CAAGCGATCC TTGGCCTCCC AAAGTGCTGG GATGACAGGT GTGAGCTACC ATGCCTGGTG GCCCTGCCTT 001200
001201 TTATGAAGGG ATTAGTCTTT GCCAGGTCCC TGTGCTGTGT TTTCCATGCT ATACCTCATT TCATGATCAT AACAACTGTG 001280
001281 AGGTTGATAT TATTATCCCC ACTCTACAGA TGAACAAACT GAGGCATAGA TTGTGGGTGT CAAAATTTGC CTTGGGTCAA 001360
001361 ACAACCAGGG AGGGTCAGAA CCTTGATTTG AACCCGAGTC TGGTATACTT CAGAGTCCAT GATCATCTCT CTCAATTAAC 001440
001441 TGCTTCCTCT TTTTGCCTCA GTGTTCATGT CTGCAAAATG TAGATATTAA TAGTTATAAC CTCATAAGGT TGTTGTAAGA 001520
001521 GTCAAATGAA AGACAACTTA AAAAGCGCTT AGTGCAATGC CTAGCACCAC ATGTGGATTC TAAAGACAAT GTATGCCTGA 001600
001601 GTAAACAGTA ATCATCATAA TAGCTGCCAC TTACTGAGTG GCCATTAGTT GACAGGTGCT AAACACTGAC ATCAATTGGG 001680
001681 ACAGACTAGA TGTTACAGTA ACAAAACAAG TCCCTGATCT CAGTGGTTTA AAACAACAAA GTTGTATTAA CTAACACTGT 001760
001761 AATAATGCTA TGTAACAAAT CATCCTAAAA CTGAGACTTC TTTTTTTTTT TTTTTTTTTT GAGACAGAGT CTTACTCTGT 001840
001841 CTCCCAGGCT GGAGTGCAGT GAAACCATCA CAGTTCACTG CAGCCTCAAC CTCATGGGCT GAATCGATCC TCCCCGCTCA 001920
001921 GCCTCCCTGG TAGCTGGGAC TACAGGCATG CGCCACCATG CCCAGCTATT TCCTGTAATT TTTTTGTAGA GACAGGGTTT 002000
002001 CCCCATGTTA CCCAGGCTGG TCTCAAACTC CTGGGTTCAA GTGATCCACC CACCTCGGCC TCCCAACATG AAAACTGAGA 002080
002081 CTTATACCAA CAAGCATTCT TCTAAGGGTT TGTGGGATGG CTGCTTTGGC TCCAGTTCGG GAGGATTCAG GTCTCTTCCA 002160
002161 TGAGATTCCA CAATTTTCTG GGATCAGCAG CTCCCTCGCA AATGTTCTCA TGGCACATCA CGGGAGCACA AAAGCCAAGT 002240
002241 GCACTTAAGG GCCTTGCTAA GGTTAGGTCT GCTAAAATTC CATTAAGTAA ATCATATGGT GAAACCCATC AGTGGAGCAG 002320
002321 GGAAATATAC TCCCCCTCCA GTAGAAGGGA GAGGAGTGGA TATTTGCTGA ATGACTGCCC ATTCACAAAA GGTAAACTTT 002400
002401 TCCTTCACCC TAGCACTATC CAGAGGGTGT TTTCCTCGTC ATAGTCATAA ATCAGCCACC GGCAGGGCAC GATGGCTCAC 002480
002481 GCCTGTCATT CCAGCACTTT GGGAAGCCGA GGCAGGAGGA CCACTTGAGC CCAGGAATTT GAGACCAGTC TGGGCAACAT 002560
002561 GGCAAAACCC TGTCTCT
[back to top]

Predicted Small Protein

Name NONHSAT066604_smProtein_2159:2443
Length 95
Molecular weight 10616.019
Aromaticity 0.117021276596
Instability index 53.8553191489
Isoelectric point 10.5470581055
Runs 15
Runs residual 0.0307585568918
Runs probability 0.048401342519
Amino acid sequence MRFHNFLGSAAPSQMFSWHITGAQKPSALKGLAKVRSAKIPLSKSYGETHQWSREIYSPS
SRRERSGYLLNDCPFTKGKLFLHPSTIQRVFSSS
Secondary structure LLLLLLLLLLLHHHHHHHHLLLLLLHHHHHLHHHLLLLLLLLLLLLLLLLLEEEEEELLL
LLLLLLLEEELLLLLLLLLEEELLLLEEEEEELL
PRMN -
PiMo -