NONHSAT004289

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT004289

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3137 nt

Genomic location

chr1+:87597609..87602352

Exon number

3

Exons

87597609..87597935,87599297..87599401,87599648..87602352
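The exon intervals can be checked against the reported transcript length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); the helper names `parse_exons` and `spliced_length` are hypothetical and chosen only for illustration.

```python
def parse_exons(exon_field):
    """Parse 'start..end,start..end' into a list of (start, end) integer pairs."""
    exons = []
    for part in exon_field.split(","):
        start, end = part.split("..")
        exons.append((int(start), int(end)))
    return exons


def spliced_length(exons):
    """Sum exon lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in exons)


# Exon field copied from this record.
exons = parse_exons("87597609..87597935,87599297..87599401,87599648..87602352")

print(len(exons))             # 3
print(spliced_length(exons))  # 3137
```

Under that assumption the three exons span 327, 105 and 2705 bases, which agrees with the exon number (3) and the length (3137 nt) given above.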

Genome context

Sequence
000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080
000081 CAAGGTCAGG CGTCCTGCTG GCGCCCTCTC GCCACCCTGC CCTCCCGTCA GAAGCCCGGC TCCTCGCCGG GGAAGGCCGG 000160
000161 ATGCTGGCCC GCCGGGACCT GGGACTTGTG CCACATGGAG TGTCGGGAGT CTCCATTGCC GCGAGTTCTA CACCACAGGG 000240
000241 CCAGGCTGTT TGCTCCCCAT CGGTCGCTGC CCCCAGCACC CTGTTGTTAT TAAGGACTCA TTTGCTTGGA GCGGCATCAT 000320
000321 TACAAGGCAC CAGATACTGC TACACCATCT CATCGGCATG GACCTATATG TGGCAGCAAG TCCATCTCAT CGCTTTTGGT 000400
000401 GAAAGTCAGT CCAGTTTGTA AAGATTCTCA TTGTCACTGT AGACGAGGAA CTGGGACACC AAAAGGAGAA ACTCTGGCCA 000480
000481 CGCTTGCACC CTGTTCCCAA TCCTGGTCCA GTGTCACCCA CAGATGGTAA GGAGCTCTAG AGACCTCACC AGCCCCTGGG 000560
000561 ATTGGTCACC TCACTCTTCT ATGGACAGAG ATTCCTGCTG GGATCCTTTG AGGGCAAGCA GACCCTTCTT CCAGCTCGGA 000640
000641 CTGTGAACTC CACTGCAGCC GTAAGGACTG TCTGTGACAG TGAGCCCGAG ATGACTGGGC TCTGTGCTCC CTCCCGGCCC 000720
000721 TCCAATCCTT GGCCTGCCAC AGAGAACTGA GCTCTTTTAT TAGCACCATG AATGTGACTG ATACAGCTAG CCATTCCCTT 000800
000801 GTGCGAATGA CTCAGTTTAT TAATGCTCTG CTAAAGATGG CTTCTTTGCT TGCCAGCAGC CTTAAACAGT ATTTCATTAA 000880
000881 AACTGGCTTA ATTATTTTGA GAAGACGGCC CAATTAAAAG CTATACACTC CCTCTATGTG AGTGTTTATA CATAGAGCTG 000960
000961 tatatataat acatatttgt aagtgtgtat atatatatgt gtgtatgtat gtgtctataa atatataGGC TTAGCAATTT 001040
001041 CATTACATGG GATAAATTGT TGGAAAAAAT ACCCAGGAGC TGGTCCCCTT TCTGTTGCTA GATTCAGAGT AGAGGCCACC 001120
001121 CCTCCACTCT GGGAGAGGCT GGTGTTGGTG ATCTCTCAAT GACTCTGCAA TGGAAGTCCC AACTGCACAG AGCCCTGCCC 001200
001201 CAGTTTCAGG AGCCAGCAGC CTCGGAGAGG CGGATCCTGA CCTCTGCTCT GCTCTTGGGA TAGCCTTTCC CTTCCCAGCA 001280
001281 GGGTTGAGAT ACTTGGGCCG GGAAATGTTG TGGCAAAGTG TTTGCCAAAG CTCAGGAGAG ACACAGACTT GGGGCTTTTG 001360
001361 TTTCTTGAGC TGGCTGTCTA GCTTTCCTAA TGAGCAAATA TGTTCTCTTT AAGGAAACAA ACAAACAAAG CAAAAACACC 001440
001441 AATTCATCTG GATTTTATTC ATTTGTTTTA AATACAAACA AACAAAAGGA GAGTGGTTAT TTCTGCACCA ACTATTTCAA 001520
001521 ATGCAAGTTA CTCCATCGCT CGGGGTGGTT GGATGGTGCT TGTCACCATA GGACCCACAG GGCTAGTTCC AACTGTTATT 001600
001601 CGGTAAGGCT TTTTTCTTTC CAAAATTCCC AGTGTTCCTT TAAGGCCCAT TTAGCTGCGG GTTTTGTTTA TTCTCCCGGC 001680
001681 AATCAGCATT TAAAATAAGA CAAACAAGCA TTTTTTCCTG GGCTGTGAAT CCCCCCGGCC AGCCTCCACC TGCACACCTG 001760
001761 AAGCCAGCAT GTCCAATCAA ATTTCTCTGT AACCCATATC CCCTTTAGAG ACTTGCCCCC GTCGTATACC AGGCTGGAAA 001840
001841 TAGAGAACTT AAGCAGGGCA AATGTAATTT TAAGAATTGC TAATGATGCT AGAAATCTGC AATGCAATTA GCGTCATTGG 001920
001921 ATTTGGCGCT CCTCCGAAGG CACAAAACTC CTTGTCATAG CGCAGTGGCA GCAGCGGCAA GTGCCTCCGC ATGTGCCGGG 002000
002001 CTGTCCGGGT ATGCTGGCAG CCGCTTTGCA CTGAGATGTG AGCAGTTGGT TAGGCTTCCT CTCTTTCTTT CTCACAGATA 002080
002081 CTGACTTCTT TGTCTCTTTT CTGGGTTGCA GAGGGATGGG TATTTTCCAT TGATTATTAC TTTAGCATTT GACCCTCCAG 002160
002161 TGGAGTCACC CTGTTTTTTT TTTAGAAAAC TGAGACTCTC ACTTTGTGAA TTCACTGTGC TCTCTGGGAT TTCAGTGCTG 002240
002241 TAGTTCAACC ACCAATCCCC CTGTCCTGAA CTCCAGTACT TCTGATGCTA TTAATTGGTT CCTCAACAAT TGTGGCCTTT 002320
002321 TCCATCATTG CCCACCATAG TATATACTTT TTCTTTCTCT CTCTTTTTTC TAATTTCCTT GTCTTCTTCA CTCTCCATGG 002400
002401 AGCCAGAGGT AGTATGAAGA GTTAAAAATA GGAATATAAA GAAAGCCAGA GGGACAGAGG GAGTGAGAAA GAAAAATTTT 002480
002481 AAAAAGGGAG GAAATGAATT ATTGGATTAA AAATAAACTT TTACTTTTTT GCAGAAAAAT TATTTTTGCT CTCTGGGAAA 002560
002561 ATAACATGGG CCAGGCATAA AAAGCATGTC AGCTGGCTAA AAGATTGCAA AATCCAGAAG ATGATCTCGA TGTGTCTGTT 002640
002641 CAATTTAGCA AGGGTATCTA CTAGGGGATC CTCTTTTAAA TATGGAGGCC CAAATCAGAA GCTTGTAGAG GGGAGCTATT 002720
002721 CTTCCAAGAT TCCAGATGTG TCTGTGAGAC AACACGTTAT GGGGCAAATT GATTTCACCC TTGGGAAACC AGGGAGATTT 002800
002801 TCAAAGTTAT GTCTGCAAAG CCAGCTAATG CAATTCCCCA TTAGTGCATT AAAGTGCGCC CTTATTAATT CAAACATAAA 002880
002881 GGCAACAAAA TAAGCtttta aatttaaaat ataatacata tataatGAGC ATGTGTGAAA GCCTTATTCA AATGAAAATA 002960
002961 CAGGAGTGTT TGAACTACTG AGGTATCTTT TGTATTGAAT TATGAGCATA TGTAATAGAT TTAATTATTA ATTTCCCCAT 003040
003041 TGTTCTATGC ACACAGACAG GGTTCAAGGC ACAGTCATTC TCTGGCTTTC ATAGATCTAA TTTGTATAAT TATTGCCTGA 003120
003121 ATAAAAAATT GCTCCAA
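The listing above uses a numbered layout: a start position, blocks of ten bases, and (on full lines) an end position. The sketch below shows one way to turn such lines into a plain sequence string while checking the counters. The function name `parse_numbered_sequence` and its `first_position` parameter are hypothetical, and only the last two lines of the listing are used as input to keep the example short.

```python
def parse_numbered_sequence(lines, first_position=1):
    """Join numbered sequence lines into one string, validating the counters."""
    chunks = []
    position = first_position - 1  # last position consumed so far
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        start = int(fields[0])
        # Full lines end with a position counter; the final short line does not.
        has_end = len(fields) > 2 and fields[-1].isdigit()
        blocks = fields[1:-1] if has_end else fields[1:]
        if start != position + 1:
            raise ValueError(f"expected start {position + 1}, found {start}")
        chunk = "".join(blocks)
        position += len(chunk)
        if has_end and int(fields[-1]) != position:
            raise ValueError(f"expected end {position}, found {fields[-1]}")
        chunks.append(chunk)
    return "".join(chunks)


# Last two lines of the listing, copied from this record.
tail_lines = [
    "003041 TGTTCTATGC ACACAGACAG GGTTCAAGGC ACAGTCATTC TCTGGCTTTC ATAGATCTAA TTTGTATAAT TATTGCCTGA 003120",
    "003121 ATAAAAAATT GCTCCAA",
]

tail = parse_numbered_sequence(tail_lines, first_position=3041)
last_position = 3041 + len(tail) - 1

print(len(tail))       # 97
print(last_position)   # 3137
```

The final base lands on position 3137, matching the length reported in the Basic Information fields.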

Predicted Small Protein

Name NONHSAT004289_smProtein_1169:1411
Length 81
Molecular weight 8560.5923
Aromaticity 0.1125
Instability index 58.03875
Isoelectric point 4.00238037109
Runs 11
Runs residual 0.00131578947368
Runs probability 0.0473896356249
Amino acid sequence MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQT
WGFCFLSWLSSFPNEQICSL
Secondary structure LLLLLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLHHHHHHHHHHHHHHHHLLLLLLE
ELLHHHHHHLLLLLLLEELL
PRMN -
PiMo -
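The predicted small protein can be cross-checked against the transcript sequence. The sketch below is written under several assumptions that the page does not confirm: that the "1169" in the protein name is a 0-based start (base 1170 in the numbering of the Sequence listing), that the standard genetic code applies, and that aromaticity is the fraction of F, W and Y residues. The names `SEGMENT`, `CODON_TABLE` and `translate` are hypothetical. Only the four sequence lines covering positions 1121 to 1440 are embedded.

```python
# Lines covering positions 1121-1440, copied from the Sequence listing.
SEGMENT = """
001121 CCTCCACTCT GGGAGAGGCT GGTGTTGGTG ATCTCTCAAT GACTCTGCAA TGGAAGTCCC AACTGCACAG AGCCCTGCCC 001200
001201 CAGTTTCAGG AGCCAGCAGC CTCGGAGAGG CGGATCCTGA CCTCTGCTCT GCTCTTGGGA TAGCCTTTCC CTTCCCAGCA 001280
001281 GGGTTGAGAT ACTTGGGCCG GGAAATGTTG TGGCAAAGTG TTTGCCAAAG CTCAGGAGAG ACACAGACTT GGGGCTTTTG 001360
001361 TTTCTTGAGC TGGCTGTCTA GCTTTCCTAA TGAGCAAATA TGTTCTCTTT AAGGAAACAA ACAAACAAAG CAAAAACACC 001440
"""
SEGMENT_START = 1121  # 1-based position of the first base in SEGMENT

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Drop the position counters and join the ten-base blocks.
segment = "".join(tok for tok in SEGMENT.split() if not tok.isdigit())

# Assumption: 0-based start 1169 corresponds to 1-based position 1170.
orf_start = 1169 + 1
protein = translate(segment[orf_start - SEGMENT_START:])

# Amino acid sequence as listed in this record (two wrapped lines joined).
LISTED = ("MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQT"
          "WGFCFLSWLSSFPNEQICSL")

aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(protein == LISTED)  # True
print(len(protein))       # 80
print(aromaticity)        # 0.1125
```

Under these assumptions the translation reproduces the listed amino acid sequence, which contains 80 residues, and the aromaticity matches the listed value of 0.1125. The listed Length of 81 would be consistent with counting the stop codon as well, although the page does not say how that figure was derived.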