NONHSAT001496

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT001496

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3565 nt

Genomic location

chr1+:23243783..23247347

Exon number

1

Exons

23243783..23247347
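
The length, genomic location and exon fields above can be cross-checked with a few lines of Python. This is only a sketch: the function name parse_location is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state but which reproduces the listed length of 3565 nt for this single-exon transcript.

```python
import re

def parse_location(loc: str):
    """Split a location string such as 'chr1+:23243783..23247347'.

    Hypothetical helper; the layout chrom, strand, ':', start, '..', end
    is inferred from the single example on this page.
    """
    m = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

chrom, strand, start, end = parse_location("chr1+:23243783..23247347")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
length = end - start + 1
print(chrom, strand, length)  # chr1 + 3565
```

With one exon spanning the whole interval, the computed length agrees with the Length field.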

Genome context

Sequence
000001 CTGCCTCACT GAAGCACAGG AAGGACCCAG GCCCCAGACC ATCACCCACC CCAGGGCTGG GCCCTGGGCC ACTCCTGGCT 000080
000081 CACTCCAGGG CCCCTGTTGT TTGAAGATGG TACCAAAGGC TGGAAGACTC TTGCTAGGGA AGATAACGTA AATGCATTCA 000160
000161 AAAGACAGGG TACCACATGA TGCTAGGGAA AGTGCGTCAT GACCGCAGGG TAGCAGCCTG CTCCTGTCAC TAGGGATGTG 000240
000241 AATCTGGGGA GACACTTCCC CTCACTGAGC CATCTGTAAA ATGAGAGCAT TGGACTGGGT GAGGCAGGCT GTGCAAAACA 000320
000321 CTTATACACC TTAGTCCCAT GAGGAAGGAG TATTCCCCCT GCATATTTTG GGGGGAAACT GAGGCTCAGG GGGATGAAGT 000400
000401 ACTTTGCTCA GGGTAACACA GCCAGGAAGG AGCAGAGCAA GGCCATGAGC TTTGGATTTT TCTGTTTCCA ATGTCTCCTC 000480
000481 TACCGTGCTC AATTAGATGA TGAGAAACAG TTGGTGTCTG ATGCTGGCCA ATAAGAGAAA AGAAAGCTCA AGTGGGCTGG 000560
000561 GGCTGTCAGG GAGGGCTTCA TGGAGAGGCT GTCTTGAAAA TGACAGAGGT CAAAAGAAAG GTCACTCCAA GACGCAGCTG 000640
000641 CCAGAATGGT CCAACACAGT GAGGAGTTGT GTCTGGATGG GCCAGTGGAA CGGGGGAAGT GAAGGTTGAT ATAAGTGGAG 000720
000721 GTTGAGGCAG TTTAGGCGTG ATGTACAGTC CAGCCACAGC AGGTTCTTGA GCAGGAGGGT AGCATAGTGA GCATCAGGTT 000800
000801 CTAGGAAGAA GCACCAGTTC AGCCATCAGA TGGGGCAGGA TGCCTCCCAG CTACTCCTCT CCCCGAGAAG GAATTGCCCC 000880
000881 CGGAGCGGCC CTCATTTATT CCAGGAGAAG CCAGGCCCTG CTGCTTCAGT TTCCATCCTC ACTGACGCAG ATAATCCAAA 000960
000961 GAGTCACTTC AGCTGCTTGG AACAGCAGGG TTGGGTCTTG TGCTACCTGA TTCTTAATTT GGCTGATGTC TACCCCATTT 001040
001041 CCTCCAGGCT TCTCTAGAGG TGGCAGGGGT GGGGTGGGGA GAAAGCTGCC AGACACCAAA GGAGCTGAGC CTGGCGGGGG 001120
001121 CGGGCGTGCG GGTGGGCATG AGAAATTGCT CAGGAAACCC ATCCCTGCCC TGGTCACCCT AGTCCTGGAG AGATCACCCA 001200
001201 GACATGTTGG AATGAGTCTG GGGTCCTGTC TTTTCCAATA ATGTGTTATC CTGGGGAAAC CCAGAGGGAA GTTGAGGGAG 001280
001281 CCCCACTGTG CTGTGTGATG CTACATACGC ATGCGAATAC ACCACCGGGT GGCCTTGACC CAGCCTTCTG CAAAATGGCT 001360
001361 TCTGCCATCA AACTTAAGGT GGAGAATGGT CTTGCAGCTG CCAGTTTTTC CTTCAGAGAA ACTAAGGTCA GAGTGTTGGC 001440
001441 TGCAGAAAAG GACTTGGCCT TTGTCACATA ACAGGAAAGA GAGTCAACTC CGGGCACACC CCCTCGGAGG CTAATCTCAG 001520
001521 GAGGTTGGAA CCACAGGGAT TGCAATAATC GCTGAATACT TACCATATGC CAGGTGCCAA AGCATTATCC TCTCCACTTT 001600
001601 ATGAGTGAGG AAACTGATTC CCAGAGGTGA TTCCCAGAAG ATTCCAACCA GGCCATCTGT CTGTAGTCCT TCCTTCCCAC 001680
001681 CTCCTGGGAT TTCTCAAAAT GTTTGCCATC GCCCACCTGT ATCAGAATCT CCTGGGCTAT TTATTGTTTA ACTTGCAGAC 001760
001761 TCCTGGGCGC CACCCCAACC CCCTAAATCA GAATCCCTGG GGTTGGGCCC AAGAGTCTCC ATTTTTTGAC AAGCTCCTCC 001840
001841 AGGAGATTCC TCCAGACATG GAAATCCTCA CTTGACTCTT CCCCCATAAC AAACTGCCCT GATGTCTGCA GAGAGAGGGT 001920
001921 CTGCTGTAGA CAGGGATTAA GAAAAAGCAG GACATTTTCA GATAGGATGC CCAAACCCAG ACGCAGATTA GGACCCATGA 002000
002001 AGTAGGGGGT AGAGCTGACC CTCAGGAATG TGCCCAGACA GAGCTAAGCT TTCAGGAAAA TTAGAAAGAC CCTCAGACCC 002080
002081 AGAGTCTGTC TGAGTTTAGC TTCGCAGCAT GAAGGCTGCC TTGAGAAAGC TGATATAACA AGAAGCAGCA GAAATTTAAT 002160
002161 ACTTTGGAGG GGCCCAGCAA CCCACTCAGG GCCACCCTGG GAGACCCCAC AGACCCCAAG GCTATGGTCC AGACCTGTTC 002240
002241 ACTCAACACC TGGCCAGGCG GGGCCTCCAG CTGGACCTAG AGCTACAGAT GCCCCACCAC CACTACCAAG CCCGGCTACC 002320
002321 CAGACCAAGA CACGGCTTTT CCTGGAGGAG AGAGAGAGCA AAGGCCCTGT CTCTACCCAA GACCTAACGG CCCCTGTTGA 002400
002401 AGGAGAGGGA GCAGGAGAGG GAGGAGGGAG GAGGTCACAG CCAGGACCAC ATACACTCTT GGGGGCCCTG CTGAGACTGC 002480
002481 AAGAGTCATG AATTCTAACG TTCCACAGGT GAAAGATTCC AAGATTCTAG GATTGCAAAA CCCCATCATT CTAAGATTCC 002560
002561 AATGGAATGA TTCCATGTTT CTAAGATTCC ATCACACTAT GATTCTATGC TGCTAATTCT TGAAATTTCG AGTCTCATTG 002640
002641 TTCGTTGCTG TTCCCCAGAC CTGTGGGCCC TAGCATTTTT TTAAAAGCAC AAAAAAAAAA GAGCAAGAAA GAGAGAAAGA 002720
002721 GATGGGGAGG GAGACAGCTG AAAACAGATA CTAAGAAAAG CTCCATCCCT TGGATCTGAG TTACAGATGC ACCTTGGCAA 002800
002801 GGGCCAAAGA CCCTCCTTTT GAGTGGGGTG CAGAACGTCT TTCCCTGGTA GAGTGGCAGA ATTGCATGCA TCAGGCCTTC 002880
002881 CTGGGGGTAA AAGGGGCTGG CTGTTCCAGG CTACAGCTGA GTAAAGCCCC ACACAGGCCA CAGTGCCCAC TGGCTGGTGG 002960
002961 ACCTAGGAAC CAAGCAGGGC CCCACTGGCT CAGTCTGGGG AGGGACTCAT CTGGGGTGGA ATTTCCCTCC CTGCAGCAAG 003040
003041 GAAGCTGCAG GGCCAGGAAT TTGCCTTGGG AGACCCCCAC TGAGGAATAT TTCCGAAGCA GAACCCCTTC CTATTCAGAG 003120
003121 CCAGAGTCTT AACACTGGAC AACCACAGGG TGTTGCTGCA AACTCCAGAG CCAGGTGCCT TCCCTCTGAC ATTTGGGACA 003200
003201 TAGCTTCCAT GGCCACACAG CCTTCGCCCC CTTCCAAGAC CCCCCTTGAC CTTTCTAATC TTAGTCACTG CCTTCCAGAG 003280
003281 CTGGGAGGCC ACACGGCAGA GGTGCCTGTG AATCACTCCG TCATCAGCCT GGCCGCTTTC CCCTCTGTGG AGAGGGACTC 003360
003361 TGATGGGCAG GGGGCACCAA GTTGAGCCTC TGAGGCTGGC CCAGCATAGG CCAGGCAGGG ACATTCACTC ACAACCAGCC 003440
003441 TTTTTGGACT CTAGAGAAGA GTAATTACCA CTTGGTCATG CCCAGCCCCC AAATCATAAT CATAGCTGCT ATTTCTTGAG 003520
003521 TGCCAGCAAT GTGCCAGGCA CCGTGCCCAA CTTTTTATAC ACAAT
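
The sequence block above interleaves position counters with 10-nt groups. A minimal sketch of how such lines could be turned back into a plain nucleotide string is shown here; the helper name strip_coordinates is hypothetical, and only the first and last lines of the block are used as input.

```python
def strip_coordinates(line: str) -> str:
    """Drop the numeric position counters and join the nucleotide groups.

    Hypothetical helper: assumes every whitespace-separated token made only
    of digits is a position counter and everything else is sequence.
    """
    return "".join(tok for tok in line.split() if not tok.isdigit())

first_line = ("000001 CTGCCTCACT GAAGCACAGG AAGGACCCAG GCCCCAGACC "
              "ATCACCCACC CCAGGGCTGG GCCCTGGGCC ACTCCTGGCT 000080")
last_line = "003521 TGCCAGCAAT GTGCCAGGCA CCGTGCCCAA CTTTTTATAC ACAAT"

first_seq = strip_coordinates(first_line)
last_seq = strip_coordinates(last_line)

# The last line starts at position 3521, so the transcript should end at
# 3520 + len(last_seq).
total_length = 3520 + len(last_seq)
print(len(first_seq), len(last_seq), total_length)  # 80 45 3565
```

The final position matches the 3565 nt given in the Length field.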

Predicted Small Protein

Name NONHSAT001496_smProtein_1025:1240
Length 72
Molecular weight 7335.4633
Aromaticity 0.0422535211268
Instability index 60.2901408451
Isoelectric point 11.3118286133
Runs 9
Runs residual 0.0217939214233
Runs probability 0.0300300300301
Amino acid sequence MSTPFPPGFSRGGRGGVGRKLPDTKGAEPGGGGRAGGHEKLLRKPIPALVTLVLERSPRH
VGMSLGSCLFQ
Secondary structure LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHLLLLLLEEEEEEEELLLE
EEEELLEEEEL
PRMN -
PiMo -
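
The aromaticity value above can be reproduced from the amino acid sequence. The sketch below assumes aromaticity is the fraction of F, W and Y residues, which the page does not state but which matches the listed figure. The sequence is 71 residues long; the listed Length of 72 appears to count the stop codon as well, since the span 1025:1240 in the name covers 216 nt, or 72 codons. That reading is an inference from the record, not something the page says.

```python
# Amino acid sequence copied from the record above (two wrapped lines joined).
protein = ("MSTPFPPGFSRGGRGGVGRKLPDTKGAEPGGGGRAGGHEKLLRKPIPALVTLVLERSPRH"
           "VGMSLGSCLFQ")

def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)

# Span taken from the name NONHSAT001496_smProtein_1025:1240.
# Assumption: both ends are inclusive, giving 216 nt.
orf_nt = 1240 - 1025 + 1
codons = orf_nt // 3

print(len(protein))          # 71 residues
print(codons)                # 72 codons, including the presumed stop codon
print(aromaticity(protein))  # 3/71, about 0.04225
```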