NONHSAT004288

From LncRNAWiki
Revision as of 19:31, 16 October 2014 by 124.16.129.48 (talk)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT004288

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3344 nt

Genomic location

chr1+:87597609..87602352

Exon number

4

Exons

87597609..87597942,87598347..87598546,87599297..87599401,87599648..87602352

Genome context

Sequence
000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080
000081 CAAGGTCAGG CGTCCTGCTG GCGCCCTCTC GCCACCCTGC CCTCCCGTCA GAAGCCCGGC TCCTCGCCGG GGAAGGCCGG 000160
000161 ATGCTGGCCC GCCGGGACCT GGGACTTGTG CCACATGGAG TGTCGGGAGT CTCCATTGCC GCGAGTTCTA CACCACAGGG 000240
000241 CCAGGCTGTT TGCTCCCCAT CGGTCGCTGC CCCCAGCACC CTGTTGTTAT TAAGGACTCA TTTGCTTGGA GCGGCATCAT 000320
000321 TACAAGGGTG TGGGGTCGCG GCTCCTTTGT CCAGGACTAC GCAGGGGCTT GACCCCAGGG CGCTGGTTTA GGCCGGATCT 000400
000401 GGGGTCCCTT GTCACTCCCA GGCTTCTTCC ACTTCCGAAT TCTGGAGAAC CGGGAATCAA GCCCTGCGCG TTCCTCTTCT 000480
000481 TCCTCCTTCG TGCCGAAAGC ACGCTTCATG TCTGCCAGGG CATCAGTTCT GAAACACCAG ATACTGCTAC ACCATCTCAT 000560
000561 CGGCATGGAC CTATATGTGG CAGCAAGTCC ATCTCATCGC TTTTGGTGAA AGTCAGTCCA GTTTGTAAAG ATTCTCATTG 000640
000641 TCACTGTAGA CGAGGAACTG GGACACCAAA AGGAGAAACT CTGGCCACGC TTGCACCCTG TTCCCAATCC TGGTCCAGTG 000720
000721 TCACCCACAG ATGGTAAGGA GCTCTAGAGA CCTCACCAGC CCCTGGGATT GGTCACCTCA CTCTTCTATG GACAGAGATT 000800
000801 CCTGCTGGGA TCCTTTGAGG GCAAGCAGAC CCTTCTTCCA GCTCGGACTG TGAACTCCAC TGCAGCCGTA AGGACTGTCT 000880
000881 GTGACAGTGA GCCCGAGATG ACTGGGCTCT GTGCTCCCTC CCGGCCCTCC AATCCTTGGC CTGCCACAGA GAACTGAGCT 000960
000961 CTTTTATTAG CACCATGAAT GTGACTGATA CAGCTAGCCA TTCCCTTGTG CGAATGACTC AGTTTATTAA TGCTCTGCTA 001040
001041 AAGATGGCTT CTTTGCTTGC CAGCAGCCTT AAACAGTATT TCATTAAAAC TGGCTTAATT ATTTTGAGAA GACGGCCCAA 001120
001121 TTAAAAGCTA TACACTCCCT CTATGTGAGT GTTTATACAT AGAGCTGTAT ATATAATACA TATTTGTAAG TGTGTATATA 001200
001201 TATATGTGTG TATGTATGTG TCTATAAATA TATAGGCTTA GCAATTTCAT TACATGGGAT AAATTGTTGG AAAAAATACC 001280
001281 CAGGAGCTGG TCCCCTTTCT GTTGCTAGAT TCAGAGTAGA GGCCACCCCT CCACTCTGGG AGAGGCTGGT GTTGGTGATC 001360
001361 TCTCAATGAC TCTGCAATGG AAGTCCCAAC TGCACAGAGC CCTGCCCCAG TTTCAGGAGC CAGCAGCCTC GGAGAGGCGG 001440
001441 ATCCTGACCT CTGCTCTGCT CTTGGGATAG CCTTTCCCTT CCCAGCAGGG TTGAGATACT TGGGCCGGGA AATGTTGTGG 001520
001521 CAAAGTGTTT GCCAAAGCTC AGGAGAGACA CAGACTTGGG GCTTTTGTTT CTTGAGCTGG CTGTCTAGCT TTCCTAATGA 001600
001601 GCAAATATGT TCTCTTTAAG GAAACAAACA AACAAAGCAA AAACACCAAT TCATCTGGAT TTTATTCATT TGTTTTAAAT 001680
001681 ACAAACAAAC AAAAGGAGAG TGGTTATTTC TGCACCAACT ATTTCAAATG CAAGTTACTC CATCGCTCGG GGTGGTTGGA 001760
001761 TGGTGCTTGT CACCATAGGA CCCACAGGGC TAGTTCCAAC TGTTATTCGG TAAGGCTTTT TTCTTTCCAA AATTCCCAGT 001840
001841 GTTCCTTTAA GGCCCATTTA GCTGCGGGTT TTGTTTATTC TCCCGGCAAT CAGCATTTAA AATAAGACAA ACAAGCATTT 001920
001921 TTTCCTGGGC TGTGAATCCC CCCGGCCAGC CTCCACCTGC ACACCTGAAG CCAGCATGTC CAATCAAATT TCTCTGTAAC 002000
002001 CCATATCCCC TTTAGAGACT TGCCCCCGTC GTATACCAGG CTGGAAATAG AGAACTTAAG CAGGGCAAAT GTAATTTTAA 002080
002081 GAATTGCTAA TGATGCTAGA AATCTGCAAT GCAATTAGCG TCATTGGATT TGGCGCTCCT CCGAAGGCAC AAAACTCCTT 002160
002161 GTCATAGCGC AGTGGCAGCA GCGGCAAGTG CCTCCGCATG TGCCGGGCTG TCCGGGTATG CTGGCAGCCG CTTTGCACTG 002240
002241 AGATGTGAGC AGTTGGTTAG GCTTCCTCTC TTTCTTTCTC ACAGATACTG ACTTCTTTGT CTCTTTTCTG GGTTGCAGAG 002320
002321 GGATGGGTAT TTTCCATTGA TTATTACTTT AGCATTTGAC CCTCCAGTGG AGTCACCCTG TTTTTTTTTT AGAAAACTGA 002400
002401 GACTCTCACT TTGTGAATTC ACTGTGCTCT CTGGGATTTC AGTGCTGTAG TTCAACCACC AATCCCCCTG TCCTGAACTC 002480
002481 CAGTACTTCT GATGCTATTA ATTGGTTCCT CAACAATTGT GGCCTTTTCC ATCATTGCCC ACCATAGTAT ATACTTTTTC 002560
002561 TTTCTCTCTC TTTTTTCTAA TTTCCTTGTC TTCTTCACTC TCCATGGAGC CAGAGGTAGT ATGAAGAGTT AAAAATAGGA 002640
002641 ATATAAAGAA AGCCAGAGGG ACAGAGGGAG TGAGAAAGAA AAATTTTAAA AAGGGAGGAA ATGAATTATT GGATTAAAAA 002720
002721 TAAACTTTTA CTTTTTTGCA GAAAAATTAT TTTTGCTCTC TGGGAAAATA ACATGGGCCA GGCATAAAAA GCATGTCAGC 002800
002801 TGGCTAAAAG ATTGCAAAAT CCAGAAGATG ATCTCGATGT GTCTGTTCAA TTTAGCAAGG GTATCTACTA GGGGATCCTC 002880
002881 TTTTAAATAT GGAGGCCCAA ATCAGAAGCT TGTAGAGGGG AGCTATTCTT CCAAGATTCC AGATGTGTCT GTGAGACAAC 002960
002961 ACGTTATGGG GCAAATTGAT TTCACCCTTG GGAAACCAGG GAGATTTTCA AAGTTATGTC TGCAAAGCCA GCTAATGCAA 003040
003041 TTCCCCATTA GTGCATTAAA GTGCGCCCTT ATTAATTCAA ACATAAAGGC AACAAAATAA GCTTTTAAAT TTAAAATATA 003120
003121 ATACATATAT AATGAGCATG TGTGAAAGCC TTATTCAAAT GAAAATACAG GAGTGTTTGA ACTACTGAGG TATCTTTTGT 003200
003201 ATTGAATTAT GAGCATATGT AATAGATTTA ATTATTAATT TCCCCATTGT TCTATGCACA CAGACAGGGT TCAAGGCACA 003280
003281 GTCATTCTCT GGCTTTCATA GATCTAATTT GTATAATTAT TGCCTGAATA AAAAATTGCT CCAA
[back to top]

Predicted Small Protein

Name NONHSAT004288_smProtein_1376:1618
Length 81
Molecular weight 8560.5923
Aromaticity 0.1125
Instability index 58.03875
Isoelectric point 4.00238037109
Runs 11
Runs residual 0.00131578947368
Runs probability 0.0473896356249
Amino acid sequence MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQT
WGFCFLSWLSSFPNEQICSL
Secondary structure LLLLLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLHHHHHHHHHHHHHHHHLLLLLLE
ELLHHHHHHLLLLLLLEELL
PRMN -
PiMo -