NONHSAT125838

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT125838

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

2838 nt

Genomic location

chr8+:29779029..29811123

Exon number

3

Exons

29779029..29780165,29780553..29780837,29809706..29811123

Genome context

Sequence
000001 CAGAGGACAC CATTGCCTGG AGATGACAGA AGGCTGAGGA TGGAGAGAGG CACTGATAGA GGCCAGACCA GACAAGCAGT 000080
000081 GCCAAGTCCA TAGTGGGACA CCAAAGGGAG GAGGCAATCC CAGATGAGGT CCATCCCAGA ACCAGACCTT GCAGGGACCT 000160
000161 CAAGGAGCTA TGAACATAGA AACCCTATAC ATGATATCAC TTGAATCCCT CCACAAGGGC TTCTGATATT GTCTTAGACC 000240
000241 TTTGGAAAGT TCAGGGGTAG TAGCCTCTGG GCCCATAGTG CTCCCCTGAT GATGCAGCCC CACCCTCAAC TCTTGCCACT 000320
000321 GCAGATGCTC CTTGGGCAGT TTCATCTGGG CCTTTCACAT CAGCTCCCTC CAAATGTGAC TGGACCTCAA ATGGATCTTC 000400
000401 AGACCCCCAT GCCCAACCAC CTCCCAGCTA CCTTCCCTGG ACAACCCACA GGTCACATGA TCACCAGGAA GCCCATGTCT 000480
000481 CTGCATGATA ACCTGGAGAA ACCTACAGAT GCCAGGCTTC TGAATCTCAT TCACCATGCT TCCCAGGGAT CAAGGAAGAA 000560
000561 ATACCCAGAG ATACAGACTG AAAAATCAGG AAACTGGATG GGATTCAGAG CCCTTGGAAT GGAGCCTTCT CATTCATTCA 000640
000641 CCCAACTCCA GTCTAAAACG TGCAGCTCTG AAGCAGGGCT GCTTCATAGA GACTTGAGTA AATAGCACAG TAAAGCCCAG 000720
000721 GTTTTACCAG TGAGGAGGAG AGTAAATACA CTGATCATTT GCAAGAGCAC CTCTTCAGAT ATGGCACCTC TCCTTGGAGA 000800
000801 GAGGAGACTA TGGAAGACAG GGAGACCATT CTAGCTGAAG CTCACACCAC TTACACAACA AAAGATCTAA GTCGAAAAGG 000880
000881 GAGAGAGTAA AGACCTGGCT CACATGGGTG TGAAGTGTGC ATGTGGAAAC ATTGCATCTT TAACATTTCA CAACATCTTG 000960
000961 TAGCAGGCTT ACTATGGCAG GAAGGACAAG TGTGGCTGGC CCAGCCAGTG CTGTCCTGCT CAGACCTTCC TAGGCTTTGC 001040
001041 AATGCTGGGA TAATAGGGAA GCCACACTCA ATGAGCATCT CTCAGCCTCA CATCTCCTGG TCAACCCAGA ATGCCACAAC 001120
001121 CACAGGCCGA ACCACAGATC TGGAGGCTGG GAAATCCAAG AACATGGCAC TGGCATCAGG CAAGGGTCAG CCCATGCTGG 001200
001201 AGGGCAGAAG GCAGAAGTGA GCATGTGAGA CAGAGATAAG AAATTAGGCT GAACTTGCTT GTATAGCAAA CCCAGCCTCT 001280
001281 CAATAACTAA CCCGCTCCAG CAATAATGGC ATTAATCCAT TCATGAGGGC AGAGCCCCCA TGACCTATTC ACATCCTAAA 001360
001361 GGCCCCACAT CAATCATATC TCAACATGAG GTTCAGAGGA CACAAACATT CAAACCATAG CAGTCGGTAG AGAGCATCAA 001440
001441 CATATATTGT GGAGCAGGAA AGCAAACGTC CAGAGAGAAT GTCTATGATG GTACCCAATG AGTCTAGGAC CAGGATTCTT 001520
001521 ATGCTATGGT CACTCATATT TTTGAGTTTC ATCAAATGAA TGTCCTTCAC ATCACCAGCA AGCAGCTTCC ACCTTCTGGA 001600
001601 TCTTCAGAAT GGTCTTAAAG ATGAGGGATG GAGGCCAGGC ATGCTGGCTT ATGACTGTAA TCCCAGCAGG AAGCTAGGAG 001680
001681 CCCAAGGCAG GAAGATTGCT TGAGGCTAAG AGTTCAAGAT CAGTTTGGGC AACACAGCAA AAACCTATCT CTACAAAAAC 001760
001761 TAAAAATAAA AATAAATCAC CTGAGCATGG TGGTGCATGC TTATAGGCTG AGGTGGCTAA GGTGGAAGGA TTGCTTAAGC 001840
001841 CCAGGAAGTT GAGACCGCAG AGAGCCATGA TAGCACCATT GCACTCTGAG CAACAGGGCA AGACCCCAAC TCTAAAAAAA 001920
001921 TAAATAAACA AATAGAGATG GAAGGAAGAA TTAATGGATA AAGTTTTGAG ACTATCCCAC TTATAAAAGA ATTATTTTTA 002000
002001 TTTCTTGAAT ATAAGAAAGA CAGGGTCTTG CCATATTGCC CAGGCTGGTC TCGAATGCCT GGGCTCTATC AATCCAACCA 002080
002081 TGCCCAGTTA ATGTATTTAT TTATATTTAT TTTTTGTAGA GATAGGTTCT TGCTATGATG CCCAAGCTGA TCTTGAACTC 002160
002161 CTGGCCTCAG TTGAGCCACT GTGCCCAGCC TTAGGAGAAT TATTAAATGT ATAAGTAAAT GTGATTCAAT GCTCATACCT 002240
002241 CTCTAAACTT ATTTCATAAA CACATTCAGA ATCATAAAAT GATGGTTCAC TTTTTTAACC CACATTTGTA AAATATAGTA 002320
002321 AACATAGCTA TAACTTAATT ACTATCTTAG CTTGGGTGTC CCTGAAAGCA GAGCTTGAGG CAAAGACTTA CGTGTGATTG 002400
002401 TTTTCCTGAG ATACATGATT CCAAGGAGAG GATGTGAATG ACAGTGTCAG CAAGTGAGCA AGGAGGAAGG GTCCCTAGAA 002480
002481 ATGGGCATTG GCAAGTTGGC CTCCACTGTG GGTAATTGGC TTCTCAGTCC TGAAGGACTA TTTGACAAGC CATATGAAAT 002560
002561 ATGTTTTAGA ACTGTGAATA GAGGAAGAAA CATTTATTCA CCAGTTTCCA AAACCCATTG ATCAAAGGTC ATGCCCCAAG 002640
002641 TGTTAACACC CCCAGTGCCT TCCAGCTTAT GCATATGTGA GCCCTGGTGG GTTTCCAGGC ACCTGGCACC CAGACTGCAA 002720
002721 TGGCAGAGAA GCCCAGGGTG GGATGTTAGA GGGCATAGTA AGGGTCAGAG GCATCTGCAG GAAGGAACTG ATGATCCGCA 002800
002801 GCATCTTGTC ATATTCTGAG AACTTATTAA AAGCAAGC
[back to top]

Predicted Small Protein

Name NONHSAT125838_smProtein_920:1219
Length 100
Molecular weight 10867.43
Aromaticity 0.0505050505051
Instability index 45.7333333333
Isoelectric point 9.57000732422
Runs 16
Runs residual 0.027926322044
Runs probability 0.031503930401
Amino acid sequence MWKHCIFNISQHLVAGLLWQEGQVWLAQPVLSCSDLPRLCNAGIIGKPHSMSISQPHISW
STQNATTTGRTTDLEAGKSKNMALASGKGQPMLEGRRQK
Secondary structure LLLEEEEEHHHHHHHHHHHHHLLEEEELLLLLLLLLLHHHLLLLEELLLLLEELLLLEEE
LLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLLLLLL
PRMN -
PiMo -