NONHSAT100204

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100204

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3281 nt

Genomic location

chr5+:6583415..6588868

Exon number

3

Exons

6583415..6583914,6585888..6586140,6586341..6588868

Genome context

Sequence
000001 TAAAGATGGG GCGGCGCGGC CGGGGCTCGC CAGCACCAGC CCTGGAACCC CATGGGGGAC CTCGACAACA GCGCCTTTGC 000080
000081 GGCGTGGGCC CCTTTCTGCA GCTCCTCCAC CCGCGCGCTG TCGGCTCCGG GACACTGGGG CTTTGAAGGG CGCCAGTTCC 000160
000161 CAGCCGCACC TGCACGCCTA GGAGGGCGCA GGCCGCGTGT CCCTGTGGGA GGAAGGCCCC GGCAACCTGC AAAGACCTCA 000240
000241 CGTCACAGAA ACTTGCGGCG TTTGCCCCCG ATGCGCGCGT CAGGTGCTCC GTCCCTTCCA GGTGGCAGAG GAGAGGGAAG 000320
000321 GGTCGCGCCT GGTGGGGGCC TGAGAGCCAC CGGGCTGCGG TGCGCGAGTG TGGACACCGC CCCGCCTTCT GCGTCTACAG 000400
000401 CCCTGGGTTC TGGGACCTTT GGAAACCGCT TTGGACGAAG CTGAGCATCT CTTTCCCCAC GCTGACCTCC CCCATAGACT 000480
000481 CCGGATGGGA GTAAATGAGG TCCTGGGAAA TCCAGAAACT CTAGAGAAGC TTGTTTGCCA AACCTTGCTT TGGGCGGTGG 000560
000561 AGACAGAATA TCATGTCCCA GATCAGCGTG TGGTGACTGT GGAGGCATTT ACACACAGCA CACTCCAGCT CTGCCACTGG 000640
000641 GCCGGCTGAG GTGGCTGCGG TACACGGCGC CCATCCTGAG CCCCAGGTTC CTCACATGTC AACAGCATGA AAAGTGCCTC 000720
000721 TAAGACATCC ACAGGTGTGC AGTGTTAAAG CAGGCAGCGC CTCAGCTTGG TTAGCTCCCT TAAGGACTGC GGCTCCCTTA 000800
000801 GGACAAGGCT CTGTCCCTCA GAGCACTGAG TGCGATCTCA TGCTGGCATC TTAGAGACAT TGTATTCACC AGGTAAGATG 000880
000881 GATTCACATC TGCTGGGTGC ACGAACACTT CCTCCTTCCT TGCCACCAGC CAGCATCACA ACCAGCCTGG CCAGGCTCCC 000960
000961 GTCCCAGCCC AGCCTAGCAG CTAACCCCAG CTCCATCCCC CAGCAAGCCC CTCTGCCTGC ACCTCAGCTT CTTCATGGAA 001040
001041 GACGAGTCAG GTCATGATGC CACCAGGGAG CGTGTTTCCA TGTTGAACTG AGACACTAAG CCCTTGACAA ATGTTGGCCA 001120
001121 AGAAAAAAAA GGGGCTCCAA GAGGGGAGCT GTGGCTCAGG TAGAAGACGG TGAATGAGCA TTGCCTGAAT CAGAGTTAGC 001200
001201 TACTCTGGTT ACACAGGTGT CCCCGGCAGG TTGGTCAAAA GCCCTTCCCT GAGTCCCGTG ATAAGTGTGG GGAGGGAGGC 001280
001281 AAATCAAAGA CCCCTACACA CCTCCTGCAT GGATGGCCAG AGCGCTGGTC CCACCTCCTT CCTTCCAAAC TGCAACCCCG 001360
001361 ATCAGCACAG CAGAGAAGTC TCCTGTGGCA GGGACCAGTA ATTCAGTGCT TCCCCTGGAG AGTCTGGGAT TGCACAGAAC 001440
001441 ACCAGAGGCT GGGGATTTAT TTGTTCACCA Acattcattc aacaaacact tactgaggcc aacactggga atggagacga 001520
001521 acttccctgt cctcatggag cttgctttca aaagggaagg aggcagacag taaacaaaac aagcaagtca atatacagca 001600
001601 tgatgcagaa aaggtgtgat gCCACCCTCT TGCCTGAGTT TATAACAAGC AGATGATGTT GTCCTAAGAA TTAATTTCAG 001680
001681 CCGGTGAAAG AGTACAGGCA AAAAACGTGG TCTCGGACTT TCCGGTGATG CTTAAGAACA TAGTTGCAGT TTTTGTTCTG 001760
001761 ATTCTTTGTG CATCTCGCGC TAATGAAGGA TTCTGCCATG ATTGCCACCT CTGGAGACGT CACAGCCAGC ACAGCCATGA 001840
001841 CCAGGGATGA ACTTGGCCAT TGTCCAGCAG CACCATGGGA GGACCAGCTG CTGTAAGTGA AGGTCATCAT CCATCTGATT 001920
001921 TGCTGGCTTT TTGCAACTAC TGCGGTGGAA GTCTTAGAGT AGCCACCTTC TCTGTACTCA GACCTTCACT TCGCCAGATG 002000
002001 TCAGCACAGC AGTGTCAGCA GCTTCGTCAA ACACAAACAC CACCACCTCT CTGGATCAGA AGCACATGGA AAGCCCACCT 002080
002081 TGCAAGAAGA CCCAGCTATT CCAAAAGTAC CCATATCCCT GTAGCATCCT CTTAGCTGAA TGTCTCTACT CCTTAAACTT 002160
002161 GTACCACTTT GAAGAGTAAA AATTTAAATT CATTCATATT ATCATTTTTT AGTTTTTAGT ATAAGTGGAG Ctgtttatcg 002240
002241 gccatttgta tgtccttttc ttttttttgc aaattactcc tttgccaaat tttctctcaa atgactcatc tttttcttat 002320
002321 tgatttgcag aagctattta catattatgg atgtggggtg tctaacatac acagcaaata ctttgtccca gtgtgtcact 002400
002401 ttttaaaaaa attcttataa tgtctttCAT CTTTAAAAAA TTATTAAGCT ATTGTGAGCT TTTATGCCTT TTCAGTTTAT 002480
002481 TTTCTTACAT AGATAATTAT ATGAAGTGCA AATAGGGGCA GTGTTAGTCT CATCTTCCCA AGGCTGCACC TCAGATGTGG 002560
002561 AATGTTGACA GCAACATCTG TTACCTTGTT TCTGTTTTTA GGATTTTAGC GTTAAggcca ggcataatgg ttcatgcctg 002640
002641 taattccagc actttgggag gccgaggtgg gcagatcact tgaagccagg agttcaagac cagcctggcc aacatggaga 002720
002721 aaccctctct atttttaaat acatttaaaa tGTTAACAAT TAAAACAAaa tattttaccg ttagtatgat gcttgataca 002800
002801 ggatttctgg tacatacgcc ttatcaaatt aaataagttt ccttttactc ttgtttattg ttatttttta aaataaggaa 002880
002881 tgaatcttgg attttaGTGT ATCAATAATT ATATTTTATT CTTTAACTTC ATAATATTCC TTATAATTCC TTATGTCATA 002960
002961 TTAAAGATGT AAATTCATAT ATATTTTATC ATACAAATGA TTAATAAATG TATAAGCATG TCATTATATG AGAGTATACT 003040
003041 TCCTTATGTT TCATGAATGC ATCGTGACAG GCTCattttt ttaatcactg ctcgatttgt gccactgatg ttttatgagg 003120
003121 cctatttatg tcggcgatgg taagagtgtt agtgtttcct ctgtgggtac cttcatctac cgtgagtctc ctggatatgc 003200
003201 tagcatcaca gaatggcagg cggaaatttc cagctttttc tatagaatga gtttaacaca gaaaatagat ctttcttgaa 003280
003281 a
[back to top]

Predicted Small Protein

Name NONHSAT100204_smProtein_1874:2134
Length 87
Molecular weight 9445.7083
Aromaticity 0.0697674418605
Instability index 75.4465116279
Isoelectric point 10.4332885742
Runs 11
Runs residual 0.0117146824019
Runs probability 0.032567326685
Amino acid sequence MGGPAAVSEGHHPSDLLAFCNYCGGSLRVATFSVLRPSLRQMSAQQCQQLRQTQTPPPLW
IRSTWKAHLARRPSYSKSTHIPVASS
Secondary structure LLLLLLLLLLLLHHHHHHHHHHLLLLEEEEEEEELLHHHHHHHHHHHHHHHHHLLLLLLE
EELHHHHHHHHLLLLLLLLLLLLLLL
PRMN -
PiMo -