NONHSAT078335

From LncRNAWiki
Revision as of 09:49, 16 October 2014 by 192.168.72.52 (talk)

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT078335

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3799 nt

Genomic location

chr20+:5451842..5457780

Exon number

2

Exons

5451842..5452139,5454280..5457780
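
The exon coordinates can be checked against the stated transcript length. The following is a minimal sketch, assuming the coordinates are 1-based with inclusive ends (an assumption, though it is the reading under which the numbers agree). The function name `exon_lengths` is illustrative and not part of any LncRNAWiki or NONCODE tool.

```python
def exon_lengths(exons):
    """Return the length of each exon in a 'start..end,start..end' string.

    Assumes 1-based coordinates with inclusive ends (an assumption).
    """
    lengths = []
    for span in exons.split(","):
        start, end = (int(x) for x in span.split(".."))
        lengths.append(end - start + 1)
    return lengths


lengths = exon_lengths("5451842..5452139,5454280..5457780")
print(lengths, sum(lengths))
```

Under that assumption the two exons are 298 nt and 3501 nt, which sum to the listed 3799 nt. The 2140 nt between them (5452140..5454279) would then be the single intron.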

Genome context

Sequence
000001 ATCTCGCGGC AGTTGATTCC TAGAGGTGGA ATCCATTAAA CTGACAAAGC CCCAGTCCCG GGGTCCTAAT AGTCGGGACT 000080
000081 ATTAGGTCAT CCTGGGTACT CAGGCCTCTA GACTCTAGAC TGAGCTGCCT TGGTCACTCG GGGACAGTTG GCAGAGTATT 000160
000161 CGTGGTCAGG GAGGTGACCC GTGGTCAGCA GGATCAGGCC ACCCAGGGAC AAAGGGTGCT TCTGCGCCAG GCCCTGGAGA 000240
000241 AGGACAGAGC GGTGGGGACT CGGGGTCGGC CGCAGATAGG GGAGTCACCA CCTGCCGGCA ATCAGCCATG ACTGCCTTTG 000320
000321 CACTGTCCAT GCTCTCAGCC CACCACCTCC TCCCCCTGCC ATTGCAGTGG CTAACACTGG AGACGAAGAC CAAGACACCA 000400
000401 CCGCCTTTCT CCAGTACCAC TCAAATCAGC ACAGGCAAGG ACAAAGGCCT CAATCCACAA CTGCTGAAGA TGGACCCTGG 000480
000481 CCACATGGGA TGGTCAGACA CGCCTGCCCA GCTATCTGCA GGCGAAGAGG CTCAGAAGAG GTTTAGGGGC CTGAAGGACA 000560
000561 TCTTGCTTCC ATGTCCATAT GAGCAGGCTA TTTCTGCTCC ATGAGTCAAT TTTGCCATAT AAAATACTTA ATTTCAGCCA 000640
000641 TTCCAGGGTG CTGTAGGATG CACAGCTTCC CATCAGCCCA CCTGAACTCC AGCCATGCCA TTTTAATACC AGGAATAAGG 000720
000721 TCACCTGCTT TCCTGCCCCT TAGGAGGCCA GAGCCGTGGA AGCAAAATGG CACTTCTGTT TACCTGTTAT ATTATTTTTT 000800
000801 TGTCATCCTT ATATGCTTGG AAAATGCAAT TATATGAAAA AAGTTTAGTA ATTACAGACA TAACAGCAGA AAGTCCTCGG 000880
000881 AACCAAGCTT ACTCTCATGG CCGATTCTGC TCCACCTGGG ACTCTGCTGT GCTGCGGGCA TCCTGTGGTC AGAATCGCAG 000960
000961 AGGGGCCATC AGGGAGGGCC TTCCCAGAGG ATGGACCTCA CGTGACTGCT GCGTGGGCAA GTGGCACTGG CCACTCTGCC 001040
001041 TGGAGAGAGG AGTAAATGCA GGGCTGGCCA GGCGACCTGC ACACTCTGCT ACTGGCCTTG TCCATCTTTA GCCTCTAATT 001120
001121 TGAAAATGAG GCTCACACAG ACCAAGAGTA TCTTTGAGGG TTAGTACAGA CCACAGAAAT GCCCTGGGCC TTTCACTCTC 001200
001201 TCCTTTTGCA AATTCCCATG TGTGGAAATG CCGTTTGGAT AATGAGGGAG CCTGAAGGAG GTGGACACAT GAGCAGCCCC 001280
001281 GACAGGCCTG GCTCCATCCT CTGAAAATGG GGCCCCGTGC CCGGCGTGTG GCCTTACTGG TTCAGTCTTC TTTACAGTGG 001360
001361 TAGGTTTTGA GTGCCCAGAT GCCCAGTGCC TCCTACCTGG AACAGCACAG GATCTGGCAG ACCCCTGGAA GAATCACATG 001440
001441 CACACTTAAA TATTCAGGGA GTTCCCACCC AGCAGAGCTC GCCTCTGTGG CTACCTTGGT CTTGCTGCTG ATATCTGCCA 001520
001521 GAAAAGGCCT GGACTTGGAG ACAAGCCTGG GATTTACACT CAGTCCTTCC CCATCTGGCT GGTTCCATTT CCTTGGCTCT 001600
001601 CACTGCTGGA AGTCTGTGCT CCTGAACATC AAGTCAGAGG GGGCATCTGA ATGCAGGGCA GGGAGCCTCA GATGGGAAGA 001680
001681 AGTCAGAGGA ACCAGAATGT GTCAGAAAAT GCCAAGTCAT GTGCCTGAGC TCAAAAGTCA GCTGGGCCAC AGGCTGGCTG 001760
001761 TGTGATCTTG GGCAAGTTCA ACCAGCTTCT TTATACCTCT TTTCTTGCCT CAAAATGATA AGAGAAACCA CTTCACTAAT 001840
001841 ACACTGAGGG CTGCTATTAA GTTCTATGTA CAAAGACCCA TGGCAGGCCC TATGCCCTTC GGTGAGCACT ACTCCTCCTT 001920
001921 ACAATTTACT GCCAGGAACA CTGGGCAAGA GAACTTCAGT GGAGCAGGGA TTGGCTGAGC ATGAGCCAGG GTTGGGGGAA 002000
002001 GTAAATAATG GGCTGTTGCC AGGGCCTGAG CCCAACAGAG AAAGGCTGTG TGCAGAGGGA GGGCCTCAGG TCCTGGGGCT 002080
002081 CCTCCTGGCC TCTTTCGTCC CGACTACTTC ACACCCTCCT CTAACAACGA CTCCCACCTC CTTTTCCAGC TCTCCTTGAT 002160
002161 CCTGCTCAGG GTGGCGCCTG CTGTCCCTGG TCCCTTGGTC CCCACCTGCC TCAGTGCCCC CCAGTCACAT CTGCTGTTTC 002240
002241 TGCCATGGGT CAGCAGACAG GGTAGGGGTG ACTGGTGGTG CAGAAGAAAC CATCTGAGAG GGGGACCCCA ACACGGACAG 002320
002321 GGCACAGACG GGGCTTCCAC CAATCTCAGT GGATGAAGAT TCTGTCCCTG CCATCCCCGC ATTCTCTCCC TGGTCTCAGA 002400
002401 GGCCCTCCTG GGTCTCCAGT TGTCCTCTCT CCCACCTCCA CACTTTCTTG TTCCAGTCCT GCTCTTGGAT TTCTTTAATA 002480
002481 ATTTTCCTAC CTCCAAGATC CCCTGATGAT CAGTTTCTGC CTGGGGTCAC CAGGCGACTG ACCATGGTGG GGATGGTGAC 002560
002561 TTGAGACTCC TGGACCACAG TGCAGGTGAC ATATGCAACC TACAGAGTGA AAAGGAACAG TGTCACTGCT GGGTCATTTT 002640
002641 GAAGATGAGG CTTAGGTAAT GGATTAAAGA CTTAAATGTT AGACCTAAAA CCATAAAAAC CCTAGAAGAA AACCTAGGCA 002720
002721 ATACCATTCA GGCCATAGGC ATGGGCGAGG ACTTCATGAC TAAAACACCA AAAGCAATGG CAACAAAAGC CAAAATTGAC 002800
002801 AAATGGCATC TAATTAAACT AAAGAGCTTC TGCACAGCAA AAGAAACTAC CATCAGAATG AACAGGCAAC CTACAGAATG 002880
002881 GGAGAAAATT TTTGCAATCT ACCCATCTGA CAAAGGGCTA ATATCCAGAA TCTGCAAAGA ACTTAAACAA ATTTACAAGA 002960
002961 AAAAATCAAA CAACTCCATC AATAAGTGGG CAAAGGATAT GAACAGACAC TTCTCGAAAG AAGACATTTA TGCAACCAAA 003040
003041 AGACACATGA AAGAATGTTC ATCATCACTG GCCATCAGAG AAATGCAAAT CAAAACCACC ATGAGATACT ATCTCACACC 003120
003121 AGTTAGAATG GCAATCATTA AAAAGTCAGG AAACAACAGG TGCTGGAAAG GATATGGAGA AATAGGAACA CTTTTACACT 003200
003201 GTTGGTGGGA CTGTAAACTA GTTCAACCAT TGTGGAAGAC AGTGTGGCGA TTCCTCAAGG ATCTAGAACT AGAAATACCA 003280
003281 TTTGATCCAG CGATCCCATT ACTGGGTATA TACCCAAAGG ATTATAAATC ATGCTGCTAT AAAGACACAT GCACACGTAA 003360
003361 GTTTATTTTG GCACTACTCA CAATAGCAAA GACTTGGAAC CAACCCAAAT GTCCGTCAAT GATAGACTGG ATTAAGAAAA 003440
003441 TGTGGCACAT GTACACCATA GAATACTATG CAGCCATAAA AAGAATGAGT TCATGTCCTT TGTAGGGACA TGGATGAAGC 003520
003521 TGGAAACTAT CATTCTGAGC AAACTATCAC AAGGACAGAA AACCAAACAC CACATGTTCT CACTCATAGG TGGGAATTGA 003600
003601 ACAATGGGAA CACTTGGACA CAGGGTGGGG AACATCACAC ACTGGGGCCT GTCATGGGGT GAGGGGAGGG GGGAGGGATA 003680
003681 GCATTAGGAG ATATACCTAA TGTAAATGAC GAGTTAATGG GTGCAGCACA CCAACATGGC ACATGTATAC ATATGTAACA 003760
003761 AACCTGCACG TTGTGCACAT GTACCCGAGA ACTTAAAGT
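
The sequence above is printed in blocks of 10 bases, 80 per line, with a position index at each end of a full line. To use it in other tools it usually has to be joined into one plain string first. A minimal sketch of that step, assuming every all-digit token is a position marker; the function name `parse_numbered_sequence` is hypothetical.

```python
def parse_numbered_sequence(text):
    """Join a position-numbered sequence listing into one plain string.

    Each line is expected to look like
    '000001 ATCTCGCGGC AGTTGATTCC ... 000080': an index, blocks of
    bases, and (except on a short last line) a closing index.
    """
    bases = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # position marker, not sequence
            bases.append(token.upper())
    return "".join(bases)


# The final line of the listing, which has no closing index.
last_line = "003761 AACCTGCACG TTGTGCACAT GTACCCGAGA ACTTAAAGT"
tail = parse_numbered_sequence(last_line)
print(len(tail), 3760 + len(tail))
```

The last line carries 39 bases, so the listing ends at position 3799, in agreement with the Length field.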

Predicted Small Protein

Name NONHSAT078335_smProtein_1865:2158
Length 98
Molecular weight 10207.4134
Aromaticity 0.0515463917526
Instability index 42.8793814433
Isoelectric point 5.14678955078
Runs 13
Runs residual 0.00248186330661
Runs probability 0.0465125759245
Amino acid sequence MYKDPWQALCPSVSTTPPYNLLPGTLGKRTSVEQGLAEHEPGLGEVNNGLLPGPEPNRER
LCAEGGPQVLGLLLASFVPTTSHPPLTTTPTSFSSSP
Secondary structure LLLLLLLLLLLLLLLLLLLLLLLLLLLLEEEEELLLEEELLLLEEELLLLLLLLLLLHHH
HHLLLLLEEEHHHHHEELLLLLLLLLLLLLLLLLLLL
PRMN -
PiMo -
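
Two of the values above can be cross-checked from the amino acid sequence itself. This sketch assumes that aromaticity is the fraction of F, W and Y residues, and that the `1865:2158` span in the name is 0-based with an inclusive end; both are assumptions made for illustration, not documented LncRNAWiki conventions.

```python
protein = (
    "MYKDPWQALCPSVSTTPPYNLLPGTLGKRTSVEQGLAEHEPGLGEVNNGLLPGPEPNRER"
    "LCAEGGPQVLGLLLASFVPTTSHPPLTTTPTSFSSSP"
)

# Aromaticity taken as the fraction of F, W and Y residues (assumption).
aromatic = sum(protein.count(aa) for aa in "FWY")
aromaticity = aromatic / len(protein)

# ORF span from the name, read as 0-based with inclusive end (assumption).
start, end = 1865, 2158
codons = (end - start + 1) // 3

print(len(protein), codons, aromaticity)
```

The sequence has 97 residues while the span covers 98 codons, so the listed Length of 98 appears to count the stop codon. The aromaticity comes out as 5/97, which matches the listed value to the precision shown.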