NONHSAT057058

From LncRNAWiki
Revision as of 08:41, 13 October 2014 by 73.162.128.239 (talk)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT057058

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3956 nt

Genomic location

chr18+:5236721..5245193

Exon number

5

Exons

5236721..5237302,5238083..5239048,5240647..5241256,5241698..5241890,5243589..5245193

Genome context

Sequence
000001 TATCAAAATG CATCTTGTTT ATTTGGCTTT TGAAACTGTT ATAACACAAC TTATAACATT GGAAATACAT TATTTTATTA 000080
000081 ATACAAAGTA GTTTTCATAG TAAAATACCA AATTAATTAA AAGGCAGAAT GAAACGTACC ATATACATGT AGACGACAAA 000160
000161 TGAAAACTGT CAAAGTATCC TTCTCTCTAA AAAGCCACAA ATTACGTTCT TTGAAGTAAG TATTGGGTTT TATTTCCTAA 000240
000241 ACTATACACA TCAATATCAC CTCCTGAAGC TTTTTACCCA GTTTCACGGT TGGTATTTCG GACATGCAAT GTTCCATGCT 000320
000321 GTCAAACTTG GAATGCCTCG TTTGGATACA GGGAGACAAA GACTTGGAGA CCCAGAAGCC TGAAAGTGGT TCTTGACCGA 000400
000401 CTTTGTCCAC TCAAATTAAG GCATCATGCA GCTGGTCATC ACTTACATTG CTGGTTGCAC GCAGGCACAC ACACACCAGT 000480
000481 GCTCACAGCC AGCACTTGTT AAGAATGGTT GAGCTGGACC AAATGGTATA CGTGTCTTTC ATCAATATAA ACATGGACTG 000560
000561 CATTCATCTG GACACGGGCC TTGCGGCCGG GAGGGACCGA GCGGGGAGCA AGGCCTGCGG GGAGGCGCAG GATGGACGCG 000640
000641 TTGGCTGTCA TGATGTAGGA CCTCCTGACC TAGAACCATG CCCGGCGCAG GGAGAACAAC GAGCTTACGG ACCGGGTGCG 000720
000721 GGGGCTGCTG TGCGAGAAAG CCTACCTGCT GGCCCGGGAG CGTTCGCCGG CCCACCTGGT GGCTTTCCCT GGACGTTTAA 000800
000801 CGGCGACTCC GCCCGGCCTC CCGACTTTTT GATGCAGGCG TTCTCTTACA TGACTTTCTT CGAGGCTAGA TTCTCGAAAG 000880
000881 ACATCCTGAA GGTGGCTTTC TTAATTAGCT GCCTCACCGG CTGGCGGAGC AGTTCGTAGT CGCCTACATC GAGAGGGAGA 000960
000961 GCCCCGTCCT GGCTCAGTAC TGGGGCTTCG TGGACGCGCT GTAGCGGGTC TTCGCCGTCG TGGGTAGGAA ACAGTCGGGC 001040
001041 CGGATCCCCG GTGTCTGGGG AGCGGCTGCC GGCCGGGCCC GCGCGAGCCG CTTCACTCCT ACAAGCCCGG CTCACTAGTT 001120
001121 CCAAGCTGCT GGACTTTGAC CCTCGCTGGG AAGCTTGATC GCCGCCTGTT CTCGCCAATC TCTATGCCTT TGCACTGCCC 001200
001201 AGCAACCCGG TCACTGCCCA GGACCTGTTA AAAGTTGAAT GGACCTCCCA GAATCACGCC GCTGCCACCG CCGTTTCTAG 001280
001281 AGACGGCTTT TGGCCACCTT TGAGAATCAG GGTCAAACTT GGAAGGGCCT GCATTTGTCA CACTTCCTCG GACACTAATT 001360
001361 TGGATAGTTC AGGCCCAGAG ACGGCTTCCC GGGTTAGTGT GCCTGACAAC ACTGAAGAAA AATTGTGACA TTTTCCTTCC 001440
001441 CCAGAATTAC TTCGTTTTAT ACATTTCACT TCTCTTCTGA AATCACAGCA ATGCCAGTTT GCGCCTTTTG GTCTGAGGAA 001520
001521 CGCAGGCCCT GGCCACTACC AGGAAAAAAA CCTTCTGGAT TTCAGCTTTG GCAAGCTTCC TCTGTTTCTG AGACACCTCC 001600
001601 ATCACCTGGG TTACAATCCC ATCCATGATT TTCCCCATCT CCAGGCATTT TTCATGGAAA GCTCCTCATG CTTTTTAATT 001680
001681 CCAGTTTATC CATTTGTGTG GCTTTAACCT CCACATCCAG GTCTTTCTGG CTAAACTGCA GTATATCCAC AATGGGGCCA 001760
001761 ATGGACACAT AAGGGAGGCC CTATCACAGG AACACGAACA CCCGGAGTCG CTAGAAATCA GATTCCACCG GCTTTGCTTC 001840
001841 AGGCCGACTT CCTAAGGAAG GCATGGTAGC ATCACAGATG CTGTTTGTCT CAGTTTCCTG AGAAGCCAGT TTCTTAGAGC 001920
001921 TGTCATTCAA GAGTGGGTCA AATTTGAGAT ATAAAAACCT TTCTCAAGGT AGACTCTTTG GCCCCTATGC CTAGTATGTC 002000
002001 AGCTGGGCCT CTGAAGCTCT CCTCTTTGAG GTCCTGGCAG GCTCTGGCCC CTGTTGAGAA GGGGAAGCAG TCATGTGAAC 002080
002081 TCATGTGCTG GAAGAATCCA CTCTACCCCT TCCCTGGCTT CTCTGCTGCA ATTTTTGTCT TTAAACGTAT TTTCAGGTAT 002160
002161 CCTTTGCACA TGGCCTGCTA TCAACAATCC TCTACAATGA TTCTCAGCAT CACCTAGGGA GCTTATGTAA AATCTCAAAT 002240
002241 TCCTGGGCCC TATCCCCAGA GTTTCTTATC AAGTAAGTCT GCATTGGGAT CTGAGGATTT GCATCTCTAA CAAAAATTCC 002320
002321 TAGGCGATGC TCACACTGTT GGTGTAAGGG TACAGTTTCC TTGATGGAGA ACTATCAAAG CAGTCAAAGC TATCAAAATA 002400
002401 AAAAAGAATT CTGATTGGGT GCAGTGGCTC ACACCTGTCA TTCCAGCACT TTGAGAGGCC AAGGTGGGCA GATCACATGA 002480
002481 GGCCAGGAGT TCAAGACCAG CCTGGCCAAC ATGGCAGAAA CCCCATCTCT ACTAAAAATA CCAAAATTAG CCGGGCATGG 002560
002561 TGGAACACAC CTGCAATCCC AGCTACTCGG GAGGCTGAGG CAAGAGAATC GCTTGAACCT GGGAGGCAGA AGTTGCAGTG 002640
002641 AAACAAGTTG GTGCCAATGC ATTCCAGCCT GGATGATGGA GCGAGACTCT GTCTCAAGAA AAAAAAAACA AAAACAAAAA 002720
002721 CAGAACTCTG CCTCATAGCG TCTTTTAAGT TATACACTGG ACCTATCCTT TCCTCACTGA CAGACATTTT AAAAATTTTT 002800
002801 TGGTAAGGCC TAGTTCATAT AAAATGTAAT CCAAGCCAAA AGTTAACAAG AATAAGGGGA GGAAAGGGGA CTCCAATAGC 002880
002881 AGAGAAAGGT ATTTACCTGG GATATACACT GCAAGAAAAT CAAAGCTATA AGAAACGTCC ATGAATAGTA GCCATAAGGC 002960
002961 ATCAGAGTGA TAAAATTCCT GTCCCTAGGA GGGAATATTG GAGTTTGCCA GAGAAACAGA ATGAGAGAGA CAGAGAGGTT 003040
003041 TATTGTAGGA ATTGGCTCAT ATGATTACGG AGGCTGAGAA GACCCACGAT CTGCCATCTG CAAGCTGGAG AATCAGGAAA 003120
003121 GCTGGAGGTG TAATTCAGTC AAGTCCAATG GCCAGAGAAG CAAGTGTACT GATATCCAAG AGCAGGAGAA AATAGATGTC 003200
003201 CCAGAACAAG CAGAGAGGCT GATTTTGTCC TTCCTCTGCC TTTTTGTTTC ATATGGGGCA CTGAATGGAC TGATGCCCAT 003280
003281 CCACATTAGT GAGGGTGGAT CTTCTTTACT CAGTCTACCA GTAGAAATGT CAATGACTTC CAGAAACACC CTCACCAACA 003360
003361 CACGTGGAAA TAATGTTTTA CCAGGTATCT GGGCATCCCT TGGTTCACTC AAGTTGACAC AAAATTAACC ATCACAGAAG 003440
003441 GAGACTGGCC TTACTCTGAA ATTAGGAAAC TAAAGAAGTG ACCAGAATGG AGACTAGGTA GAGACAACTA GTTCTCTACC 003520
003521 AAACATGTAC AGTTATTCGT TGGTATCTGA AGGGGATTGG TTCCAGGAAC TCTCAGGGAT ACCAAAATCT GCAGGTGCTC 003600
003601 AAGTCATTTA TATAAAATAT TACAGTATTT GCATATAACC TTTGCACATC TTCCATATAC TTTAAATCAT CTCTACATTA 003680
003681 CTTATAATAA TGAATGTGTA AATGCTATGA AAATAGTTAC TACACTATTG TTTATTTGTA TTTTTATTGA ATTGTTTTGG 003760
003761 GGTGGGGGGC AGCTGTATCT TTCTTAGTAA TAGAACCCCT GGTTTTAGCT GGGCACATGA CTGCCCTCAA TAAAGATTAA 003840
003841 AGTACCCCAG CCTTCCTTGA GATTGTGGCC ATGTGACTGA ACTTTAGACA GTGAGATATA AGCAGATATC TTCTGTGGCA 003920
003921 GTGTTAGGAA ACTATTAAAG ACAGTAAGAA CATTGC
[back to top]

Predicted Small Protein

Name NONHSAT057058_smProtein_2510:2737
Length 76
Molecular weight 8421.6433
Aromaticity 0.0266666666667
Instability index 61.8653333333
Isoelectric point 10.1537475586
Runs 10
Runs residual 0.0156028368794
Runs probability 0.0315845257022
Amino acid sequence MAETPSLLKIPKLAGHGGTHLQSQLLGRLRQENRLNLGGRSCSETSWCQCIPAWMMERDS
VSRKKKQKQKQNSAS
Secondary structure LLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHLLLLLLLLLLLHHHLLLHHHEELLL
HHHHHHHHHHLLLLL
PRMN -
PiMo -