NONHSAT005342

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT005342

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4050 nt

Genomic location

chr1-:114420790..114430081

Exon number

2

Exons

114420790..114424619,114429871..114430081

Genome context

Sequence
000001 GGGTGATTTA CCAGTGTTTT TAACTTCCTG CTGGGCTGAA AACTGCTTGT TTCGTGGAAA AGCAAAACTT GACAGCAAAC 000080
000081 ATCTAAAATG AAGAGCTCCC AAACTTTTGA GGAACAAACG GAATGCATTG TGAACACTCT ACTCATGGAC TTCTTGAGCC 000160
000161 CAACATTGCA GGTTGCCAGC CGGAACCTAT GCTGTGTAGA TGAAGTAGAT TCAGACAGGA GCTATACTCC AGAACACTGT 000240
000241 GGAATCTCTC AGCAAGACCT GGTGTGCTCA GGATTCCAGC TTAGCTTATG AGAGAGCTTT TCTGGCAGTG TCAGTGAAAC 000320
000321 TTCTTGAGTA CATGGCTCAC ATTGCTCCTG AAGTAGTGGG ACAGGTGGCT ATCCCCATGA CGGGTATGAT CAATGGGAAC 000400
000401 CAAGCCATCC GGGAGTTCAT CCAGGGCCAG GGAGGTTGGG TAAGTTTGTT GGGGCTCCTG GTTTTGCTAG GTATTTGCAC 000480
000481 TGTTTTTGTA CTGATACCTG GGATTCCAAT TCATGACCCT TTTTGTAAAT TGGAAACCAA CAGGAAAACA GGCTTCCCTA 000560
000561 TTTTGCTGAA AATGCCTAGA TTTTCTTGCC ATGGAAAAAT GGAGCAGACA GAAGCTCAGC CTGTGGATGT TGGACTCGAG 000640
000641 TCATGCTACC TTAAATATTG ACTGCTGACT ACTGTAAAGC TAATCTTCAT TTCCTACACA TAATCATCTA TAAAGTATAT 000720
000721 TCATGATAGA GTTCTAATTT CAGCCTGGCT TCACTCTGTT CCTTTGTAAA TTTCTGGGTC TTTTTTTCCT AACTTCTCTA 000800
000801 TGTGTGTGTT AGGGAAGTCA GAATCACCTT AATAGTATCT AAGAAAAAAC AGTCTTTGCT GTAGTGGTAA CTATATAAGG 000880
000881 TATTCTAAGC CCATGTTTTA GTAATCATCA GAACTTATGC TGTCATAGAG CACTCCTTTC CATTTACAGG GTCAGTGAAA 000960
000961 GCCTTAATAA GAGCTACATT TCCTGGAGCT ATCTGCAGAT CATTTCCTGG TTTCTTAGGA AGTTCCTGAA AGCAATTCCT 001040
001041 TTGACTGTAG GATTTTCTCT GTCTTCCCTA GGAAAATCTG GAGAGCTGAA GAGTTGGAGC TATTCACGAG ACTGAACATC 001120
001121 ACTTCCTTGT TGACTGATTG GGTGATTGAT TCCTGGGAAT TCAAACAAAC AAATAAAAAA GCACTTTTTT CATTTTATCA 001200
001201 GAACTGAACT TAGCTGAATA AGTTATTTTT TACTGATTGT TAAAGTTGGG AGCAGCTGCC AGAGGCCTGC AGAGTTGGTT 001280
001281 TTTGTTTTGT TTTGTTTTGT CAACTTAATG CAAACCACAG AGATTTTCTA CTTTCTGTTT TCACATGAGT TTTAATGAGG 001360
001361 TTCTGTTGAA GCAAAGACCC CTAGACACAA AGTAATGACT TGTTAGTAGT GGAATTATAA GCAACAGGGC AGGCCTTTGC 001440
001441 TGGAGGTATT TTGAGAGAAA GGGAGAACAA TGGAAACTAT TTCTTCAGAT GTAGCCCTGT CTTTTGGTAA GAATTGTGCC 001520
001521 TACTAATTTT GCAATTTAAA GGATTTCAGG AAGCTTTTTG GTTGAAAAAT CTTGTTTTTT TTTTTTTTAG ACGTAGTCTC 001600
001601 ACTCTGCCAC CCAGGCTGGA GTGCAGTGGT GCGATCTCGG CTCACTGCAA GCTCCGCCTC CCGGTTTCAC TCCATTCTCC 001680
001681 TGCCTCAGCC TCCTGAGTGG CTGGGACTGC AGGGGCCCGC CATCACGCCC GGCTAATTTT TTGTATTTTT AGTAGAGATG 001760
001761 GGGTTTCGCC GTGTTGGCCA GGATGGTCTG GATCTCCTGA CCTCGTGATT CGCCCGCCTT GGCCTCCCAA AGTGCTGGGA 001840
001841 TTACAGGCGT GAGCCACCTC GCCCGGCCTA AAATCTTGTT ATTTTAAGTT GAGCATTTTC ATTCAAAATC ATCCCTAAAC 001920
001921 TTCATGTTAA TTTCACCTGA GAAGGACTAT TTTATGCATT TTAGAGGTTG GAAGCAAAAA ACAAACAAAC AAACAAAAAC 002000
002001 AGTTGTTTCT ACTAGGAAGG TCAAAGAAAA TAAAAGTTAC TCCATTTTTA CTGCCACAGG ATGCAGGAAG TGCTGGCCCA 002080
002081 CATCTAGGAC AGCAAGGCCA CCCCAGCTTA GATGAAGCTA GCTGCATAAC ACAAAGCTTT AAAAGTGTGG TTCACACACC 002160
002161 ACTTGTGGAC CTTATTGAAT TGCTGATTCG CATTCCTAGA GATTCTGATT CTGTAGGGGT AGGCTGGAGC CTAGGAACCT 002240
002241 ACCTTTTAAA CTAGTTCCAT AGGTGATTCT GATGTACATA TAGAGTGTGA CAGCCATCAC TATAGAGAAT AGATTGATAA 002320
002321 ATTACAACCC ACGCGTCAAA TCTGGCTTGC TGTCTTATTT TGTAAATAAA GTTTTATTGG AATAGAGCAA CATTCACTTG 002400
002401 TTTACATATT GTCCATGGCC GCTTTTGTGC TACAATGGCA GAGTTGAATG ATTATAGAAG ACCATACGGC TGGTTAAGTC 002480
002481 CAAAATATTT ACTAGCTGGC CCTTTACAGA AAAAGTTGGC TGACCCCTTC TCTAAATCAA CATTTCTCCT TGGTAACTGA 002560
002561 AACTCTATTA CAGTCCTGAC AATTCCAGCA AACACAGCTG TAGATAGGGT TTAAACTCAA AGATATTTAA CTCTTCTTTG 002640
002641 GGAACTTAGT CTCCATATGT TTGTTAGTTC TTGCATAATC CACAGTTTCT CTGGGTCTTC AGCCAATCAG AGAGCTCTGT 002720
002721 AGCTCCTGAG TAGTCCATTT CTCTGGCCCT ACAAGTGAGA GTGATTGGAA GCAAAGCTTA TGATTTGTAT GATCTTGTAT 002800
002801 CTCAAAATAG TCCTTAATGA TCTAATAGAT TAGTAGGCAG TCATCAGGTA GAGACTCCAA AAACCAGACT ACTTTCCTAA 002880
002881 ATATCAAGAA GAAAGGCATT GCTACAGTGA TTCATGAAAA GCAGCATTAA TAACTTTGGC TAAAGTTTAA CAAAGCTAAC 002960
002961 CACTTCCCCT CTATCAGCCA GCCATCTATG TATCTCCTCT AAATGCAGAG AAGTAAATAT GTAGTGCTGT TAATACATTT 003040
003041 TTGCTTTTTA AAGTTATGCT TTGTCCTGGC GCAGTGGCTC ATGCCTGTAA TCCCAGCACT TTGGGAGGCC AAGGCGGGTG 003120
003121 GATCACCTGA GGTTAGGAGT TTGAGACCAG CCTGGCCAAC ATGGTGAAGC CCTGTCTCTG CTGGGGATGC AGAAGATTAG 003200
003201 CCGGGCTTGG TGGCGCGTGC CTGTGGTGCC GGCTGCTCAG GAGGCTGAGG CAGGAGAATC ACTTGAACCT GGGAGGCGGA 003280
003281 GGTTGCTGTG AGCTGAGTTC GCGCCATTGC ACTCCAGCCT GGGCTACAAA AGCGAAGCTG TCTAAAAAAA AAAAAAGAAA 003360
003361 ATAAATAAAG TTATGCTTTT TCCTCTATTC CTAGTTAAAT CACAACAAGT TAGTAATCCA TAAATGATGT GTCCTGTTTC 003440
003441 TCTTTAGTAG AAATTATATT TTTGGCTACC AGTTAAGAAA CTTGTACTCC TTTGTCCCTT ATGTTACTAT AAACTCAAGA 003520
003521 TGATGAGTTT TGTGGTATTT GACTTCATAG GCAAAATCAA AATTTTTACT TTGTTGCTAT TCTGTTTTAT GAAATAAACT 003600
003601 TCTGTCTATG CATTTGAACT AAGTTTCAGC AAATTCAATC TAAATTGAAT AATTCCAGCT CCCAGTTTTA TCCTATGTTG 003680
003681 CTCATAAAAC AGTTCCAAGT ATACTGCATT ATCTTGAGAT TTGAAGATAT GGTGCCCACG GGGATTATAC TAGGCAAATG 003760
003761 CGTTAAGCAG CTCTGGCCTA GGTGTTGTGT ATTTTAAGAG ACTCTATCTT AGGAGAGCTT AAGTGATTGG GCTGCAGGAA 003840
003841 GAAGACATTG TAACCCAGGA ATTAAAAATG GATTCAGATT GCCTGATTTT AACACTTTAG TTTCACCATA GGCTAATTAT 003920
003921 GTGACATTGG GCAAGAGACA TAATTCTTCT GTACCTTAGT TTCTACATTT GTAAAATAGA GATGATTTGG TAACTTATTA 004000
004001 ATAAGATTTT TGTGAGAGAT AAATAAAACA AATACATTTT GTAAAAAAAA
[back to top]

Predicted Small Protein

Name NONHSAT005342_smProtein_3002:3166
Length 55
Molecular weight 6285.5619
Aromaticity 0.148148148148
Instability index 64.8777777778
Isoelectric point 9.69427490234
Runs 4
Runs residual 0.0886178861789
Runs probability 0.0362789259849
Amino acid sequence MQRSKYVVLLIHFCFLKLCFVLAQWLMPVIPALWEAKAGGSPEVRSLRPAWPTW
Secondary structure LLLLLEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLL
PRMN LLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiTTTTTTTTTTTTTTTTTTTToooooooooooooooooooooo