NONHSAT024052

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT024052

Source

NONCODE4.0

Same with

,

Classification

sense

Length

4990 nt

Genomic location

chr11+:109294858..109299838

Exon number

1

Exons

109294858..109299838

Genome context

Sequence
000001 AATTCGGCAC CACCGCCAGC CTCCAGTCCC CAAGGAGCAC ACGCAGCTTC CTCCTGTTTG GACACAGCTG GCGAGGGCCT 000080
000081 TTTGCAAACG GTGGTACTGT CCTGATCGTC TAGCCCCTCT CGTTCCCCGT CCTCGTTTCC AGCATCTTTG CCACCCTTGC 000160
000161 TTTTTTCCTT CTTCCTTCCT TTTCCATTTT CCTCTGGCCC CTCTTTCCTC TTCCTGGTTT CCTTACCTGC CCTCCCCTTA 000240
000241 CTCTTGTTTC TCCTCCGCCG AGGCACTGTG CGGTATTTGT AAATATTGGG CGAGGAAAGT CTCGGAAGAA GAAATAACGC 000320
000321 TGATAATAAT ACTTTATTAA TATTTATAGT AATTATTATA ATACTAATAA CACAATCCAA GCGCATGACA AATCACATAA 000400
000401 TTTCTAACTG TCAATGGAGG ATGCATCTTC CCATTCCACC CCGCGCTCCC CTCAATTAGG AGGAGAAACC GCACAAGCTA 000480
000481 CCAAATATAT AAAAGGCATT GATTCCCGGA GCAAAGGGAG GGGAGGGGGC CCGGTAATGT ACATAGCTGT CAAGTTAAGA 000560
000561 TTTAGTTTCC TTCCTTCCCC TCTGACCCCT AAGCTTTCAC TTTTCCTTTA GCTTCCCTTT CCTCCCCATT CCCACCCTCA 000640
000641 GCCGGGCTCA GGCAATAGTA TATTATAAAG AAAATGTCTA CATTAAGCAC CAGGACTGCA AGAGGCCATA GAGAATAGTC 000720
000721 CCCGGAAAGT GTTTATGACA GGTACCCCTC GCCAGTCCGC CCATTTCTCA CCCTTTGGTT AACCATTACA TTTTCCAGGA 000800
000801 CAGAGCATTT AATTTACTTT TTAAAATGAC CCTCGCTGGC CGAGCATAAG TACGTTAAAA TCTTTAAATG AGTTTTTTTT 000880
000881 AAAAAGCTAA CGCTTTCATT CCCTGCCCCG CCCCCACCCG TACACCTTTG ACTTGTGACA TTTTCAGGAT TTACAAAGGA 000960
000961 TCTGGGAGCT GTCCAGCCAG GTCTGGTGCT GAAGTCGCCT CACCGTTCTG ATTACTTCCT CTAGTTGTGA AGGCAGAGGA 001040
001041 GGGGCTGTTT TGGAAAGTGA CTATTCTGGC TTTTGGTTGG GTTCTTTCTT CTTTTTCTAC AATCGAGTTA GCGTGTACTA 001120
001121 TTGGTTTTCT TATTATTAAA CATTGCATAA GTTACCTTTT TTGTAAAAAA AAAAAAGTAT TAGGTGATGT GCAGTACTGA 001200
001201 AAGTGCAGTA TCTAACCAAC TAGAACGTTT GTTTTATTTT TAGAACAAGT GCACCTTTGT TATATATTTA GTATATTGGT 001280
001281 ACCAAATACA GAAAAAAACT ATAGTTCTGT ACTATGTCCT CCAAACTGTA TATTATTGTT CTTAATTTCC AGCTGTTGAT 001360
001361 ATAATGGTTA CCACTGGATG AGAAATTCAG TGGTGCAGAC CTGGCTTCTG CTGTTTCCAG AAGTGTTCTT TTGTACCTTA 001440
001441 TTCTGTAGTA GACTGTTATA AAAAGATGAC ACACACGCTT TATTTTTTCT TTTGCAAATA ATAGAAAAAA CACAACCACA 001520
001521 GAAACAAACA AATAATGGAT GTGCAAGAAT GCCATCTATT AAAAACATGG TTAATATTTA AACAGTGCCT GTAGTTCTCC 001600
001601 CTGCATGCAG TTACCACCTG GAGGCAGTGG TGTGTGTGCT TGCTTGTACT GTATGTGGTT GGGGATGAGA CTGGTGGGGA 001680
001681 TGTTGGGCAA TTTGGATGAT AGGACATGCG AATTTCAAGA AAAGCTAGAT CCAGTAATTA CTTATAGGAT ACAAATGTTT 001760
001761 GCAGCTTGTT TGTTTAGTTA CAAATCTGTG CCTGCAGCAA AAGAAAGGCT CTCTTTCTCA GTCTGTCCTA ACTTCACTGA 001840
001841 ACATACAAAA ATACTGAGAA AAAAAGAAAG TGAGAAAATA AGATAAATAT TTTATTATTT TCTAATGCCA GACAAGGCCA 001920
001921 TCTATGTTAG AAATGGAAGT AGTCTATAGT TATTTACCAC CTTGTTTCCT TTGGTTGAAT GACAGAAGAA GCCAGAAGCA 002000
002001 TATTACAATA TTAATTATAT GCAGTGACAT CTCTTCTTGG AAATCAAGCC TTTGACAGCT ATTCAAATTT TTGGAAAATG 002080
002081 TATGAATGAA AGTGATCTGC AAAAAGCTCT ACCAAGAGAG TCCTTACTAT CCTGCAGGGT GTAAGGCAAC AGTGCCCCTA 002160
002161 TGAGCATCCC TCTCCCAAGC AACTCATTGT TCTAAATTGT CTACTTGGTC CAGACATATG TGCTCTTATC CAGAGGAAAC 002240
002241 ATTACCTCTA ATATTTTTGG TCATCTATTA AATTGCCCAA GCTCAGTTTG CATCTTTTAT AAAAATAATT TAATAAACTT 002320
002321 AATTTTAATC ACCCATAAGA TAATCACCTT TCCTGAAGCA CAGAAGATCA TCATTGAAGC ATGCCATCTT GTAATATTAG 002400
002401 GAAGACCAGG TATAATCTTT GGGTACAATA TATTAAACAA TGAACCAGTT TTTCTCCAGT GCCTTAGTCA CTTCCTAGTA 002480
002481 ATAGGTAAAA GAGTGCATTA ACTCCCTCAG GCCACTAGGG AATCCTGTAG TGCATCAGAG TTCTACACTG TACATCAGAA 002560
002561 TGACTAAGGC ACTGTGAAGA AGAAAGTCCA TTGATTTTTG TAGCTTACCC TCATCCAGTG GTAGCTCTTT AATACCTTAT 002640
002641 TACAGAATGA CTTTTGGACT TGAAATTTGA ATAGAATCAG GGTCCCAGAA GGGAGCTTCA TCATTTCCAA TGAGCTCCAA 002720
002721 TAAGTGCATG GATATTACTT TCTTTCCCTT ATTTGGTGCA CTTTCTACAG CGAATAATTT CCCATTTTAT TAAAGCAATA 002800
002801 AGATGTTCTG TTGTGAGCAG TGCTGGATAA CGATTCCATT TTCTTGGTGC TGAAGCTACG CATATGTTCT TGCCTTATGA 002880
002881 ATCATAGTAT ATTCAAGGCA TGCAATAATA ATCCCCTTCC AAGGAGCAGC CTGTACACTC TGTGGGGGTA ATTATGAAAT 002960
002961 TGTCCTACAT CATTTCTCCC CAAAGGAATG CAAGGAGCAA GAGAAATAAA AGCATTATGT ATATTGTAAA ACAAATGATC 003040
003041 TCAATACCTC TCAACGTGGG CACACATCTG TAATACACTT TTATTGTGCA ACTTTAAATG TGTGTGCCTT TCTCCATTAA 003120
003121 TTGACTGTAG TTTAATATTT TTAATAGTGC TCTGTGTAAA GATGATGCAG TTTGTGAGTC AGATTTCATA CTGAGCTTAA 003200
003201 TATACTCAAA ACCTAAGCTT CCAGTGGTAT TACTTGTGAT TTTTTTATGA GATAGAGAAT TACATGAAAT TCTAGTAAGC 003280
003281 CAGTTTCCAC ACAAAATTAT TTCAATAAGC ACATATCCAA GCAATTTGTA CAGACCTTGT TTAAAGCACT GCTTCTTATA 003360
003361 GGGCTCAATT GGCAGTCTCA GGAGCTTGGG ATGCTACCAA CAGGGCATTT AGGTTTTCAT TTTTATGAAA CAAAAGGCAA 003440
003441 TCTGGAAATA GCTAAATTTA TTTTAAGGAT TTTATTCACT GATGTCCCGT CAAACTAAAT GCTTCAAACC CCTATAGCTA 003520
003521 CAGCTTAACT TCTGCACTTG CTATTCAGTA TTAATAAGGT GGCATGCTTG ATTCTTATTG TTTTTAAAAA TGAGAAAATT 003600
003601 TGGAGAGAGA ATACTATTAT GTCAACGGTA CAAGACTCTG AATCTTGAAG ATGTAGATGG ATATAATATT TAGACTTTAT 003680
003681 ATACACCCAT AGATATGTAT TTATATATGC ATACGTTTTG TATAAATTTA CAATTGACTT TTTGTATTCT CTTTTCTGTC 003760
003761 ATTACAAGAA TGAGATGGAA ACCAAAATAG TTGTTCCATC CTCTTACCCA AAGAGGATAC TGAAAAGTCC GGTATGTGCA 003840
003841 TGCACTTGTT TCTCTGGGGT CAAATCTGAA TGACTTTCTC TTTAATAATG AAAGGGGAGA GTCCTGGTTA GCTTGGAAAA 003920
003921 TTGGTAATTA GATAAGCTTC CTTTCCTAGG TTAGTGACTC AAATGTAGCA ATGATAATAA AGTTATGACC ATATCAATGT 004000
004001 GGTCTAAATA TCTAGATGGA AAGCAAAAAT ATCCAGAAGA GGTTTAGGGC CATCTAGACC ACAAACCTGG AACCTGGATT 004080
004081 GTGGTTTGAT TTTTTTCATT TTCAACCTAT ATGTTGTGTA GAAATACAAT TTATATCTGC ATAGCTAAAG GTAAAGTGAA 004160
004161 AATATGACCA TTATCTGTTT AGTTTGAGAA ATCATTTTTG GATAATATGG ATAAATTAGC TTTAAGAAAC TTCTTTGACA 004240
004241 TCTGACTTTT CCACCATTTC TATCTGTATC AATTGTATTA ATTAGATAAT GGTTAAATTT TTATTTTTTT ATCATGTAGA 004320
004321 CATATGCTTT CATGCTTTTC TGTGGGTGAA AGTTGAACAA TTGCTAGCTG ATATTATGTG GAAGTTACAG GTTAAAGTAG 004400
004401 AACCAGAAAT TTTAATATAA TCTGATCAGG TTCCTTAGTC CCATTCCCAT GTGAATCTGA ACTTTGAATT TGAATCTGAA 004480
004481 AGGCCTACTT ATTTGGGCCA TTTTTTGTTG GAACTTGTGA TTGTAACTCA CAGATGTTTC AGTTCATATT AGAATAAATT 004560
004561 GCCCAATCTA TAAGAGAGTG TTTAATCATT GGAATGAATG TAAATCTGAT TCTGATTTGG TTATTTATTT GATAACATTT 004640
004641 GAGAAACAGC TAATGTGGCT TGACATCAAT ACTTATTAAT GTCTTCAACC TAATAATACA ATTGGTGTCA TGTATGAATG 004720
004721 CTTAGCTGAC AATAAGAATT TAATCAACTT TATTTGGAGA CAATGACTTA TCAAGGAGGT CCTACCCATT TGTGGTAGGG 004800
004801 ATATTTCCCT TACAGTTGTG ATTACAAGGG ATTGTTTCTA TAGAGAGATG AACAGCAACA ACAGTGACAA TGATGACAAC 004880
004881 AGAAAAACTG ACTTTTGAAC TTGAGTATAT CCAATAGATA GAAGAAAAAA GATATTAGAA TTCATGGAAA TTTTGTTTTG 004960
004961 CTTTTCTATC AATAAAAGAG TTTTCTTGTT
[back to top]

Predicted Small Protein

Name NONHSAT024052_smProtein_1604:1885
Length 94
Molecular weight 10598.4639
Aromaticity 0.0967741935484
Instability index 42.4419354839
Isoelectric point 8.82122802734
Runs 16
Runs residual 0.0411190760653
Runs probability 0.0480149266915
Amino acid sequence MQLPPGGSGVCACLYCMWLGMRLVGMLGNLDDRTCEFQEKLDPVITYRIQMFAACLFSYK
SVPAAKERLSFSVCPNFTEHTKILRKKESEKIR
Secondary structure LLLLLLLLHHHHHHHHHHHHHHHHHHHLLLLLLLHHHHHHLLLEEEEHHHHHHHHHHHHL
LLHHHHHLLLEEELLLLLHHHHHHLHHHHHLLL
PRMN -
PiMo -