NONHSAT100517

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT100517

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3861 nt

Genomic location

chr5-:12653170..12737969

Exon number

6

Exons

12653170..12654688,12657511..12658227,12659546..12660157,12663441..12664022,12723669..12723939,12737810..12737969

Genome context

Sequence
000001 TTTAGGGACA AGAATCAGAT TCAGAAATCT CTAGATAATA AAACTGAAAT CATTATTATT TTGTATAAAT ATATCTACAT 000080
000081 TACTTGCTTA TTTAATAAAG GAATACATTT TTAATTTTCT AGGCTATTAT ACATGAAATG TTGCTTTGGC TTTTGCTTTA 000160
000161 AAAAAAAAAA AGTATTTGGT AACATGTGGA CATTTACTAA GCAAATGTGA TAGCTTAAGA TTGAGGGTAG CCACCTGTAA 000240
000241 TCCCAGCACT TTGGGAGGCC GACACAGCCA GATCACGAGG TCAGGAGTTC AAGACCAGCC TGACCAACAT GGTGAAACCC 000320
000321 CGTCTCTACT AAAAATTAAA AAACAAACAA AAAAAACGAA AGAAAAGAAA ATTAGCCGGG TGTGGTGGCA GGCACCTGTA 000400
000401 ATCCCAGCTA CTCGGGAGGC TGAGGCAGGA GATATAATTC CTCAATTTAG CCTTCCCACC TCAATACAGT CTGATAACAG 000480
000481 ACGAGCCTTT ATTAGTCAAA TCAGCCAAGC AGTTTTTCAG GCTCTTAGTA TTCAGTGAAA CCTTTATATC CCTTACGGTC 000560
000561 CTCCATCTTC AAGAAAAGTA GAATGGACTA AAGGTCTTTT AAAAACACAC CTTACCAAGC TCAGCCACCA ACTTAAAAAG 000640
000641 GACTGGACAA TACTTTTACC ACTTTCCCTT CTCAGAATTC AGGCCTGTCC TCGGAATGCT ACAGGGTACA GCCCATTTAA 000720
000721 GCTGCTGTAT AGACATAACT TGGCCCATGA TAGCTAGTAT TCAGTTCTTC CTTTTATGCA CAACCACAGC CAGCAGGAAG 000800
000801 CTACCAGAGA ATATGCACCA GTGAAATAAG GTTGTAAATA AAAAAGATAT GCAATCCATG AAACAGAACA TCCAGCCAAG 000880
000881 GATCATAACA GCAAATGCCA GCTCTGGTGA GCACGTTATA TTGAAAAGGG TGTGACTGTG GTGAAAGACT TGCCACAAAT 000960
000961 CATGAAACAA AACCAACCAG CACTGACAGA TCATTTAAAA TGTTTAAATA CTTCTTTGTT TGACATTTAA AATGTTAGTC 001040
001041 ACGTTTCTGC AATTTTACCG CCTGAAAATA TGTTCACTGC AAAATTTTGG CATAACCAAA AGCACATCTG GAAGTTACAT 001120
001121 TTCTCAATAA GCTAAAAATA GGATTTAGGG TTAACCTGAA GTGTGTCAGC AATTCTCTGA GAAGGGTGGA TTAGAGTGCT 001200
001201 CCTAAATAAT GCTGAAGTGT TTACAATTCA AACACACCTG AGGCATCTCT TCACATCCTG TTTTCCTGTA ACTCTTGTGA 001280
001281 TCTCCTGGGC CCTTGTGCGT CACCTGGGTT GGAGGATTAT CCATCCGCTC TTTCTAAAAC ATGCTCTAAA AAATGTCTTA 001360
001361 GGAAAAACAC TATCTAAAGT CGTTTATAAG CTATTCAAAT TTTTAGTAAA ATATCAATGA AAAGACTTTC TAAATGCCTG 001440
001441 CCACATGACA CCTTCGTGTA CCAAAACCTA TCAAAATTAC ATCTTAAAGT CATTTTTTGT TTTCACTGAC AGGACCATAT 001520
001521 TTCTCTTGCA GTTGTGTAGT TTGCAATGCC AGATGTAGGG ATGTCAATAT TTAATGGATA AAAATATATT TTATAAAGGA 001600
001601 GCCCACTGGG ATCACTTCTA AAGGCAAGAG GGGAATAACA GACAATGGGG CGTGCTTGAG GCTAGAGTGT GGGAGGAGAG 001680
001681 AGGGGATCCA AAAAATTAAC TATTGGGTAC TAGGCTTAGT ACGAGGGTGA CAAAATAATC TGTAGAACAA ACCCCCATGC 001760
001761 ACGAGTTTAC CTATATAACA CACCTGCACA TATTGCCCTG AACCTAAAAT AAAATTTTAA AAAAATAAAT ACTCTATAGT 001840
001841 AAAATCAAAG ATAACTATAA ACATAATATT TTGGTAAATG AGTTAAATTA CTGAATAGTC TAGTGTTTAG CAAAAATTTA 001920
001921 CAAAAACTAT ATTTTCCACT TTGACTTTAA GGATAATTTA GGAGATTACA TATGTTGCTT ATTACTGACA GCAAGAGAGA 002000
002001 TTATTATTTT GAGATTCTAA TCTTTAAGTA TTCTTATTTG TAGAAAAGTA TTAAAGGTAA TAATAAAATA AAATAATTTA 002080
002081 TTTAAAGGAA TTTTTAGTAC GTATTTGCAA AGTGAGGTAT CTCTTTACAA ACAGTTACTG TACTAAGTTG TGTATGTCTA 002160
002161 TGTTTACATT TGTAGGTCTT GTTTTTCTAA GGCTATAATG AGTATAGTTA ATCGATAATA TTACTTTTGC TTAAAAAGTA 002240
002241 AGTTAAGTCA TAAAGATATT TAAATGTTTG GAAAACATTC TTATGCATGT ATCTACACTG TTCTCTTTTC CTGTGCTCTT 002320
002321 CATTTCTTTA TGGAAATCTG AATTTTATAT GGATAGATTT TCAATGTCAA GTGTTGATCA AGAAACAAGT CACCAGATTA 002400
002401 TACTTTAAAT ACCACAGGTT CAATGTAAAT TTAATAAAAA CACTTGTATA TAAGATATAC GTATCTATTA TTAATGAATA 002480
002481 TATAAGGAAT AGAATTTAAT ATTAACAACA TTTAGAAATA AACAGCCTAA GTTAAGAATA GTGATGCTTG GGAGAGTGTG 002560
002561 AAAGGGAGTG GAACAGAATG TGGGAAGAAT TTTTAAAACT ATTTTGACTG TGTCTGTAAA ATTTGTAATT TTTATTAAAG 002640
002641 CAGATTTAGT TAAACATGTT AAATTTTAAT GTTATTTTTA ATTCTAGTTA GAATATACAT GAGTGTTTGT TATAATTTCA 002720
002721 CCCTCACTAC GGGGAATTAA AACTTTTCTC AAATTAACAT GAAAAAGAAC AAATGCACAC ATCTCAGCAG AATCAGAAAA 002800
002801 AAAACTAAAG ATAAAAGTAA GGTATCATCC TTAGGATCAA AAGCAACAGC TTTGCAAATG TGCACAGAAT GAGATATGCT 002880
002881 ATAATTATTC ATCACCTCTT TCTCACAATG TAACAACTAT TTAAGAAGAA TGGTTTAGTT ATTTTAAAAG AGTGTTTTCG 002960
002961 GGTTTTTTTT CTGCCCAAAA AAAAAAAGTA CTATTACCTT TTTTTCCTGG GTTTACTTGG TATATAAAAT TCTTTTTAAA 003040
003041 ATTTTATTTT TCCTTAAGTT ATTGGTATAC AGGTGATATT TGGTTACATG AGTAACTTCT TTAGTGGAGA TCTGTGAGAT 003120
003121 CCTGGTGCAC CCATCACCTG ACCGGTATAC ACTGAACCAT ACTTGTTGAC TTTTATGCCT CGCCCCCCCT CCCACTCTTC 003200
003201 CCCAAAGTAT ATTGTATCAT TCTTATGCCT TTGGGTCCTC ATAGCTTAAG TCTCATATGT TAGAGAGAAC ATACAATGTT 003280
003281 TGCTTTTCCA TTACTGAGTT ATTTCACTTA GAATAATAGT CTCTAATCTC ATCCAGGTCA TTGTAAATGC TGTTAGTTCA 003360
003361 TACCTTTTTA TGGCTGAGTA GTATTCCATC ATATATATAT ATCACAGTTT TTTTATCCAC TTGTTGATCA ATGGGCATTG 003440
003441 GAGTTGGTTC CACGATTTTG CTATTGTGAA TTGTGCTGCT ATAAACATGT GTGTGCAAGT ATCTTTTTTG AATTACGACT 003520
003521 AAAAGAATGG TTTAACTATT TAAAAAGAAT GATTATCACC AAATGAATTT GGTGATACCA TCAAATGAAT TTGGTGATAA 003600
003601 TACATGACAT GATAATTATG ATAATAATTT TAAAGCAGAA ATCAGTCAAT ATTTTCAGAG AGAGATTATA CAAATGCAAT 003680
003681 TTAGACTATA ACAGTTTAGA CTTATATCTT AGAGCATGTT AACATGTGGG GAAAATGATC TCTTGTGTGG TTATTTAGAA 003760
003761 AGGGAAACTT TTAAAGAAGA AAGCATAAAT GTGGGCATAT TCTCATCTTT TACTTAATGT AATTAGAGTG AACATATTGA 003840
003841 AGCCACATAC TAGAATTTTA T
[back to top]

Predicted Small Protein

Name NONHSAT100517_smProtein_3584:3721
Length 46
Molecular weight 5444.5701
Aromaticity 0.0888888888889
Instability index 43.9755555556
Isoelectric point 8.73785400391
Runs 8
Runs residual 0.0155303030303
Runs probability 0.0442170111289
Amino acid sequence MNLVIIHDMIIMIIILKQKSVNIFRERLYKCNLDYNSLDLYLRAC
Secondary structure LLEEEHHHHHHHHHHHHLLLHHHHHHHHHHHLLLLLHHHHHHHHL
PRMN LHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo i????????????????????????????????????????????