NONHSAT115180

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT115180

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5514 nt

Genomic location

chr6+:138141787..138147395

Exon number

2

Exons

138141787..138145737,138145833..138147395

Genome context

Sequence
000001 TATTTAATCA CTGCTGTTGA AGAGCACTGT GAGTTTGTCA TGTAAGAAAT CCCATGGGAA ATGATAACAG ACCTGACCAC 000080
000081 AGAAAAATGG TAAACTTCTG AACATTAAGA AGGGCACAGA CTAAAATACA AACATCTACA TAGAAAGATC TACAGTAAAA 000160
000161 AATAAGAGAT AATCATCTAA TATCTTCTGA AACATAGAGC TTATAAAAAT TGATTCGAAA AACATGAAGA TTTCAGAATA 000240
000241 GAAATGGACA AAGGACACAA GACAACTCAC AATACAGCAA ATATAAATAA ATAAAAAACA GATGGAAAAT GTTCATCTTG 000320
000321 TTACTTAACA AAGGCAAAAT TGAAACAGTA GTAGAGTAAT TTTTCTTACT ATCAAGTTAG CAAAAATAAT TTCTTTGCCT 000400
000401 AATCATGTTG ATAAGAGCAC AATGAAACAG GAAATCTCAT AAACTGCTTG CAGGAGTATA AAACTTTCTG GAAAGCAATG 000480
000481 TGCTGTTTCC CAAGAGGACT CTAAGGGTTT GTATCCTTTG ACCTAGTAAT TCTGCTTCCA AGAACTTATC ATCAGAAAGT 000560
000561 AATTTAAAAA TTCCAAAATA TGGACATCAT GCATAAAAGA TGTTCATTTC ATCATTGCTT ATTAAAAAAA AAACGGAACA 000640
000641 CCCTAAATGC CTACTATTCA TTTCAATATT TTTTGAGCAC CTAATATGTG TCAAACAATG AGCTAGAACA GACACCAAAC 000720
000721 ACAAACCAGT TATGATTCTT ATCCTTCCAA TACTGGGAGG AACTGCAGGT TGGGAGGAGA TATACATTGA ATTAAGTAAT 000800
000801 TATTACACTT GTTACAGGAA TGCATAGCAG AGGCACCTAA CCAGAGGGGG GTGACATTTA AACTAACTCC TGAAAACTAT 000880
000881 GTGAAAGTAA GCAGAAGGGG ATGGCAGATG AAATATTCTG ATTAATACTT CAAGAAGCAG AAACAATGGG AATAAAAGAA 000960
000961 TCTTGCCGTT TCAAGGAAGT ATCATAATGT GTTCCAGAAA AAGGAAACAA CATATTGCAA GTCCAGGAGG AAAAAAATTT 001040
001041 GACCTAAATT AAGAGTGTCC TCATATTTCA AAGTGGCGAT GAAGAGAGTC CAAGCATCTG TTGGGAAGAT GAGACCTAGC 001120
001121 AGGAAACACA GTGGAGTTGC TTTCCTCCTC TTCTTCTGAT TAATTTCACA GGTTTTTCAC ATTTTTAGTT TAAATGTCAC 001200
001201 CTACGCTGGT TAAGTCCCCT CTGCTATGAA CTCTTGTAGC AAATGTCATA ATTACTTGAT TAAATGCTTA TCTCCTCCAA 001280
001281 GACTGTAAGT TCTGCAAGGG TAAGAATTAT ATCTGCCTTG CAGATGGGTG TGGTGGCTCA CACCTGTAAT CCCAGCACTT 001360
001361 TGGGAGGCCG AGGATCACCC TAGGCCAGGA GTTCGAGACC AGCCTGGCCA ATGTGGTGAA ATCCCATCTC TACCAAAAGT 001440
001441 ACAAAAATTT AGTTGGACAT GGTGGTGCAC ATCTGTGGTC CCAGCTACTT GGGAGGCTGA TGTGAGAGGA TTTCTTGAGC 001520
001521 CTGGCAGGCG GAGGTTGTAG TAGTGAGCCA AGATTGCACC ACTGCACTCC AGCCTGGGTG ACAGAGCGAG ACTCTCTCTC 001600
001601 AAAAAAAAAA AAAAAAGAAT TTTATTTGTC TTGTGTTTGG TTTCTGTTCT AGATCAGTGT TTGACACATA TTAGGTGCTC 001680
001681 ATAATGGTGT TAGGGAGTGA CCCACCACCA GGAAGCACAT GGGAGGATGG CTATGTGGCA GTGTTGAGGG ACACTGAATC 001760
001761 CGAAGTGCTG GGTCTTGAAT TCATCAGTAA AAACATGAAG ACCTGGCCCA TCAGGAGGAC CCACAAGGGA ATGAGGAAAG 001840
001841 GTCATCGTGT TGGACACACA TCTATGGTCT GGGCCTGTCA GGCTCCAAGG GTTGTGGTGT GTGTGCACTT GTATCTCCAA 001920
001921 GAGAACGGGA GTGACATCCC CAAACACAAA TGAGGTTAAC TTTTCTAAGG AACATTGGTT AGTGGCCAGG CGTGGTGGTT 002000
002001 CACACCTATA ATCCCAGCAC TCTGGGAGGC CGTGGAGGGT GGATCACTTG AGGTCAGGAG TTTGAGACCA GCCTGATCAA 002080
002081 CATGGTGAAA CCCCATCTCT ACTAAAAATA CAAAAATTAG CCAGGCATGG TGACACATGC CTGTAATCCC AGCTACTTGC 002160
002161 GAGGCTGAGG CAGAAGAATC GCTTCAACCC AGGAGGCGGA GGTTGCAGTG AGCCGAGGTC ACACCACTGC ACTCCAGCCT 002240
002241 GGGAGACAGA GTGAGACTCT GTCTCAAAAC AAAACAAAAC AAAACAAAAA CCAAAAACAT TGGTTAGTGT AGCCAAGAAA 002320
002321 GATTTTTGAC TTAAAAAATA AATCTGTTAT CCATGCAGTG TGTTGATGAA AAAAAGAAGA TAGTATGTGT TACAAATAAG 002400
002401 ATAAATATTA CATAACCATT TGTAAGATTA TTTATGAAGA GTTTGTAATG ATGTGGGAAA GAGGTTATGA TATAAAGTTA 002480
002481 AGCAAAAGAA AAAGTAAATG CCAATTATAA AAATAGTATA TTGTTAGCTT TGCTTAAACA TATATACGCA CACACTATCC 002560
002561 GCAAAAAGCA GGAATATACT TCACTCTGTA AGTGGTGGTT ATCTCTGAGT GACATTTTAC AGGATGTTCC CCTGCTTTTT 002640
002641 TTTTACTGTA ATGTACATTC CAAATTATTA GATTAGTATA TTTACTTTTA CAAATGGTGA TTATACATTT TAATAGTTTC 002720
002721 AGAATGTTAA GATCTTAAGA GACTAAGCTT AGTTCTATAG CTAGCACATA GTAGCATTGC ATAAATATTA TTCAATCAAT 002800
002801 TAATGAAGTC ATCACTTCAT CGCAAGAGTG TACACAAAGT ATTAGAACTA TTAAAATGTT TGTGTGTGCA CCAGCCCCAT 002880
002881 CAATTCTGAA CAACGCCCTA TGCCCTATCT GTCATAATTA TATATTCTTA TTTTTGGGGT CCTCACTCCT TAAAGCAACA 002960
002961 TTGTTGTTTC CTTCATTGCT TCTCCCCAAG CACAGAAGCA AAAAAAAAAA AAGAGATCAT TTATTGTTTT AAAACCTTCA 003040
003041 TTTATTTTGG GATGTCACTA CCAATTTTTC AGGCCTTAGA CATGTAAGCA TGCATTTCTA CTTTTTTAGA TTTTGAATCA 003120
003121 TGTAGAAATC TAGCCAAATT AATTGAGAAT ACTCTATTGA ATAAATCAAC AACTTTACTT CCAGGGTCCT GTTAACCTAA 003200
003201 GCTAAAGATA AAAAAGCAGC AACAAAGTGC CACTGTTATC TGGTTAAAAC TGACAACTGT AAGATGTATG GATGATTTTC 003280
003281 TACAAGTTTA GAAACAGGAA ACAGTGCTTG CTGCACTTGC CACAGGGCAT ATAAACTGAG AAAGTTATGT AGAAATTCAA 003360
003361 GATCAGAGGA TGAGCGCAAG GATGATCTTT GAGAACAAAA CAGTGAACAG AGCCCTTTGT GGTGGCAAGC AGTGACGTGA 003440
003441 GGTTGTGTTT ATGTGAATAC AAACCCTCTC CTCCCAAGTC TGGTGACTAC AGTGTTTTCT TCCCACCTCG TTGTTAATTG 003520
003521 TGGAAATCTG GCTGTCACTA TTAACATATC AAACAGAGGT CTTGATATTA TCTGCAGCTG ATCATTTGTT TAGGTTTTAC 003600
003601 CCGCAGGAAA AGTCTGGTGG CAGCCAGTCC CAGGACAGCC ACGCATGACG CACAGAAGAG TCCTTATAGT TCCCCTGGAG 003680
003681 GGGGTGATGA TGCAGGCAAA TGCCACGCCT GATCCCTTTG GGTCTCCGTG GCAACAAGGC TGCTTGTCAG ACTAGATGAT 003760
003761 AATGAGCCAT GATCTGGAAG AGGCTTATTC CATAACAGGA GGGAGGTTTC AAAGGTAACA CGTCAACAGC AATGCCTCGG 003840
003841 CAACAGCTGT TTGTTATTGT AGTCGTCAAG GACAGAAAAA GGACAAGGCT GCCAGAAGAT GTTGCATATG GAACAGAAGA 003920
003921 TTCTTGCCCT GGAAGAAGGA ATAGAGGAGA TGATACCTCC CTTCTGAACA TCTCAGCACC CCATCAAGAG AACTTCAGGT 004000
004001 CTCTGCCATT TAGACGTGTC ATTTGTAAGT AAAATAAATC AGGGAAGTAA TAGGAGGAAA GCTAAGAATG AAAGGAATTC 004080
004081 ATCACCACTG CAAATTTGTC CCTGAAGCAA GGTGAGTCCC TGCCCATCTG GCTACACCAG CTCTTTTTTT TTTTTTTTTT 004160
004161 TTTTTTTTTT TTTGGCCCTT ACTAGACCCC ACTTGGGATT TTGTTGGCAT ATTTGCTTGG ATGTCTATCT CCTCTGTAGT 004240
004241 TCCTGAGGTC AAGATTACAG TCTCTTTTTT TTTTTTTTTA GTTTTTGTAT CTATGGTGCC TGGCAAAGAA AAGAAGCTCC 004320
004321 ATCAATATTT ACCAGTTGCT GGTGGCAATG GTAATGGTAG TTGTGTAGAA GGTGGGAAGG ATAGAAGTAT AGCTAAGTGA 004400
004401 CGGGACATTC TATTTCTACA CCTTGTTTAG CTATGGAGGT CTCTGGAACT AGCCAGGCTA GCCAAGACTG GGACTTCTTG 004480
004481 TGTCTTCTCC AAGAGAGGCA GAATGAGCCT AATGGTTTAG AACGTGGGCT CTGGAGCCAG ACAAGCCAGA TTAGAATCCT 004560
004561 GACTCTACTC CAAGCAGTGT GAGCTTTGGC AAGTTACATA CCCCATCTGT GCCTCTGTGA GGATGAAACG AGTTCCTTAG 004640
004641 AATGATGCCT GGTACCCCAG AGGAGCTATT CAGGTTCAAT TGACTCTGTT TCTAGTATTA CTGCTCAACT TAGGAAGAAA 004720
004721 AGAAGTGCAG AAGAGGAGAG GCTTTTGCTT CAAATTTTGA GTTTCACATT TAACCTTGCA GTTTTATCCC TGGGGAGCTT 004800
004801 CTGCCTGGAT GGAGGATGAA TCTGGTATCT AAGCTCCTCC TTAGGGCTGT CATTTAGACC AGTGCCAATC CCTAGGAGTG 004880
004881 CTTCTGATTT CCTGCTTATC TCCTACAAAT GTACAGTTAA GTCAATTCAA AAGAAAAGAT AAATGGTCCA AACAAATCCA 004960
004961 TTGTAGCATA CTGTAATTCA CTGTCCATGG GGCAATTCAT TTGCCCTTGA ATTATTATTA TCTTCCTTAC CTGATGTGCT 005040
005041 TGACACTTAT GATCTTGAAC CATGCTGGGC ACCATGGCAA ATTTTGATAT TTGATATCAA ATATCAAAAT ATTTGATATT 005120
005121 TTTTCCTTGA TGCTTTTGAA GACACCTCCT CCATAACCAC GATCAGCGAC CAACACTCAA AAGGCTGAGG GGAACTGAAG 005200
005201 CCAAGTTAGG GAAACAATGT AAATGCCTGG CTGGTTAACA CGTGGATATC CAAAGAGTTG TTGGGTCTTC TCACCTCTAC 005280
005281 CCTCATCTGC CGTGAGCACA CAGTGCTCCA CGGATATATT CAAACTTGAA TCAGGCCAGG CACGGTGGCT CATGCCTGTA 005360
005361 ATCCCAGCAC TTTGGGAGCC CAAGGCGGGC GGATCACAAG GACAGGAGAT TGAGACCATC CTGGCTAACA TGGTGAAACC 005440
005441 CGGTCTCTAC TAAAAATACA AAAAATTAGC CCCGCATGGT GGCAGGTGCC TGTAGTCCCA GCTACTCGGG AGGC
[back to top]

Predicted Small Protein

Name NONHSAT115180_smProtein_5222:5452
Length 77
Molecular weight 8340.6079
Aromaticity 0.105263157895
Instability index 76.8671052632
Isoelectric point 7.77142333984
Runs 11
Runs residual 0.00332728372656
Runs probability 0.0415281591752
Amino acid sequence MPGWLTRGYPKSCWVFSPLPSSAVSTQCSTDIFKLESGQARWLMPVIPALWEPKAGGSQG
QEIETILANMVKPGLY
Secondary structure LLLEELLLLLLLEEEELLLLLLLLLLLLLLLEEEELLLLEEEELLLLLLLLLLLLLLLLH
HHHHHHHHHHHLLLLL
PRMN -
PiMo -