NONHSAT057030

From LncRNAWiki
Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT057030

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3220 nt

Genomic location

chr18+:4264602..4296000

Exon number

3

Exons

4264602..4264901,4275301..4275379,4293160..4296000
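
As a quick consistency check, the exon coordinates above can be summed and compared with the reported transcript length of 3220 nt. The sketch below assumes the coordinates are 1-based with inclusive ends; the function name `exon_lengths` is illustrative and not part of any LncRNAWiki or NONCODE tool.

```python
# Exon coordinates copied from the Exons field.
# Assumption: each start..end interval is inclusive at both ends.
exons = "4264602..4264901,4275301..4275379,4293160..4296000"


def exon_lengths(spec):
    """Return the length of each start..end interval in a comma-separated list."""
    lengths = []
    for interval in spec.split(","):
        start, end = (int(x) for x in interval.split(".."))
        lengths.append(end - start + 1)  # +1 because both ends are counted
    return lengths


lengths = exon_lengths(exons)
total = sum(lengths)
print(lengths, total)  # [300, 79, 2841] 3220
```

Under that assumption the three exons add up to 3220 nt, which agrees with the Length field.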

Genome context

Sequence
000001 GAGCCAAATC CCGACAGGAC TTGTTACTTT CACTTGCTCT TTTTTCCTCC CATTTTAGCA GCCAAGGATG GGAATTACAC 000080
000081 AGCGAAATGC AGGCTAATCA GCTAAATCTA GATTAAATAT GCTAGTCCTT GGGGACCTTG AGCTGAATTA ATGGAAACCG 000160
000161 AGGTTTTTAA TGTTCCAGAC TAATTTTGGA AGCTCACTTC CAGCAGAGGC AAACTTGAGG CTGTCCAAAG GGAGCAGATG 000240
000241 GCAAGGCTTT CTCCAGGTGC ACCAACCTTT GAAACAGCCC TCGGCAAAAC CATGAGGAAG GAATGCTCTT CCCTTAGAGA 000320
000321 TCCTCGTGGT GAATTCCCTC ATTTCCTTCA AGTCTGCTCA AATGTTACTT CTCAAACAGA GCAAGAAGAT CGAAGCAAAT 000400
000401 GAACAAAGGT GGGATCCGGT TCCAATAATG AAGAATGTGA TGACTACATG TATACCATTC TGGAACAGAG AAAAGACTGC 000480
000481 CTGCAGCAAT ATCTCTAAAA ACTACCTGTG GCAGACTCAG TGGATGGTCA AGTTTGGCTC TGAAACAAGT CTCCTCATTT 000560
000561 GTCAGAGAAA GGACAGAGAA GATCCAGTTT ATGTGATTAA TTCAATGATA TTAGTAGAGA TCATAGATAC AATGATCAAA 000640
000641 ATAAGAACAT TGCTTTTTGC AAAATCAGAT TAAACTGCAC TCCAAAACAA TACTGAATCA TGTAAAAGCA GAACTTCCTC 000720
000721 TCAGAGATTT CTGTATGCAA AGACAAAACA ATTAGGTCAC AGGATACCAT GTTCTTGAGA CTGTATGTCC TTACTAAGTT 000800
000801 TCATTTTCAT TCAGATATGT ACAAGACCAT TTGGTATTTG TAAAGAGGTT ATGTTTATCT GTGTTTTCCT GAATATAGAA 000880
000881 TGAATATCTT TATGACAAAA AACTTTAGCT CAGAAGAATC TTTATCGCAT TTAAAATAAT TAACAACTAT AGTGCATAAT 000960
000961 GTATAATTCA AGGGTTTTCT GTAGCCCATT TCCCATCTAC CCTGTCCCTG TCCCCATACT CCTTCCTCAC CTCCACCCAA 001040
001041 CCTCCACCAG AGATAATTAA GCTTTTAAGA ATGAGATTGA TCACTTAAAA GTAATATATT TTTACTTTAT TTCTTTAATC 001120
001121 CATGTTTAGA CTTTCCTACC TGCCAAAACT TTGTATCTTT TAAACTTGGT ATTCTACTTC TTTCTCCCAT CTCAAACTAT 001200
001201 ACGTAAATCA GTGTGTTAGA AATTCCAGTG AGTTTAATTG AACAGAAAGT TGGGGTATCT TGTATCCCCT CCTTTTTTTT 001280
001281 CTTTTGGGAC ATAACCTTTG TTTCCCACTT GTGATTCCAG AATCTATCTT GGTATGATTC AACACCTTTT ATCAGCTGCT 001360
001361 TCTAAACAAC TTTCACCACC ATATTGCCTC CTCATTCTTC CGAAAATACA CTATTTTCTA GTCAGGCCAA ACTATACATT 001440
001441 CTTTCCAGAG CTGACCACAC TTGTTATGTT ATATTCCCAG CTAAATGCCT TCTGTAACAT TTACTACTTA TCCTTCTGGA 001520
001521 TACAACCCCA ATGTCACCAC CTCCTTGAGA GCACCCCTGA TTCCACAGGG TTGAGAGTCA GGGTGGCCTG GGAAGACCAC 001600
001601 ACATGTCCTC TGCCATCATT CAGAGGTGGG TATAAGTGAA ATGGGAGAGT TCCCTGGTCC CCCTCACAGG ATGTGTGACA 001680
001681 GGGGTATGGC TCTCTTCTTG GCTGCTATGA GCTCAAAGCC CTTACAAGAG GTGGAGCATG CAGATGAGCA GGTGCAGGAA 001760
001761 CCAGAGCGAG AGCTTTTGGG GTCCCGCCCC ACAGCAGTGT CTAGGGGTGA GTTTCTGTGA TTCCCAAAAC CCAAGTGGGC 001840
001841 ACATGTTACA GTGTGCTCTT CTAGTGTTGC TGTCCAAGTG TTAACTAGCT TAGTGGACCC TCTGCCTTTT TGCAAGGGCA 001920
001921 GAGGGCCAAT GTGACAGCTT TCTGTATTCT GAGCTCTTAT CCAGCATCCA GGAAGAATCA GGTGACACAG GGACTTGAAG 002000
002001 GATGAATGTG GGGGTTTTAT TGAGTGGTGG AGGTGGAACT CAGCAGGATA GATGGGAAAC TGGACAGGGG ATGGAGTGGG 002080
002081 AAGATCATCT TCCCCTGGAG CTTGGCCATC CACTGGCTGA TTCTCTGACC ATCCCCAGCT GAACTCCTCC TAACAGTCCT 002160
002161 TCTCTTCTTT CCTCTGCTGC ATTGTTCTGC CATCCAGTTT GCTAATCTCC TTGTTTCCTG TCTCCTCATC TGCTCCTGGA 002240
002241 GCCTGGGGTT CGGAGTTTAT ACGGGTACAG GACAGGGTGT GTGTCAGGTC AAAAGGCAGC TTTTTGGGCA TGAAAACAGG 002320
002321 AATGCCCGTC CTGTATCCAG GCTAGAGGGT GGGGCTTTTG CTGGGGAACT GCCCTCTTCT ACCCAGTATT TCCCTGTCTC 002400
002401 CTTTCTGCAT CATAAAGATG TGTGGTACCC AGTTGGTGCA GACGCGAAAT TCATCTGTTT CTTTATGTTT TCTCCCTTTG 002480
002481 CTCTAAAAGG GTCCATATTT GCCTGCATTC TAGTGCTCAC CACACCACAT TGGGGTATTT GGTTATGTGT CTGGTTTCTT 002560
002561 CAGTAGCCTA ATGGTGTTCC CTGTGCAAGA TTCTTAGATG GGCCTCAATA AAATGGTTGA GTTGCACTGA ATTGTGTTTC 002640
002641 TAAAGTAGAT ATCATGTCTT ATTTCCACCA GGGCACAATG TCTAGCACTT AGAAGCTTAA AAAAAGATTG TTGAGATACT 002720
002721 AAATGAATAG ATTAACCAAA AGTTTAATTT TTCTAGCAGT GTTCATGGGT GGGAACAAAG CAGTAAAACA AACACATACA 002800
002801 AAAATAAATG GAGGTCAAGG ACCTATAATA CTAAATGAGC TAGCACCATT TCATCTGGCT GTGCTACTTC CTAGCTATGC 002880
002881 AACCTGAGAC AAATCACATA TACTCTCTGA TCTTCTATTT CATTATCTGT GAAGTGGACA CAAAAATAAT ACCTACCTCT 002960
002961 TAGCACTGTT TATAAAACCA AATAAAAGTA AGTAAAGTGC TTAGCAGAGT GTGTAAAACA CAAATTTGTG CTCGGTAAGT 003040
003041 GTTAGTTATT ATCATCTTAT TGTTTCCTGA AAGGCCTCCA GCTATGCTTG TTAAGAAAGG CTTTTATTAA GTGTATTTAA 003120
003121 AGATACTAAT GATTAAAGTA ATGAGAGTCA ACACAGTCAA TATTTATTGA GTGCTTACTT TGAGCCAGTC ATCTATTCTC 003200
003201 ACTGGATTAC ACCATTGCAA

Predicted Small Protein

Name NONHSAT057030_smProtein_1640:1819
Length 60
Molecular weight 6567.4063
Aromaticity 0.0508474576271
Instability index 51.7644067797
Isoelectric point 4.95098876953
Runs 9
Runs residual 0.0080858342404
Runs probability 0.0406583053642
Amino acid sequence MGEFPGPPHRMCDRGMALFLAAMSSKPLQEVEHADEQVQEPERELLGSRPTAVSRGEFL
Secondary structure LLLLLLLLLLLLHHHHHHHHHHHLLLLHHHHHHHHHHLLLHHHHHHLLLLLLLLLLEEL
PRMN -
PiMo -
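
Some of the values above can be cross-checked directly from the amino acid sequence. The sketch below assumes that aromaticity is the fraction of F, W and Y residues, and that the span 1640:1819 in the name is an inclusive coordinate range on the transcript; both are assumptions about how the record was generated, not documented facts. The helper name `aromaticity` is illustrative.

```python
# Amino acid sequence copied from the record above.
protein = "MGEFPGPPHRMCDRGMALFLAAMSSKPLQEVEHADEQVQEPERELLGSRPTAVSRGEFL"


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


n_residues = len(protein)
arom = aromaticity(protein)

# Assumption: 1640:1819 is inclusive at both ends, giving the ORF span in nt.
orf_nt = 1819 - 1640 + 1
codons = orf_nt // 3

print(n_residues, codons, round(arom, 13))  # 59 60 0.0508474576271
```

Under these assumptions the sequence has 59 residues while the span covers 60 codons, which suggests that the reported Length of 60 counts the stop codon. The computed aromaticity agrees with the listed value.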