NONHSAT104509

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104509

Source

NONCODE4.0

Same with

-

Classification

intergenic

Length

2545 nt

Genomic location

chr5+:148809849..148812397

Exon number

1

Exons

148809849..148812397
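
As a quick consistency check, the single exon's span can be compared with the reported transcript length. The sketch below assumes the coordinates are 1-based and inclusive at both ends, which this page does not state; the helper name `span_length` is hypothetical. Under that assumption the span is slightly longer than the reported 2545 nt, which may reflect a different coordinate convention or annotation version rather than an error in either field.

```python
# Assumption: genomic coordinates are 1-based and inclusive on both ends.
def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive start..end interval."""
    return end - start + 1

exon_start, exon_end = 148809849, 148812397  # from the "Exons" field
reported_length = 2545                        # from the "Length" field

span = span_length(exon_start, exon_end)
difference = span - reported_length
print(span, difference)
```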

Genome context

Sequence
000001 TAAAGAAGGC CAAGGCTCAG GGGTTAAGTG ACTTACTGGA GGTTATCAGA GAGGAAGTTG CCAAACCCAG GCTAGGAACT 000080
000081 GAATGGTTGG TTTAAATCCA CCCCTCTGGC AGGAGACTGG GGAATACACA TGAGCCGTGC AGACAGCAGA GGGCAGTCCT 000160
000161 GGGGGTGGGG GCGCCAGAGG GTTTCCGGTA CTTTTCAGGG CAATTGAAGT TCCGGTCACT ACTCCCCCCC AGAGCAATAA 000240
000241 GCCACATCCG GCGACGTGTG GCACCCCACC CTGGCTGCTA CAGATGGGGC TGGATGCAGA AGAGAACTCC AGCTGGTCCT 000320
000321 TAGGGACACG GCGGCCTTGG CGCTGAAGGC CACTCGCTCC CACCTTGTCC TCACGGTCCA GTTTTCCCAG GAATCCCTTA 000400
000401 GATGCTAAGA TGGGGATTCC TGGAAATACT GTTCTTGAGG TCATGGTTTC ACAGCTGGAT TTGCCTCCTT CCCACCCCAC 000480
000481 AGTTGCCCCC CAATGGGGCC TCGGCTGGCT CACAGGATGA GGGTTCAAGA AGAAGGCTGT CCCTGGAGGT AAGAGGGCTT 000560
000561 ATGAACCATG TTCCAAACCT TTGCGTTGCT TTTCTTTCCA TCGTGTCTAT TTCATAACAT CCCTGTGAGG CTGGATGTGG 000640
000641 GAACTTCAGC ACTGCCGTAC TCTTGGGAAA TTTGTCCAAG GCCACCCGGC TGAGCAGCGG TTGAACCAGG ACACCATCAG 000720
000721 GCATGCGTTT CTTGTCTCCA CCACACCCTC AACCCACTTC CCAACGCGCC TTGCGACAGG GGCTGCGGTA TTGCATCCAC 000800
000801 ATGACTGATA AACTAGTAAA CACACATGAA TTCATTTTAA AAGTGTATTC AATCAGTTAG GTAAACTAAA AACCTTAAGT 000880
000881 CTTCGTTCGA TTTGGAATGC AGCCAGAGAA CAAATGGAAA ATTTTTCAAG GTAGAGAGGA TGAAAACTCA GAACGCCCTC 000960
000961 TTGTGGCATC TCTACCCACC CTAGGAACAC TATGGCTCTT CCCCTACACA TGGTGATTGC TAACCTTGCT ACAAGACGTT 001040
001041 GGACACACAC ACACACACAC ACACACACAC ACACTGAGGT TCCTTTTGCC CCCTCACTTT TGAGCCAGTG ACTACTGAAA 001120
001121 CCCTCTCCAT TGTTGCACCA CCAGCAATGC CCCCATCACT TCCTCTCATT TACTTCCACA GGCTGGTTCA TCCTCAAAGC 001200
001201 CCTCCTTACG TAGATCTGTG GGATCAGTGA GGCTCAGAGA GGTAAAGTGG CCAGCCCAAG GTCGCCCAGA CAGCAAAAGG 001280
001281 CAGGGCCAGC GCTGATTTCA AGTCCAATGG CCTATGGCAA TTTCTTAGCC AAAAGCAAAA TCTACAAAAA TAAAAAGTCA 001360
001361 GGCACAGTGG TGAGTGCCTA CAGTCCCAGC TACTGAGGAG GCCGACGGGG GAGGACCACT TGAGCTTGGG AGTTCCTGGC 001440
001441 TGCAGAGAGC TATGATTGTG CCTGTGAATA GCCAATGCAC TCCAGCCTGA GCAAGATAGG GAGACCCTGT CTCTAAAAAA 001520
001521 TACCTAAATA ATTTTAAAAG TCAGCCTCTC TGACTGCCTA TAGAGAATGC TAACTAACTG AATGACAGAA GACCTAATGT 001600
001601 AATCCAGGTG CAAAATCAGA ACTTTCCGGC CGGGCGTGGT GGCTCACGCC TGTGGTCCCA GCACTTTGGG AGGCCCAGGC 001680
001681 GGGTGGATCG CGAGGTCAGG AGTTCGGGAC CGGCCTGCCC GGCATGGCGG GGCCCCGTCT CTACTAAAAA TACAAAAAGT 001760
001761 TGGCTGGGCG TGGTGGTGGC CACCTGTGAT CTCAGCTGCT CGGGAGGCTG GGGCGGGAGA GTTGCTTGGA CCCGGGAGGC 001840
001841 GGAGGTTGCA GTGAGCTGGG ATCGCGCCAC TGCACTCCAG CGTGGGGGAC AAAAGCGAAA CTCTGTCTCA AAAAAAAAAA 001920
001921 AATTAGAACT TTCCCCGTAC TCTTGCTAGG GCTTTTCATG GAGATGTAGA AATGGTAGTA AGTGCCAAGG CCCCAGAACC 002000
002001 CTCATGTTTG GGTCCGACTC CCACATTGCC AGAGACTAGG CAGCTCACAC AGGTGTCCCA AGCTGTCTTT CTCACAGGCC 002080
002081 GCATTGAAGG CATTTATGAA ATGAGACCCC CTCTTCCTCA TCCGTAGTGA CAGGGCTGGC CTGACCGTGG AGAAATCAGA 002160
002161 TTTGATCATG ATAGCTTCCA CTCCGTGACT GGTCACGATC TGCCAGGCGC TGTGCTAAAC ACTTTCATAG GCCTTGTCCC 002240
002241 CCTCCATCCT GATAACACCC CAGTGAAGCA GGCCCTGCTC TTCACTCCAC TCCACAAAGC CAAGGCTGAG AGAGGTTAAC 002320
002321 AGACTTACCT AAGGTCACAC AGCTAAGAAG TCGTGAAGCT GGGATCGAAT CAAGTCTGTG TTTATGGGTT TAACTATCAT 002400
002401 GTTGTGGCCC CAACCCCATT TGACCTAGCC TGGCTGGTGG GCCTTCTTGC AGTAGCTTCC CCTGGAGAAG AGGAAAAGCA 002480
002481 AACCTTCATT GAGACCCAAG CGGTCTCTCC TGTGCTCTGT GACAATAATA AAGTTCCAGC CCTTG
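
The sequence above is printed in numbered blocks of ten bases, with a position counter at the start (and usually the end) of each line. A minimal sketch of how such a listing could be turned back into a contiguous string follows; the function name `parse_numbered_sequence` is hypothetical, and it assumes the only letters present are nucleotide codes (A, C, G, T, N).

```python
import re

def parse_numbered_sequence(block: str) -> str:
    """Drop position counters and spacing, returning the bare sequence."""
    parts = []
    for line in block.splitlines():
        # Keep only runs of nucleotide letters; the numeric counters never match.
        parts.extend(re.findall(r"[ACGTN]+", line, flags=re.IGNORECASE))
    return "".join(parts).upper()

# First and last lines of the listing, copied from this page.
first_line = ("000001 TAAAGAAGGC CAAGGCTCAG GGGTTAAGTG ACTTACTGGA "
              "GGTTATCAGA GAGGAAGTTG CCAAACCCAG GCTAGGAACT 000080")
last_line = ("002481 AACCTTCATT GAGACCCAAG CGGTCTCTCC TGTGCTCTGT "
             "GACAATAATA AAGTTCCAGC CCTTG")

first_seq = parse_numbered_sequence(first_line)
last_seq = parse_numbered_sequence(last_line)
# The last line starts at position 2481, so the total is 2480 + its length.
total_length = 2480 + len(last_seq)
print(len(first_seq), len(last_seq), total_length)
```

Applied to the final line, this gives a total that agrees with the reported length of 2545 nt.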

Predicted Small Protein

Name NONHSAT104509_smProtein_401:616
Length 72
Molecular weight 8022.3178
Aromaticity 0.0985915492958
Instability index 67.2887323944
Isoelectric point 9.00494384766
Runs 9
Runs residual 0.0217939214233
Runs probability 0.03293489568
Amino acid sequence MLRWGFLEILFLRSWFHSWICLLPTPQLPPNGASAGSQDEGSRRRLSLEVRGLMNHVPNL
CVAFLSIVSIS
Secondary structure LLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLEEEEHHHHHHHHLLLLH
HHHEEEEEEEL
PRMN -
PiMo -
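
The aromaticity value in the table can be reproduced from the listed amino acid sequence, assuming the usual definition (the fraction of residues that are Phe, Trp, or Tyr); the page does not say which tool or definition was used, so treat this as an illustrative sketch. The helper name `aromaticity` is hypothetical.

```python
# Amino acid sequence as listed above, with the line wrap joined.
protein = (
    "MLRWGFLEILFLRSWFHSWICLLPTPQLPPNGASAGSQDEGSRRRLSLEVRGLMNHVPNL"
    "CVAFLSIVSIS"
)

def aromaticity(seq: str) -> float:
    """Fraction of residues that are F, W, or Y (assumed definition)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)

residue_count = len(protein)
value = aromaticity(protein)
print(residue_count, value)
```

The listed sequence has 71 residues, while the Length field reads 72. One possible reading, not confirmed by the page, is that the length counts codons in the predicted open reading frame, including the stop codon.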