NONHSAT100801
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT100801 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
1535 nt |
Genomic location |
chr5+:27472391..27485935 |
Exon number |
5 |
Exons |
27472391..27472626,27475642..27475802,27476194..27476265,27477746..27477908,27485033..27485935 |
Sequence |
000001 GTGGTGACCT GGGAATCAAT GTGTGAGGTG GTGCTATGTA GCAGGAACCC CTCTTGCTTT GCAAATAGTT TTTTGTTTGT 000080
000081 TTTCCTTTTT GCCCAATAGA GCCCTGCTCT ACTGACCCTT CAATGTGCCC GTGTGCCTAA ATATTCCTGG TCGTGTGAAA 000160
000161 AGAACCCAGG TATTAGCTGA ACTAAGGAGC ACAATTCTGC AACATTTTGG CACCCAAACA CGGGGCTTGA GAAATGAATG 000240
000241 CAATTGGAGA AACTGGTTGT TTTACCAGGC GTTGATTGGA AATGTGTGCT TCCCTTTAAG CAGTCAAGCT CAACTTGCAG 000320
000321 AACTGATGGG AACCCCTTGG GAAAACTGGC CTCAAATGTT TGTCTACACA GTCCACATAC AGGGTTCTTA ACCTGCGACT 000400
000401 GACCCTGCTT GTGCCTGTGA ACCAACCAAC AATCTCTGGC TGCAGCTCAG AAAGGACAAA AGAGAATGGA TGAGGGAAAC 000480
000481 TTCCCAGGGC TTGTCTGGGT ATGCCCACAG TGGACTGGAG CCCAAAATGC ACACTGGAGG AAGTGGATGG AGCCACGTGG 000560
000561 ATGTCATGCC TTATGCAGGG GAGGAGCCTG CTCTCTTCAG CTCCTGTGGT AATGTGGGAA TCGATCTGTG AGGAAACACT 000640
000641 TTCTGAAAAT GTCAGAACTC TTGAAAACAT GGATGAGGAA AAGCATAAGT ACTGCTTTAT GCTTTTCTTA TGAATTTTAA 000720
000721 AAGATGTAAT TTAAAGGTAC TTTAAAGTGA TATTTGTGCC CTTGTAACTG ATTCAAAGTA GACTGCTAAG GACAATAAGT 000800
000801 AATAAGAGTC TACAAATATC AGTAAATTAT GAGAAAAAAC GGTCTAATGG TACTTAAAAT TGCATTGTGA TTTATTAAAT 000880
000881 AACATTGAAC AGCTCCACCT GTAACATTAA GATTCCATTT GAGCATATAG AAAATATATT TATCGCAACC TGAAGTGATT 000960
000961 TGGAAAGTAT GCTGGGAAGA AAATATATTT ACAAATTAGT CTCAGAGGGT AGGTATCCTG TAGCCAGATG GAATAAGAAG 001040
001041 AGAGGGAAGG GAGATTTTGT TCAAAGGGGG AGCATAAGCT AAGGAATAAA GCCACAAACA ATGATTAAAT TAGGTAAAGT 001120
001121 GCAATTAGCT GAATATTGCA TAAGCATACA TATGGGAAGA AGTGGAAAAA GGAGTGACAA TATGAAGCTA AAGGGTTTGA 001200
001201 AGTGTTCAAG ATAATACTAG GCTTCCTAGC CTATTCCGAG ACATCTTGTT TTAGAGTGCA ATAGATAAGA ACTCTAAGGA 001280
001281 TGAAAGTAGG AGACATGCAT TCTAGTCTAC GTTTTGGTTA ATGGACTGAA AGTGAAAGAT GGACTGAAAT TGGGGGATGG 001360
001361 AATAAAAATG TGATAGAATA GAAAATAGAA AAAGCAATTA GCCTAATGTC ATTGATACTT AGAGAAAATA TGGAAAATAG 001440
001441 TAAATTAGTG ATATATATCT TTTCTAATTC ACCAGGGTGA GTGAACAAAG GAAATCGGAT TTAAAAGTAT TCATGACATA 001520
001521 GATGCAATAA GGTTT |
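The exon coordinates and the transcript length listed under Basic Information can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based with inclusive ends, and the helper name `exon_lengths` is hypothetical, not part of any NONCODE tool.

```python
# Exon coordinates copied from the "Exons" field above.
# Assumption: coordinates are 1-based and both ends are inclusive.
EXONS = (
    "27472391..27472626,27475642..27475802,27476194..27476265,"
    "27477746..27477908,27485033..27485935"
)


def exon_lengths(exons):
    """Return the length in nucleotides of each exon in a 'start..end,...' string."""
    lengths = []
    for block in exons.split(","):
        start, end = (int(x) for x in block.split(".."))
        lengths.append(end - start + 1)  # inclusive ends, hence the +1
    return lengths


lengths = exon_lengths(EXONS)
total = sum(lengths)
print(len(lengths), "exons, spliced length:", total, "nt")
```

Under that assumption, the five exon lengths sum to the 1535 nt given in the Length field, which suggests the coordinates are indeed inclusive.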
Predicted Small Protein
Name | NONHSAT100801_smProtein_500:712 |
Length | 71 |
Molecular weight | 8108.3016 |
Aromaticity | 0.0857142857143 |
Instability index | 72.7942857143 |
Isoelectric point | 4.36846923828 |
Runs | 13 |
Runs residual | 0.0408769448373 |
Runs probability | 0.042395336513 |
Amino acid sequence | MPTVDWSPKCTLEEVDGATWMSCLMQGRSLLSSAPVVMWESICEETLSENVRTLENMDEEKHKYCFMLFL |
Secondary structure | LLLLLLLLLLLHHHLLLHHHHHHHHLLLLLLLLLLEEEEHHHHHHHHHHLHHHHHHLLHHHHHHEEEEEL |
PRMN | - |
PiMo | - |
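The predicted small protein can be reproduced from the transcript sequence. The sketch below is a rough check, not the pipeline used by the database: it assumes the `500:712` suffix in the name denotes 0-based inclusive positions (1-based positions 501 to 713 of the Sequence field), translates that open reading frame with the standard genetic code, and recomputes aromaticity as the fraction of F, W and Y residues. The names `ORF`, `CODON_TABLE` and `translate` are illustrative.

```python
# ORF copied from 1-based positions 501..713 of the Sequence field
# (assumed to correspond to "500:712" in the protein name), stop codon included.
ORF = (
    "ATGCCCACAGTGGACTGGAGCCCAAAATGCACACTGGAGGAAGTGGATGGAGCCACGTGG"
    "ATGTCATGCCTTATGCAGGGGAGGAGCCTGCTCTCTTCAGCTCCTGTGGTAATGTGGGAA"
    "TCGATCTGTGAGGAAACACTTTCTGAAAATGTCAGAACTCTTGAAAACATGGATGAGGAA"
    "AAGCATAAGTACTGCTTTATGCTTTTCTTATGA"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


protein = translate(ORF)
# Aromaticity taken here as the fraction of F, W and Y residues.
aromaticity = sum(protein.count(r) for r in "FWY") / len(protein)
print(protein)
print(len(protein), "residues, aromaticity", round(aromaticity, 4))
```

Under these assumptions the ORF spans 71 codons, of which the last is the TGA stop, so the translated peptide has 70 residues. That would explain why the Length field reads 71 while the listed amino acid sequence contains 70 letters, and the recomputed aromaticity (6 of 70 residues) agrees with the tabulated value.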