NONHSAT101244
NONHSAT101244 is a 2682 nt, two-exon intergenic lncRNA transcript on the minus strand of human chromosome 5, catalogued in NONCODE4.0.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT101244 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
2682 nt |
Genomic location |
chr5-:42985708..42993563 |
Exon number |
2 |
Exons |
42985708..42985766,42990941..42993563 |
Genome context |
|
Sequence |
000001 AAAGAAACTT GAGGCAGGGC TAACTCTGCG TTTGTTATGA TTAGGCTACA GGAAACCCGG AGTTGCTGGG AAATGAGTCA 000080
000081 CAGGGTCACT CAGCCTTTCT GTGGCGAAAC ATCGACTTCC TACGACCTAG AAATCGCCGC CAGGGCTAGG CTGGTGTTTG 000160
000161 TCCTAAGTGG GTACTTGCTT GGGTTGAGAA ATCACATTTT GGCTATTCCC CGACTTGAAC CAGGGGATAG AAATCGTAGT 000240
000241 CACCGGCTGG GCGCAGCGGC TCATGCTGTA ATCCTGGCAG TTTGGGAGGC CGAGGCGGGC AGATTATTGA GGTTAGGAGT 000320
000321 TCGAGACCAG CCTGGTCAAT ATGATGAAAC ACCGCCTCTA GTAAAAATAC AAAAATTATC CGGGCGTGGT GGCGAGCGCC 000400
000401 TGTAATCCCA GCTACTCAGG AGGCTGAGGC AGTAGAATCC CTTGAACCTG GGGTGCGGAG GTTGCAGTGA GCCAAGATCG 000480
000481 CGCCACTGCA CTCCAGGCTG GGCGACACGG CGAGACTCTG TCTCAAAACA AAACAAAAGG AAAAGAAGAG AAAAGAAAAA 000560
000561 GAAACGCAGG CACCCTGAAA TCGAAACTTG TTAGGACCAA CAAGCCAACG ACAGACGAAT CTGCGGGGTC CTGCGACAGA 000640
000641 CACGCCAGCC CCGCCGCCAC AGCACAGGGT TCTGGAGGTC GAGCTGTTCT GCGGGTTCTG CGGCGGCGCT GGGAAGAGAC 000720
000721 GGCGCCCGGG CAGCTCCCCT GTCACCGGCT TGGAGGAGCG GCGGGCTCCC CTAGCCCAGC GCCGCCGCCG CCGCCCGGGA 000800
000801 ACGCCAGGCT CACCCCAGCT GGGGAAGCTC CGAATCCCTC AGATCGGCGA CCTGAGTGCT GTCGCCCCGA GAAAATGGAG 000880
000881 GCATCGAGCA ACGAGCTGTG CTGAGACCGA GAGACCTCAG TTCTGCGAAA TGTCGGTGCC TGGAGCTTGC GAATGACTGC 000960
000961 GGCCGGCGGG TTGTCAAGGA CAACATTCCT TATGGCGCAA CCAACAGCGC TGTCACCAAG AAACTGGACT CTGAGAAAAA 001040
001041 AGAGGGTTTC GGCCACCGAG AAACTCCGTG CCACATGTGC CGTGACAGCA AAACCGCCCG CTCCGCGCCG CTGGTGAAAG 001120
001121 AGCTAACGAA CGCCAGGTGC TGCGACAGTG ACACCGGGAC TAGAAGGCCT CGGATGGAGA ATGGGACGCC CAGAGCAAGC 001200
001201 AGGAAAACGT TTTGTTTGGC TCAACTAGCG CTGTCGCAAC CGGAAAACTG TCAGGCCAGT GCTGCTGCGG ATGTGAGAAA 001280
001281 CCGGTGATCC GAAGGCGGAG TGGCGCTGGG CGCTGGACTC GCTGGTGTGA AAATCTGGCT GCTCTAGGAC CCGGCAAGCG 001360
001361 CGGGCACCAG GCGAACGCCA GCGACTTTGC CAGACAAATG CATGTCTTTT TTGGTCGCCA TTCAAAGATG CAGCCGGCAG 001440
001441 TCAGCCCCGG CTCTGAGAAA CAAGTGGAGG TGTCAGCCAT AAACCCTCCG TTCAGGGAAA AGAGAGTCGC TGCGACGGAT 001520
001521 CGACTGGCCG TTGGGTGGAA CTTTCTTTGG TGGCCTAGTG ACTAGAAAAT AAAAACGAAA ACGAAATAAA AAACTGCTAC 001600
001601 TGACATTACT CCCACGCTCC CTCCTCAAGT CAGTGGCTTC AGGCTTAAGA AAACCTGTAT TTTTCTTGGC TGATCCGTGA 001680
001681 TCCCTAATTA CTGAGGGCCC CGCGACCCAC CAAGTGCGAT TCGCAGAGGG AAAGGAGCTG GAATAAGGAT CCAGAATGGG 001760
001761 ACGGATTTGG CTCCTCCAGG AAGAAAGCAA AACAAATGCA GGTCGCGAAC ACTTTACATT GCAGGAAGCT GCTCTTCCAG 001840
001841 TGGCCGCGGC TGGGTCCTTC CACACCACCT CTCTGTCCTA GGCCTTCGTC CCGGTTCAGC CCTCACGGAT TCTAGGCTAG 001920
001921 CGGTCAACAA CCGCATCCCT CCGGAACCTG GAAACTCTAG GTCCGAGATC TCGGTGAAGA ATGAAAGGCA CAGCGCCGCT 002000
002001 CAACCCAGGC TTCAGGCCTG CCCCGCTGGT CCCGTCCCTT CCCTGTGGGA TGGAACTCTG TGGGAATCGC TCCTTATTCT 002080
002081 GATGGTTCAC TCATTCAGAT GTCCTTGATA TTTGGGGATA TTTCTAAAAC AGCTGGGGAG AAAAATGGAC TGCGAGGCTC 002160
002161 CGGTGTTGGA CGTGAAATTG GCAGTGAGAC ATCCGCCTCT AAGCCCAGGG AAAAGCTTAA ATTGTGGTTC TCTGTCTCTA 002240
002241 GCACCGTGAT CTTTGAAATG CGTTATTGAT TTTCTTCTTT CTTCTGGGAA ATGCAATGCA GGTGAACTTG TAAGTGTTGA 002320
002321 ATCCTCCCTC TCCTCACTGT TCAATCAGAC TGAGGCCTAC TTCCTTTCTG ATAAGAAATA TGGAACTGCC ATTGCTCAGT 002400
002401 AGGAAAATGT CCTTTCTACA AGCATTCATA TTTCAATAAA AATCTGCACT TTGTGAGTTG AGCCCCCTCA AAGAGACCTA 002480
002481 CAGACCCTGC CATGAGCATC TCCATCCACA CTCATCATTT CCCATTTTCA CCACTTCTGC TCCTGTGCAA ACCAGTACTA 002560
002561 TCCTTTTCTG GGGGATCCTA GGCAGAAGCG ATGGCAGGAC AAAGGCAGAG AAGGCACTGC CAGAGGTCCT GCATGCCAGT 002640
002641 AAAACTACTC ATGTCAGCTG TTCTGCAGGA TCCCATGAGA AG |
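The Length and Exons fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the exon coordinates are 1-based and inclusive at both ends, which is an interpretation rather than something the record states, and the helper name `exon_lengths` is hypothetical. Under that assumption the two exons sum to the listed transcript length.

```python
def exon_lengths(exons: str) -> list[int]:
    """Return the length of each exon in a 'start..end,start..end' string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    lengths = []
    for block in exons.split(","):
        start, end = (int(x) for x in block.split(".."))
        lengths.append(end - start + 1)
    return lengths


# Values copied from the Exons and Length fields of this entry.
EXONS = "42985708..42985766,42990941..42993563"
LISTED_LENGTH = 2682

lengths = exon_lengths(EXONS)
transcript_length = sum(lengths)
print(lengths, transcript_length, transcript_length == LISTED_LENGTH)
```

The same check applies to any entry that lists both fields; a mismatch would suggest a different coordinate convention or a truncated exon list.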
Predicted Small Protein
Name | NONHSAT101244_smProtein_1397:1564 |
Length | 56 codons (55 residues plus the stop codon) |
Molecular weight | 6296.1195 Da |
Aromaticity | 0.127272727273 |
Instability index | 68.7672727273 |
Isoelectric point | 9.68792724609 |
Runs | 9 |
Runs residual | 0.0121212121212 |
Runs probability | 0.0485779897545 |
Amino acid sequence | MHVFFGRHSKMQPAVSPGSEKQVEVSAINPPFREKRVAATDRLAVGWNFLWWPSD |
Secondary structure | LEEEELLLLLLLLLLLLLLLEEEEEEELLLLLLHHHHHHHHHHHLLLLEEEELLL |
PRMN | - |
PiMo | - |
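The predicted small protein can be reproduced from the transcript sequence. The sketch below is a minimal illustration under stated assumptions: the suffix 1397:1564 in the protein name is read as 0-based, inclusive offsets into the transcript (nucleotides 1398 to 1565 in the 1-based numbering of the Sequence field), translation uses the standard genetic code, and aromaticity is taken as the fraction of F, W and Y residues. None of these conventions is stated in the record; they are inferred because they reproduce the listed values. The names `ORF`, `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Transcript nucleotides 1398..1565 (1-based), copied from the Sequence field.
ORF = (
    "ATGCATGTCTTTTTTGGTCGCCATTCAAAGATGCAGCCGGCAG"
    "TCAGCCCCGGCTCTGAGAAACAAGTGGAGGTGTCAGCCATAAACCCTCCG"
    "TTCAGGGAAAAGAGAGTCGCTGCGACGGAT"
    "CGACTGGCCGTTGGGTGGAACTTTCTTTGGTGGCCTAGTGACTAG"
)


def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


protein = translate(ORF).rstrip("*")  # drop the terminal stop symbol
aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(protein)
print(len(ORF) // 3, len(protein), aromaticity)
```

Under these assumptions the 168 nt span covers 56 codons, the last being the stop codon TAG, which would explain why the Length field reads 56 while the amino acid sequence has 55 residues.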