NONHSAT100939
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT100939 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
2006 nt |
Genomic location |
chr5+:32947549..32962573 |
Exon number |
6 |
Exons |
32947549..32947927,32949049..32949206,32950222..32950389,32958720..32958803,32961247..32961365,32961474..32962573 |
Genome context |
|
Sequence |
000001 GCGAGACTCC ATCTGAATAA ACAAAAAAAA ACCCCAAAAA ACAAAAAAAC AATGTCATGG TATTAGTTAA GGCAAGTGAT 000080
000081 ATTCGCTACC ATAACAAAAA TCTCTAAAAA TCTTATGGCT TAATATAATT AGAATGTACT ATTTGCCCCG GAGAATTCAG 000160 000161 ACAGATGTTT GTTTCTGCTC TAGTCACTTT CCATATAGTC ATTCCAAGTC CAGTCTTCTT CCATCCTGTG GCTCCGACTT 000240 000241 CTTCCAAGAC CTTAAAATCC TTTCTCCTTC CAGCAGATGC AGAAGGAGAG AATGTGTGGA GGATTATGTA GAAGGATTTC 000320 000321 ACGAGCCTGG CCTGGAGAGG ACACATCACT TCCAACTCCA TTCCATTACC AGAGCTTAGT CTATTTCCAG AAGATTATAA 000400 000401 GCTTCATGGG GGTAGATGCT TTACTTCATT CATCACTGCA TCTCCAACGC CTAGCACAGA GTCCTGAATT ACAGGCTCAA 000480 000481 TAACTATTTA CTGAATGTGA AAACAGAGCT GCTTGAAGAT ATATGAGGCT AATTGAGCTT GTGCATTTGT TGGCTATTGC 000560 000561 TGCTACAGTG GATGCTGTTG TTCAGTATTC AGATATCAAG AAACAACTAG CCAAAGTAAG GAGGAACGAG GGGCCCATGA 000640 000641 AGCTCAGTAG AGCTTGTGAT AATATTTGCA AGAGAAGACT ACCTGCCAAA CTAACTTCAG AAAAGGTGTT TCTGGTTGAA 000720 000721 ATCGTCGTGC CCAACATTCC ACCAGGTTCC TCACATGTTG AATACATCAA AGCCTACTCC ACCCATCTGG AATGGACAAG 000800 000801 CAGAAATGAC TCACTGTGTT TTCCAATTGC ACTCAAGACC TGTCTCCATA TGCCCTCCTC CTCTGCTTGA GCAGAAGGTG 000880 000881 AAAGAGGTTG TGTTGACTGT GGAATTTGGC CATTGAATCT TGAAGACAAA TGCTCTTTGA CTTCACTACA ACATCAGAGG 000960 000961 GCACTTGGCC AAGGAAGACT AGTTAGAACC AGCCTTTTTG CAAACACAGC TGAGGTTTTT ATCCTGTCTT TTCTTCATCA 001040 001041 CAAAAGTAGC ATGGAGAATA ATGGACAATA GAGTTCAAGT GGAGCAAGAA CTGTCCAGAA TTTGGGGAAG AAGTCACATA 001120 001121 TCAGTCCTGG CCATGATAAC TGGGGCACAT GTACTATATC CAAGTTGATA TCCTTAATTT GGGGAAAGTT TTCTTGCCTT 001200 001201 TCCCACAGCC TCCCAAGCCA TTGACAATTC CCACATTCAA AACATTCTCA TTCACATTTC TGATTCACTT TGGGCTACAG 001280 001281 GAACAACAGA TGTTGAGTAC TTCTAGGGAG CAAATCCATT ATAGATGGAG GTAGGTCTTG CAAACAGAGA GGTAGAGAGC 001360 001361 TTTCTCGAGA CCTTTGGACC AAGCAGATAA CATCACAAGG TGTACCTTAT GCTAAGTGCC TGATCTCTAA GACCCTCCAC 001440 001441 CAGCGTCAGA GAGACAGCAT GCTCACAACA TGACAGCTGG TCATACAGGT GAAGAAGAAC ATGAAGACCA GCCCACCCCT 001520 001521 GAAAATGACA GATCAAACTC AGCTTCACAA TGTCTCTTAT TCCATCGCTA ATGGTGTTTA AAAACACACT TTGTCTTAAG 001600 001601 GAACAAGGGG AACTGACCAC GTGTTCATGA CGGTTCATTT CTTACCCTCT GCTTTGGCTT GACCAAACTT TAGTCAGGTA 001680 001681 TCTCTCCTCC ACAAAGATTC CTAGACTTTG GCTGTCCCCC AAGTTTAAGC AAGCACTGAA ACACTAAGGA GCAGAACATC 001760 001761 TGTTAACAGC TCATCCTGAA AATCAACTGA CCATAGGAAA AAACACTCCC TATTGAGATA TCCTGGTTTT GCCACCTGCT 001840 001841 CGCCCACACT CTATGCCCTT CTCCTCCACA AAGGTCTGGC TGGTTCTTCT CCCTCCATAC AAAGGAAAAG CTTATTTCTG 001920 001921 TTTGATTTTG AGACATTTGC AGATATCTGA GGTTGGCATG TTCTTCCTAT TGCAATAGTC TTTTTGAGTA AAGTCTCTCT 002000 002001 TTATCT |
Predicted Small Protein
Name | NONHSAT100939_smProtein_1148:1333 |
Length | 62 |
Molecular weight | 7389.5976 |
Aromaticity | 0.180327868852 |
Instability index | 70.5540983607 |
Isoelectric point | 9.69488525391 |
Runs | 12 |
Runs residual | 0.0439042773695 |
Runs probability | 0.0244435612083 |
Amino acid sequence | MYYIQVDILNLGKVFLPFPQPPKPLTIPTFKTFSFTFLIHFGLQEQQMLSTSREQIHYRW R |
Secondary structure | LEEEEEELLLLLLEELLLLLLLLLLLLLLLLLEEEEEEEELLLLHHHHHHLLHHHHEEEE L |
PRMN | - |
PiMo | - |