NONHSAT146520
Basic Information
Transcript ID | NONHSAT146520 |
Source | NONCODE4.0 |
Same with | - |
Classification | sense |
Length | 3114 nt |
Genomic location | chr17-:19619037..19622150 |
Exon number | 1 |
Exons | 19619037..19622150 |
Genome context | - |
Sequence |
000001 AGGAGACTAT GGAGGACTCA CGGAGCTCTG ACACCCCTCA CCTGGAGGAG GTGGCTTCCC CCCACCCCCC ACTCTGCCTG 000080
000081 CACCACCTTT GCCTGGGGCG GCCAATGCAT GGCAGTGGAG TCAGTAATCG CCCTTGCTCC TCCCCAGGGT CAAACCCACT 000160
000161 GGGGTCTGGG GATTGAAATT CTGAGAGGAG TGTAGGAGGA GAGTGTCTTC CCGAAGAAAA TGTAGGAATG CATTGGCTTC 000240
000241 TATCACAAAA ATCCTGGGAA TGTTCCTGAG AAGGGATTTT GGAGATGCTC GGCCTAGGAC TGGCCCTCGG GAAGATGACC 000320
000321 CGACCTTTGT CCTGTGCAGC TGAGTGCTGA ACACTGGTCA GCTCTGGATA AAGCCTGATG TCACCGAGCT GATGGCGGGG 000400
000401 CGTCTGCACT GAACACCTTC CAGCTTGGTG GGGGCACGGG AACCAGCTGT CCTCATTAAA AGTGACCTGG AGTGAGATGG 000480
000481 attcttctgc ctatacctcc acgtgtgggc tccctgaaca cccccatcgc catgggctcc caagaagtat tgcttctggc 000560
000561 tgcagggctc actgcaatgc tgcgacagga ggtgacaggc caaggccagc aggtgactgc tctccccaga ggccctgtca 000640
000641 gctggaagca gctggtttca cataacagAG ACCCCAGGCC CAGGTCCCCT GGATGCAGGA GGATTTGCGG CTGCCCTTTT 000720
000721 GGAAGAGGTA GGCTGACCAT GAGAACTGGC CAAAGGTGAG AAGGACGTGG AGAAGGTGTT GGAGGAGGGA AGTCAGACAT 000800
000801 ACTAGCAATA GCCTGGCCTG CAGGGCTGTA GCTTTCACCT GTACTTCCTC CTGTTTGGTG TCATCATCAC AATCACGATG 000880
000881 GTGATAAAAT GTCTTCTTTC CTCTAGGCAG GTGGTAAAGG TACTATTTGG TGCACAGGTT GGAGAGTGCT GTGATAAGAA 000960
000961 CCATCAGAAC AAGGGCAGGA ATGTGACCTC CCAGGTGATC CTGGGAGGAT CCAGTGGAGC GTTTCAGAAT TCTGTGTCAA 001040
001041 AAAAGAAGGC TGCGTGTTGA CTGGGGTGGG GCTGTGCCTG TGTTTGGGGC CGCCCATGGG TGTAGCAAAC CCTTTTTTTT 001120
001121 TTTTTGTATT TGTCCTGCCT CCTCTCCTTC AGGGTCTGCT CCTTCCTCCT CGCGGCTCTG GGGGCTGCAT CTCGGAATCC 001200
001201 TGCCGGCCAC TCCTCCGCTG CAGGGGTGGA CATGGGAACC AAGTGGGGCC TTTGGGATGT CTTCAGCCCT CCCACAGCCA 001280
001281 CTGGCCAGCC TGCCTGCTGT CTTCCTGCCC CCCTCGCTCA CTCCTCCTTC TCCTGGGCTG AACCACACCC AAAAGCTTGA 001360
001361 TCCAGGACTG GGCTAGGTGA CCGGTGCAAT CCTTAGATGC CTCTCTTGGT TCCCACTGCC CTGCTGGTCT TGAGGCAGGT 001440
001441 TGAGGGGGGA GTCACAGGCC ATCCCTGCAA GGAAAAGCAG AGAGGGCAAT TTGGGAAGGT GACAGAGGAC CCTGTGGGAC 001520
001521 GTCGAGTCAC AGCCTTGCTG GGCACAGTGA CTGGCGTGCA TAGTGAGCTG CTTTCCCAAG CACTGCCACC CCTGGCTCTC 001600
001601 AGATATGCTG CTGAAAGGGG TGACGGGCAG TGGGGGGCTT GGGACTCCGC CGAGGTGTCC TAGCTCCCAT CCCTATCCCC 001680
001681 AGGCCCAGGC CCAGGCCCCT CCAGCATGAA AGGCTAGGGA CCGCACTGCC ATGGGGTTTG CCAGGCCCCT GTCTCTTGGG 001760
001761 AAGCCACCCC TCCCAGGGAA aagcagggct gagggctgtt caggctctgc caccacctgc tctgtgactt cgggcaggcc 001840
001841 accaattctt ctgaAAAACG GGAGTCACTG TGGGCACCAT CACGCCCGGG TACAGGACGC CCAGTGGCTC AGCTGCCCCA 001920
001921 GCCAGGCTGC CCTGTCCTGG CCTCAGCACA CCTCCCACAG CCACTGCAGG CAGCCCTGGA CCTTCTCCTG CTGAGGCCTT 002000
002001 ACTGCCAATC CCTCTGGGAG ACCAGACACA ACAGGCAGCG GTCTGGGTGA CTCCCAGGAA CCTGTCCTCC CTAGTGGCTG 002080
002081 AGGCAGTAGG CTTGGCAGCA GGGACTCAGT GCCCTAGGAG GCAGGGCTCA TCCCACAAGT TGCCATGGTA GCCCCTCCTC 002160
002161 CTGCCCCTTA CTCATCAGCC AGGCTGTACC GACACGGGCT GCCCAGGCAG GGGGTGGCTC GCTGGACAAA GCTGGCCGTG 002240
002241 CAGGCGCTCA GGCGTGCAGG GTAGCCAGTG CCCCGGCCAG GAATGGACAG CCTCCAGGAC ACAGTGGCCC TGGACCATGG 002320
002321 GGGCTGCTGC CCTGCCCTCA GCAGGCTGGT TCCCAGAGGC TTTGGGACTG AGATGTGGAC TCTCTTTGCC CTTTCTGGAC 002400
002401 CCCTGGTAAG GAGGAGGAGC TGGGGGGCAG GTAGGGAGTG GAGGAAGCTG AGAGGCTGAG GGTGTCTGCC TTGCAGGCAG 002480
002481 GATCCAGAAA TGTGCTCACA TTTTCCCCAG AAACTGCCCA TGTTGACTGC CAGGTCCAGA GTCTGGGGAC AGCAGGATGG 002560
002561 CTCCTGAAAG CTGTCTCTGG ACAATCCCCA GTGGCAGTGG GGCCTGACTT TGCTCAGCAG CAGGCAGCCC CCACACAGAA 002640
002641 GGGGTGGGTG TGCCTCAGCT GGAAACCCAC AAAAGCCTGA GTTGGAAGAT GGCTCCAGGA CAAATGGGAC CCAGAGATGC 002720
002721 ATGAAGCCTG CAGCACCTTC GCAGGGGGAA GAAGGGGCCG GCTACCCATT TCATTTCCAA AGGGGAAATT GTTTTAAATA 002800
002801 CATTTATTTT TTCTGATCAT GAAGAGGAAT Aggctgggcg tgctgcctca cgcctgttat ctcagcactt tgggaggccg 002880
002881 agcagggagg atcacctgag gtcaggagtt ggagaccagc ccggccaaca tggagaaccc cccgactcta ctaaaaatac 002960
002961 aaaaattaac caggcatggt ggcgggcgcc tgtaattcca gctacttggg agagagaggc agagaatcgc ttgaacccgg 003040
003041 gacgggtagg ttgcagtgag ccgagatcgc gccactgcac tccagcctgg gcaacaaggg cgaaactccg tctc |
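The table values above can be cross-checked with a short script. The sketch below is illustrative only and not part of the database record. It assumes the genomic coordinates are inclusive at both ends, and that the six-digit tokens in the sequence listing are position counters rather than sequence. The helper names `span_length` and `strip_counters` are hypothetical.

```python
import re

def span_length(start: int, end: int) -> int:
    """Length of a span, assuming start and end are both inclusive."""
    return end - start + 1

def strip_counters(listing: str) -> str:
    """Remove six-digit position counters, whitespace, and table pipes."""
    tokens = listing.replace("|", " ").split()
    bases = [t for t in tokens if not re.fullmatch(r"\d{6}", t)]
    return "".join(bases)

# First row of the listing, copied from the Sequence field.
first_row = ("000001 AGGAGACTAT GGAGGACTCA CGGAGCTCTG ACACCCCTCA "
             "CCTGGAGGAG GTGGCTTCCC CCCACCCCCC ACTCTGCCTG 000080")

print(span_length(19619037, 19622150))  # 3114, matching the Length field
print(len(strip_counters(first_row)))   # 80 bases in the first row
```

Under the inclusive-coordinate assumption, the single exon span reproduces the stated transcript length of 3114 nt. Applying `strip_counters` to the full listing should yield a string of the same length.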
Predicted Small Protein
Name | NONHSAT146520_smProtein_1730:1993 |
Length | 88 |
Molecular weight | 9314.4562 |
Aromaticity | 0.0574712643678 |
Instability index | 41.7862068966 |
Isoelectric point | 9.42535400391 |
Runs | 13 |
Runs residual | 0.0111658456486 |
Runs probability | 0.0241123476417 |
Amino acid sequence | MGFARPLSLGKPPLPGKSRAEGCSGSATTCSVTSGRPPILLKNGSHCGHHHARVQDAQWLSCPSQAALSWPQHTSHSHCRQPWTFSC |
Secondary structure | LLLEELLLLLLLLLLLLLLLLLLLLLEEEEEELLLLLLEEEELLLLLLLLLHHHLHHHHLLLLLLLLLLLLLLLLLLLLLLLLLEEEL |
PRMN | - |
PiMo | - |
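Some of the values in this table can be recomputed from the amino acid sequence itself. The sketch below is a sanity check, not the pipeline that produced the table. It computes aromaticity as the fraction of F, W, and Y residues, which is the usual definition and is assumed here. It also reads the `1730:1993` suffix in the name as 0-based start and end positions, both inclusive, within the transcript. That reading is an assumption. The function name `aromaticity` is illustrative.

```python
# Peptide copied from the "Amino acid sequence" field (wrap space removed).
PEPTIDE = ("MGFARPLSLGKPPLPGKSRAEGCSGSATTCSVTSGRPPILLKNGSHCGHHHARVQDAQWL"
           "SCPSQAALSWPQHTSHSHCRQPWTFSC")

def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)

# ORF span from the name, assuming 0-based inclusive start and end.
orf_start, orf_end = 1730, 1993
orf_codons = (orf_end - orf_start + 1) // 3

print(len(PEPTIDE))          # 87 residues
print(orf_codons)            # 88 codons
print(aromaticity(PEPTIDE))  # compare with the Aromaticity field
```

The listed peptide has 87 residues, and 5/87 agrees with the tabulated aromaticity. The stated length of 88 matches the codon count of the span, so it may include the stop codon. That interpretation is a guess and the source does not confirm it.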