NONHSAT054790
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT054790 |
Source | NONCODE4.0 |
Same with | , |
Classification | intergenic |
Length | 3504 nt |
Genomic location | chr17-:53028828..53033879 |
Exon number | 2 |
Exons | 53028828..53029473,53031022..53033879 |
Genome context | |
Sequence |
000001 ctgggcaacg tggcaaaacc ctgtctctac taaatataca aaaaaattag ccaggcgtgg tggcacatgc ttgtagtccc 000080
000081 cagctactca ggaggctgag gcatgagaat tgcctgagcc tgggaggcag aagttgcagt gagccgagat cacccccact 000160
000161 atactccagc ctaatgacag agtgacactc tgtctcaaaa aattaaatta gccaggcatg gtagcacacc ccaatagtct 000240
000241 cagctactca tgaggctgag tagggaggac tgcttcaacc tgggaagtca aggttgcagt aagccgtgat catgccactt 000320
000321 tattctagtt tgagtgacac agtgagaccc tgtcaagaaa aaataataat aacctgtgtt ctcatggaga gttgtctatt 000400
000401 accagatgaa tgaatctcag gtttttttag gggtaacaAG TCTGGTACCC CAGATGAGAC CCAGGGTTGA GCCCTGACTT 000480
000481 GACCTCCCTT ATAGGCTCCT GGCAGGCAAA GCAGTGGGAT GTGGAAGCAT GCGTGGGCCA ACTCACCTCA GTTGCCAAGA 000560
000561 AGGAGACTAG AGAAGGGCCT TGGGACCCCT GACCTGGTGT CTAGTGTAGT CCACTGACCA GCACCACAAG ACCTCTCTGG 000640
000641 GAGAGTAGAA GCGGCTGCCT AGAGCCCAGA CCTTTTGCTC GGTGCTGCTT TGAAGCAGCA GCTCCATGAA CTACTAAAAG 000720
000721 AGTGGTCTGG GTTTTGATCA TTAGGATTCT GGGTTGTCAG GATTAGGACC TAGGTCAAGG GATCCAGGAT TGTTGAGAAA 000800
000801 AAGAAGTTCC TTCTTTCAGG TCTGGGAGAC TGAGAGACAT CGCCATCGTT TTAACAAGGG GAATCCCCAA CAAAAAGACA 000880
000881 AGGATGCTTG TCTTCAGGCT TTGTTCCCTA CGTGGTATAA CTGTAGGGCT CAATATTCTT ACTTGTCTGT GGCCTGAATG 000960
000961 AATATGAAGT CTGGATTCTT CTACCCTTAG TCTGTGGCAA CCCCTAAAAT ATCAGGCTTT GTTTTGCAGC AGTCACAGAA 001040
001041 GGACTGCTGT TAGTTTGATG AGGAGACCCA AGTTTCAAAG GGGGCAGTTT TGGTGGGATC ATGATCCCTG GAACCAGAAT 001120
001121 GCAGTGATAC AAAGGTTACA TGAACAGTGT GTGTGCTTTC TCTGAGGAAA CACCTCACTC ACTTGATTGA TTTACCTACC 001200
001201 TCCCCACCAC CTGAGCTGGA TCCAGAACTA CTTCATTGTC TCCATGAGAT CTCCTGGGAG GCTGCCCGGA CCGAACATGA 001280
001281 TCCAGGGAAA GGAAATCCCT GTTATCCTCT CAGCTTCCCA GGACCTCTGG ATGGCTTTGC ATGGTGGCCA GCAAAGAGGA 001360
001361 CCAGAAGTAT TCAAGCTGAG TTGTGGGATT TGTGTCATCA TTAATCTCCA GAGTTTGGAA GCCCCTCAGA CTGTGTGCAG 001440
001441 AGACCGTAGA ATGTCGTGAA CTAGGTCTTG GAAGGAGAGC TTGTTAGGGT GCCtcctctg gcaatggaaa ctgcatatcc 001520
001521 acccatgacc aaaaagcaaa tgatatatta ctgtaataca gcctggctcc agtattccct agatcagaag agatggtctg 001600
001601 aatatgggtc acttactgca actctccaat tggacttttt ttgcaagaaa catggtaaat gggaagaaat gccctatatg 001680
001681 caatgtttta tggccctcta tcaaaaccct gccttgcaga tgaaatgcag aatatgtAAA TTGGCCAAAG GGAACTATAT 001760
001761 TGCCAATTCT AGAAGCCTCT GGAGAGGAAG CATCAGTCAT GTGGGGGCCT ATAGGGAAGG GAACAAGTAA GAAGAGTCCC 001840
001841 TTTACTACCC CCTCCAAGAG AAGTGAGGCC CTCTAACTCT ACTCGGGGAC TGGAACCTTG GCCAATACAG GGGCATAGTC 001920
001921 CACCTTCTTG GAAATGGTAC CCTCTGCACC TCCAGGGAAA AGCAAGGGCC CGTTCTGTAT TTGCCAGGCC CTCCAAACTC 002000
002001 TGCAGAGCCA GCTGTGGCTC CTGCTATGGC AATAGCAGGC CCCTGTTGCA CTCTTCTTTG CCAGGGAGCA CCACTCATAG 002080
002081 TGGGAACGCA TATAACCCTG GAGGGTGACT GTTGCCACTC AGAGCATGCC TGTTGGAGAA AGAGGTTTTG CTCGGGTCTA 002160
002161 TGTGTTTTTT AAAACTGTTG ACTTGTATAA CTGGAAATCC CATAGTAAGG GCTTATGGGT GAGCCCACGA GAGTTCGTAA 002240
002241 CTCTCATATG GAGAATATTC TCCACACATA ACCCTACCTG GCCAGGTGTG CAGACCGTAA CAGCAACCCT GCTCACAGCA 002320
002321 GAAGATAAAT CTGCCACCAT GGCCAAAACT AAGGAGGAGG CAGACAATAT GTGTGCTGAT AACCTGGTGA CCTGGCCCGT 002400
002401 TGAGGTGTCA GTCCCAATAG CCAACCCAAA CTGGGACCCC AGTGATAACA AGAATCAAGA GTGGCTTCAT CATTATAGGA 002480
002481 ATATGCTTCT CAGAGGTATG AGGGAAGCAA GCCAGTCCCT GGTCAATTGG GGAAATCTCA GAGAAATAGA ACAAGGCCCT 002560
002561 AATGAAAATC CATCAGCATT CCTAAATTGA TAATAAGAAT GCCTCCAGAA TTACACCCCT TGGGACCCAG ATGACCCAAA 002640
002641 AGCTGAGTAG TACTTTAATC TCACTTCATC TCTCAGCCCC AGATATTCAG AGAAAACTCT AAAAAGTGGC AATAAATCCA 002720
002721 CATACTCCCC CTTTCCAACT GGTAGACATC TCCTTTAAGG TCTATAGTAA AGAGATGTGG CATCTGATGA AAAGGAAGAC 002800
002801 AAGAAGATGC GGCAGCCCTA CAGACTACTT CAGGAAGCCC AGGAAGAAGA TGGCATGGat tagaaaaatt tcaactgtag 002880
002881 gtgcctttgc aaccTATCCT GGGAGACCCT AAAACATATT TAGTTTTAGT GCCTAGAGAT TTTCACTCCT CTAACATCTC 002960
002961 TGGATACAGT GTCCCTGGTC AACATGAAGA AGTTACAGAA GAACGCTTTC TGATCCTGGC CCCTAAAGAA Tttacttgtg 003040
003041 ctaagtaata aaattcctat tgatcaaaac ctgtgtcctt gtggagagtc agatatctgt tactatccag gtgaatgaac 003120
003121 cctgggtttt ttgggtggta acaGGACCTG AGGGGctgca gagcctacct gtgtaaagga gatagctact actcaacccc 003200
003201 agcccatagc tagcacgtaa aaattcaggc ttcgtgttga cagagtatct aaatgatcta gaagtccaga atttttattt 003280
003281 tcaaatatcc gagttttcag ctgttggcaa ctgtggtctt atgatacata ctggtttttg tctaaggttc ctggctcaca 003360
003361 actcccatag cccttgttac agtcttttgt tataatgttg ggtgtgtcag gccttaggag ccagcctctg gaagcagaat 003440
003441 ctctcgacct tctctagtcc tcctttcacc tgccccaagg caccactcta atcgcccacc tttg |
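The Length, Genomic location, and Exons fields can be cross-checked against one another. The following is a minimal sketch, not part of the record itself: the helper names `parse_location`, `parse_exons`, and `exon_lengths` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, an interpretation inferred only from the fact that it reproduces the listed transcript length.

```python
def parse_location(loc):
    """Split a string such as 'chr17-:53028828..53033879' into its parts.

    Assumes the layout '<chrom><strand>:<start>..<end>' seen in this record.
    """
    chrom_strand, span = loc.split(":")
    chrom, strand = chrom_strand[:-1], chrom_strand[-1]
    start, end = (int(x) for x in span.split(".."))
    return chrom, strand, start, end


def parse_exons(exons):
    """Turn 'a..b,c..d' into a list of (start, end) integer pairs."""
    return [tuple(int(x) for x in part.split("..")) for part in exons.split(",")]


def exon_lengths(exons):
    """Length of each exon, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


location = parse_location("chr17-:53028828..53033879")
exons = parse_exons("53028828..53029473,53031022..53033879")
lengths = exon_lengths(exons)

print(location)      # ('chr17', '-', 53028828, 53033879)
print(lengths)       # [646, 2858]
print(sum(lengths))  # 3504, matching the Length field
```

Under this assumption the two exons also span exactly the listed genomic interval: the first exon starts at the interval start and the second ends at the interval end.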
Predicted Small Protein
Name | NONHSAT054790_smProtein_2774:2911 |
Length | 46 |
Molecular weight | 5412.4428 |
Aromaticity | 0.133333333333 |
Instability index | 52.7755555556 |
Isoelectric point | 11.6542358398 |
Runs | 8 |
Runs residual | 0.0155303030303 |
Runs probability | 0.0443936583643 |
Amino acid sequence | MWHLMKRKTRRCGSPTDYFRKPRKKMAWIRKISTVGAFATYPGRP |
Secondary structure | LLHHHLLLLLLLLLLLLLLLLHHHHHHHEEEELLLLEEEELLLLL |
PRMN | - |
PiMo | - |
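The predicted peptide can be traced back to the transcript sequence. The sketch below rests on several assumptions that the record does not state: that the suffix `2774:2911` in the name gives 0-based inclusive transcript coordinates (positions 2775..2912 when counted from 1), that the standard genetic code applies, and that aromaticity means the fraction of F, W, and Y residues. The names `CODON_TABLE`, `ORF`, `translate`, and `aromaticity` are illustrative only. With these assumptions the translation reproduces the listed amino acid sequence, which has 45 residues; the listed Length of 46 may include the stop codon, but that is a guess.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Transcript positions 2775..2912 (1-based), copied from the Sequence field.
# Letter case is kept as in the record and ignored during translation.
ORF = (
    "ATGTGGCATCTGATGAAAAGGAAGACAAGA"
    "AGATGCGGCAGCCCTACAGACTACTTCAGG"
    "AAGCCCAGGAAGAAGATGGCATGGattaga"
    "aaaatttcaactgtaggtgcctttgcaacc"
    "TATCCTGGGAGACCCTAA"
)


def translate(nt):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y); the definition is an assumption."""
    return sum(protein.count(x) for x in "FWY") / len(protein)


peptide = translate(ORF)
print(peptide)       # MWHLMKRKTRRCGSPTDYFRKPRKKMAWIRKISTVGAFATYPGRP
print(len(peptide))  # 45
print(round(aromaticity(peptide), 6))  # 0.133333
```

The remaining fields (molecular weight, instability index, isoelectric point, runs statistics) depend on parameter tables and definitions the record does not give, so they are not recomputed here.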