NONHSAT031035
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT031035 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
sense |
Length |
3229 nt |
Genomic location |
chr12-:118636195..118651976 |
Exon number |
2 |
Exons |
118636195..118639268,118651822..118651976 |
Genome context |
|
Sequence |
000001 ATTTCTCTGT CTGCCTGGTT TCCTTCAAGA ACTACTTTAT TGTGTTTCTT TCCTCTTCCA GCGGAACGGA AGCCGCCCCT 000080
000081 TTTCAACATG AATGCAATGA GTGCCTTATA TCACATTGCC CAGAATGACT CCCCAACGTT ACAGTCTAAT GAATGCATGA 000160 000161 CTTTGTTCGA CGAGACCGGC CACTACGTGT CCTCATTGAC CTCATACAGA GGACAAAAGA TGCAGTTCGT GAGCTAGATA 000240 000241 ACCTACAGTA CCGAAAAATG AAAAAAATAC TTTTCCAAGA GACACGGAAT GGACCCTTGA ATGAGTCACA GGAGGATGAG 000320 000321 GAAGTAGGTT CAatttattt tagcaggcac ttatagaatg cccactatgt gaaaaagttc tagactgggc actatgaaaa 000400 000401 acttttgaag ataaaaatta attgatgctg tccgccttaa gaaacatgaa gggttgtaca agagacattc ataaaataac 000480 000481 acttatataa ggcaaaatat gaaaagtgct acagaggtgt aaccccattg caggggtcca tcatggggag aaaataagtc 000560 000561 tgctggtggg attagtgaag gtttcctgga ggacatgaTG TGGGTAGTTT CTGACAAATG GGCATAGGGT GTTCTAGACT 000640 000641 CAGAAAGGTG TTGTGCTTTA GGAAGATTAT CCTAGCAACA GAGTGTAACA TGGATTTGAA TGGGCAACAT CCTGGCTaat 000720 000721 agtccatgca agaggtaata aggtcagtgt cgatgcaaat ggggaaaagg gagcacttta gagaaacatc gtagaggtag 000800 000801 atttcataat ccatggcttc tcattgacag tgacagataa ggaggagttt aaggtgactc tgagcattgg agcctgggtg 000880 000881 gctaggagta tgtcaatacc aagaacagag aaaaagaacc agggagaaag agtagattta ggatgatttc agtttcgtgc 000960 000961 cttttgagtt taaggtcctG GCATTATAAA CTCTTGCCAG TATACAGATA GAAGTTTTCA GTTAGAGAAT CCTTGATGAC 001040 001041 CTTAGTAAAA GCAGTTTTAC AGGACTAGGT AGAATGAATA GTAGATTACT GGAGTTGTAT AATTCATGGG AGTTGATAAG 001120 001121 AGGGAGACAG GAAAAAATAA TTATGTTAAG ATTCAGAAAT TGGGTTAAGA AGGAAAAGTA AGAGTGGGCA GCATGTAGAG 001200 001201 AGATCCATAA ATATAAGGCA GGCTTCTTCA TCTTTAAATG CAGTAGAGAT GGAGCCAGGG CAGAGTCAGA GGCACCGAAG 001280 001281 ATAGAAGAAA GGGATATAGT TAAAGAGGCA GGATTTCAGA ACGTAGACAT AACAAGAACA GGAGTATTTT AGCTTTTTAA 001360 001361 CAGaaaataa aataaacata aaagtatttt atcaataatt ttaaaaatGA AGCTCGGAAA GTTGCATTGA ACTATAAAAT 001440 001441 TTCTTGATTG CATATATAGA AATGATCCCC TTTAATTGGG AAAATAAGTC TGTTAAAAAT AATGTATTAT CTTCCATTTT 001520 001521 ATATTACCCA GTAATTTTTT ATATTGCCAT GGCTAAGTAA CAGTTTATTA AAGTTATATG TGTACCATAT GGTTTTTTGA 001600 001601 GAACAAGGTT TATTTGTAGC TAAATGGATT TAATTTTATT ATTCAATCTT ATATCAAAAG ATATCCCACT GTGATGGATA 001680 001681 AATTATATGT TCTGATTATT TTACCACTTT TCAAATAGTA GTCTTTGGGC CAGTAATGAT GAATACCAGT ATTATTTAAC 001760 001761 AGATATATAT AATAAGATTA CAGAGATAAG CATGATAGAA TCGCTTGAGG GAGGCAGACT TTAATTACCT GTCCAGGAAA 001840 001841 GTAGAATTAA TATTTCTGCT TAGATATGAA GTTACTCAAT AATGACTTGC AGAGGGAGCA TATTGCTTGT CAAATATCTG 001920 001921 AAAACTCTTT CCACTCTACT TTTTCAAGTG ACCACTGCTG TATTTGAAGA TATCCTAGAG CTCACCTGTT TAGTGGCCAG 002000 002001 CAGTCTTCCT CTAGCTCCTT GATGTTCCAG AAGGTCCCTT ACAAACCTGG CTTGATTTTG CTTACTGAAT AGCAACACGA 002080 002081 TAATATGATA GTGTGTATGA CAGGGGAAAG CCCTGGGATG GCAGCATTTG GAGCCTTATG ACCCACGTCC AGAATGGCCC 002160 002161 CTGGGGAAGG CATAGGAAGC AAATATGAGA TGTATGAAAA TGGTGGAGGA GTAGTGGAGC CAGTACATAA AAAGTTTGTC 002240 002241 TTTGCAGATT GGCAGAAAAG CAACTTGACA CTGCAAAATT GAGTGTGCTT GACACTTTTG TCAATGGATT GACTCTTTAT 002320 002321 CTTCAGTCTT TTTTTCTAAC CTCAGTGCCC CATTTTTACA GGACAGTGAA CATGGAACCA GCCTGAACAG GGAAATGGAC 002400 002401 AGCCTGGGCA GCAACCATTC CATTCCAAGC ATGTCCGTGA GCACAGGCAG CCAGAGCAGC AGTGTGAACA GCATGCAGGA 002480 002481 AGTCATGGAC GAGAGCAGTT CCGAACTTGT CATGATGCAC GATGACGAAA GCACAATCAA TTCCAGCTCC TCCGTCGTGC 002560 002561 ATAAGAAAGT AGGTTTCTTG GTACCCTCCA CAGAGGTGAG CTTTGTCAAT CATAGCGAAT CCAGTTCAAG TCACTTAGCT 002640 002641 TCCAGATGTT TCAGTTACCT TTTTGCAACT GCAGTGTGGT ACTAAACCTG GCATGTGACA GTTTTCTTAC AGTCAAAAAG 002720 002721 TAACAAAGGA CTACACAGGC TACATCTGTG CCTATTTCTT CATCTCCCTC TCTCATTACT CTGATGTCCA TCATCATCAA 002800 002801 TAAATAAGTT GAAGAATTTG ATAAGTCCTT CTCTAAACCA ATAATCTTAA TACTGTAGAC CTTCAAGTCT TCCTGAGTGT 002880 002881 ACCTGTTGGT TGTGCCCTTT TAGAAGTATG CTCTTTCAAA AGTATGTTTC TTggctgggc gtggtggctc acatctgtaa 002960 002961 tcccagccct ttgggaggcc taggcgggtg gatcacttga ggtcaggagt ttgagaccag cctggccaac atggcgaaac 003040 003041 cccgtcttta ctaaaaatac aaaaaatagc cgggcctggt gacatatgcc tgtaatccca gctactcgag aggctgaggc 003120 003121 aggagaattg cttgaaccca ggaggcagag gttgtagtga gccgagatcg cgccattgca ctccagcctg ggcaacagag 003200 003201 tgagactcaa aaaaaaaaaa aaaaaaaaa |
Predicted Small Protein
Name | NONHSAT031035_smProtein_152:355 |
Length | 68 |
Molecular weight | 8158.213 |
Aromaticity | 0.0746268656716 |
Instability index | 61.4567164179 |
Isoelectric point | 8.14019775391 |
Runs | 9 |
Runs residual | 0.00898648120653 |
Runs probability | 0.0378025083908 |
Amino acid sequence | MHDFVRRDRPLRVLIDLIQRTKDAVRELDNLQYRKMKKILFQETRNGPLNESQEDEEVGS IYFSRHL |
Secondary structure | LLLEELLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLEEELE EEEEEEL |
PRMN | - |
PiMo | - |