NONHSAT127201
Revision as of 07:01, 17 October 2014 by 124.16.129.48 (talk)
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT127201 |
Source | NONCODE4.0 |
Same with | , |
Classification | intergenic |
Length | 4260 nt |
Genomic location | chr8+:72756338..72968547 |
Exon number | 4 |
Exons | 72756338..72756657,72875158..72875283,72964774..72967426,72967437..72968547 |
Genome context | |
Sequence |
000001 AGGCGGGGAC CGGGTACTCC CGCTGCAGCC CCCGAAGCTC CATCTCCTCC GGATCACTCA CCGAGCCCGT GGACATCCCG 000080
000081 TTGTCCCCCT TGCCCACACG CGTCCTCTTT CCTCCCCCCT GGCCAGTCTC GCTGTCTCCG CCTTCCGCTC CCTGGCGGAG 000160
000161 GCGGAGGCCA GAGAGCGCTC CAAGGAAGAC TAAAAACCCA GGCCGGGAAG CGCGGGGTGA GAAAGCGAGG TGGGTGGCGA 000240
000241 GAGCGTGAGC GCCCCTCTGC TGACCCCGGG GAGCGTGGAC TACGAGTTGG CGCCCAAGTC CAGAATCCGC GCGCACCGCG 000320
000321 GAAAATATTT GCAATACCTG CCACGTCCAA GGTTCAGCTG AGGCCAATGT AAAGCCAGTA TTATAGGACA CTTCTCAGAC 000400
000401 TGAGAGCCCA ATGTCGTTGG AAACCACTGG GCCACAGGAA AGGCAGACTT CAAATCATCC CAACACCAGA GAACACTAAA 000480
000481 TGACAGTGGA CAGGAAGACA GTGTACCTTT TGCTCCTGAT GATCGTAAGA ACAACCTCCT TCCTCTTATT GTGAAGTGCA 000560
000561 AGGTGCAAAA AGGAGGCCTG CTGCTTGTTC AGGACTATGT CAGCATTGTG GCTCAGAAGA AGCGCAACGG CTTTGGCGTG 000640
000641 GCCTTCCCTT GCAGCAAAGT GAAGTGCAGT GTTCTTTTGA AGAAAAACAG ACACAGAAAA CGTGGTGACA GTGTCTAATC 000720
000721 TGTCTTATTT CACTGCTTAA CTGCTTAAAA GACATTAGCG ACACTATTTA ATACCAAAAT ACATAGCACT TAAAACTGGT 000800
000801 GTTCAGAATC AAGCTTCAAA ATAATGAGTT TATAGTTCTA GCATGGTGTG TGTGCATGTG TGGGAAGTGG ATCTGTGTGT 000880
000881 GTGTGTGCGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGATA GATAGAGAAA GAGAGAGAGA GAGAGAGAGA 000960
000961 GAGAGAGAGA GAGAGAGAGA GAGAATGAAT TCCAAGGCAT TTGTTTTGTG TTGCTTTTGC AATGGGATGA TTCCACAAGG 001040
001041 ACAAAGGACA AGCTTTCAGG GTAAAGATGT TGGGTTGAAC ACAGAAGAAA ATCCTCCTGA GATTGATCAT TTGTAGAACA 001120
001121 GTGGCTTACA GATTTGGGGC TTCACAAGCT TGTAAATGTT TTTTAAAATC AGAGACTGTT ATAAGTTTGC CCATTTTTAG 001200
001201 TTTCTCCACT TAAAGACTTT TTTTTTTGAA AAAGGCAACA ACAGTCTTCT ATCCCCATCA TTTTATGAAC AAACGTACAT 001280
001281 TTCAAAAGTT ACTCTCCCCA AAGGAAGGAA AGACATAATA GCACAATAAA GGGCAGCATT TCAAATCAAT ACACTTCAAA 001360
001361 ATAACACTTA TCTTAATGAA AACTCACCCC TGCCTTCATT TCTCTGGACC TACCAGGGAG TAAACGCTTT ACCATCAAAT 001440
001441 TTCATTGGCC TGTGGACCTC GCTGCTCTGT GAGACTGGAG GTTTCCCCAA CATTCACCCA CCCCTTGGTA TTCTCCCAAT 001520
001521 TCTGTTGCCT TTTGATTCAT ACTTCACCTC AAGAATCCAC TTGCTAAACA CCCAACAAAA GGCTGGGAAT AGAGCAAGGA 001600
001601 AGGTAAAAGG TGTCTAGGAG AGTTTTCTGG CAATGTTGGA AGTTTCTGCT ATATTGAGAT TTTGGGTAAG CATGAGAACC 001680
001681 GCAGTACATA CCCCGTCTTC ATCCAAGCGA TCTGTGCACT TCAAATTAGT ATCAAGAATG ACCTTCATGG TCTGAGTGTA 001760
001761 CCCGCCCATG GACGCATGAT GCAAAGCTGT CCAGCCATTG TGGTCACTGG TAAAGAGTTA AGAAAAGGTG TGTGCATACA 001840
001841 CTTAACAGAT GAAATGCGGG ATGGGATGCC ACTTGGCTTA TTCTAGGTAA ATCTGATGAC AACAAAAAGC ATTATTGAGA 001920
001921 TCTGGCAACT AAATATTTAT AAAACATTTA TAATAAATGA GCTAATTCCA AGCTCAATGA TCTTAAATGA CCTATTTCTA 002000
002001 ATGTGATTCA TACACGTTTT TTACTTAGTG GTTTAAAATA TGAAAGTATA TAAACACATT ACCAAACACT TTGGGGTAGG 002080
002081 AGAAGTACGC TTAGCATGTC TGTGTCATAC CTAATTTTTT TAAAGCTATA ATGAACTGAT ACACATCAAT TTAAATCTTG 002160
002161 TTTTATTCTT TCAATGAACA CTGTATCAAA ATTTTTTTAT GAAGTAATGT TAGTCTTTGT GGTCAATAGT TATTGGTAAT 002240
002241 ACTACTAGTA ATATGGTACA TGGCTATAAG ATGCTTAGCT TAGTGTCAGT TATTATTCTA AGAGTTTAAA TCTAAAAAGC 002320
002321 TTCCCTCTTG TGCTCTGAAA AATGCTTACT CAATTGCTTT GAAAAATGTT TATCCTTTTC CACAGAAATT CAAGATGTAA 002400
002401 ATACAACAAA GCCTTGTCAT ATTAGCTATC TTTTTTCAAA TTAGACAGAA AAAAATTCGT GACACTATTT AATTTCAGTG 002480
002481 CTAGTGAGCA GATGTTTTAG GGTGCAAGAG CCATGTTGCT ATTTCAGTTA TGATGTAATT AAGCAACAAC TGTCTGGCCT 002560
002561 GGGTCTACTT CGGTCCCCTT TGCAGGTCAG TTGCTTGGTG TTTCTTTGTA AACCAGAAAA AACTGAGCAA AACATTCATT 002640
002641 CATGTAGCTG TAGCACCAAT TTGCTGGCAT CAATATTATG GTAAGGAAAT GTGTTTTTGA ACATTAGCCT AATAAGCTCT 002720
002721 TTGCTACTGC TATGGTAAGA TATCAAATGA AAAACCTCTG TAAAACATTT CCAGGTTTTT TCTTTCTTAT CACAATATCA 002800
002801 TTTAATTATG CCACATATTT TATTATACTT ACATATTTTA ACTTTGTTTC TTGTACTTGA GTTCATTTAA GTATAATAAA 002880
002881 ATTCAGAAAT TTTTACCATC TCCTGTTGTC ACTTAAAGGG AAATTTTTAT GCCTACCATT CATCAGATTG AATGAAAAGT 002960
002961 AGAGTATTTG AACTTGTATG TTTTGTGTGT GTGTGTACGT GCATACACAC ATACACAACC ACAGTTTTGT ATCTATATAG 003040
003041 GAATATGGCA GCATGATGAG GTAAAATATG AGAAACTATG TTTCAAAGAA AGGTTCTGAG AAAGACTGCA CAAAGCCAAT 003120
003121 TATTCTGTGA CCTTGAACAA GGTTCTTCAA CGCTATAAAC TTCAAAGTCT TTGTCAGTTG GAAATAATAC TGACTTCCCT 003200
003201 TTCAGTATTT TTAGAAGTTA CATATGTTTT GATGAAAAAA TAAATTGGAT AAAAGTACTT AGTGAGATAT AAAGTTAGAC 003280
003281 CTCATAATTA AAAATATAAT GAAAAGATAG CCTGAAAATG GCTAATTAAA GAAATTATAT AAACACTCCA ATCATATATC 003360
003361 CTCACCTGAG AAACAATGCA CCTTTTTTCA GAAGAAGCTG AACTACTTTA TCATGTCCAT TCTTTGCTGC CAGATGGAGA 003440
003441 GGAGTCATTC CATGAAGGTC ACCTTCATTC AGAAGCCTCG TATCACTTAT GTCTTGTAGG AGCCTCTGAC AGGTATTGAT 003520
003521 ACGCCCATAA CTTGGAAAAA ATTAGGTTTC ATATTTTCAA GGCAAATATT AAACAGAAAG AAGACATGTT CATACATTGC 003600
003601 TAGACTGACC CTTACCTGGC TGCAAAATGC AGAGGTGATT TCTTATCTTT GCTTTTGGAA TGAATGGACA CATTAAAGCC 003680
003681 AAGTAGGTTA TTTACAGAAC CAGGGCCCCC CTGTCTACAT GCATAATGTA GAGGAGTACA CCCATCGTTG TCTTCATCCA 003760
003761 TTACCAGCTC TTTGATCTGT TGCATCTATA GGAAAAAATT AATATCATAC AAATAACTCT GCTATTATGC TAATATGATG 003840
003841 GTTTTCAAAC TATTCTGTAG GAAAATATTT TCCAACAAGG GTCAATATGT ATTCCTTGGG GCCAAAATTG TAAATAACCT 003920
003921 AAAGGATTTG AGAGTAATAT AAATATGAAT GAAGTTGAAT GCTCTGTCTT CATGTAGTGT ACATTACCAT ATTAGTGTAC 004000
004001 ATTACTGTAT TAAAGGATCT GAGAGATCAT GCCAATAGGA AAATGTACTT AACATTTTTA ACTCAGCATT TTTCAAATAT 004080
004081 ATTTGATCAT GGCCCTTCCC TCTGCCCCTC CTCTTATTTC CTCCCACCCC CAAATAACAC CCAACAGCTT CATGTTAATA 004160
004161 TGTTCTGTTA GGTAAAATGG ATGGGATGTA AACATTAGGT TTATTTTATG TTTTATATGT ATAGTAAATA AAAAATGAAT 004240
004241 GAAAAAAAAA AAAAAAAAAA |
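The coordinate fields in the Basic Information table can be read programmatically. The following Python sketch parses the Genomic location and Exons values and computes per-exon lengths, and also strips the position counters from a numbered sequence row. The function names are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Values copied from the Basic Information table on this page.
LOCATION = "chr8+:72756338..72968547"
EXONS = ("72756338..72756657,72875158..72875283,"
         "72964774..72967426,72967437..72968547")
FIRST_ROW = ("000001 AGGCGGGGAC CGGGTACTCC CGCTGCAGCC CCCGAAGCTC "
             "CATCTCCTCC GGATCACTCA CCGAGCCCGT GGACATCCCG 000080")


def parse_location(text):
    """Split 'chr8+:start..end' into (chromosome, strand, start, end)."""
    match = re.fullmatch(r"(chr\w+)([+-]):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand, start, end = match.groups()
    return chrom, strand, int(start), int(end)


def parse_exons(text):
    """Turn 'a..b,c..d' into a list of (start, end) integer pairs."""
    exons = []
    for part in text.split(","):
        start, end = part.split("..")
        exons.append((int(start), int(end)))
    return exons


def exon_lengths(exons):
    """Exon lengths, ASSUMING 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


def strip_counters(block):
    """Keep only nucleotide letters; drops counters, spaces and '|'."""
    return re.sub(r"[^ACGTNacgtn]", "", block)


chrom, strand, start, end = parse_location(LOCATION)
exons = parse_exons(EXONS)
lengths = exon_lengths(exons)
print(chrom, strand, start, end)
print(lengths, sum(lengths))
print(len(strip_counters(FIRST_ROW)))
```

Under the inclusive-coordinate assumption the four exons span 320, 126, 2653 and 1111 positions, 4210 in total, which is 50 less than the listed Length of 4260 nt. This page does not explain the difference.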
Predicted Small Protein
Name | NONHSAT127201_smProtein_1775:1885 |
Length | 37 |
Molecular weight | 3977.7834 |
Aromaticity | 0.0277777777778 |
Instability index | 45.8861111111 |
Isoelectric point | 6.51177978516 |
Runs | 7 |
Runs residual | 0.013986013986 |
Runs probability | 0.0485779897544 |
Amino acid sequence | MMQSCPAIVVTGKELRKGVCIHLTDEMRDGMPLGLF |
Secondary structure | LLLLLLEEEELLLLLLLLLEEEELHHHHLLLLLLLL |
PRMN | - |
PiMo | - |
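The Predicted Small Protein fields can be checked against the transcript sequence. The Python sketch below translates the 111 nt open reading frame with the standard genetic code and recomputes the aromaticity. It assumes that the "1775:1885" suffix in the Name field is a 0-based inclusive range, which corresponds to 1-based positions 1776..1886 in the numbered sequence above; that reading is an inference from the sequence, not something the page states. The aromaticity is taken here as the fraction of F, W and Y residues, which is also an assumption about how the listed value was produced.

```python
# ORF copied from the Sequence field, 1-based positions 1776..1886
# (ASSUMED to be what "1775:1885" means as a 0-based inclusive range).
ORF = (
    "ATGATGCAAAGCTGTCCAGCCATTGTGGTCACTGGTAAA"
    "GAGTTAAGAAAAGGTGTGTGCATACACTTAACAGAT"
    "GAAATGCGGGATGGGATGCCACTTGGCTTATTCTAG"
)

# Standard genetic code in the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate codon by codon, keeping '*' for stop codons."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the protein."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


translated = translate(ORF)
protein = translated.rstrip("*")   # drop the terminal stop codon
print(translated)
print(len(ORF), len(translated), len(protein))
print(aromaticity(protein))
```

The translation reproduces the listed amino acid sequence followed by a stop codon. The ORF holds 37 codons including the stop and 36 residues without it, so the listed Length of 37 may count the stop codon; the listed aromaticity matches 1/36.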