NONHSAT053849
Revision as of 22:18, 16 October 2014 by 124.16.129.48 (talk)
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT053849 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
3023 nt |
Genomic location |
chr17-:41453296..41466266 |
Exon number |
4 |
Exons |
41453296..41455181,41456491..41456630,41458641..41459028,41465657..41466266 |
Genome context |
|
Sequence |
000001 GGAGGAGGGT AAAGGAATCC ACGTCCCAAG CAGAGAAGCA GCTTTCCCTA CACAGCACAG GACACGGTCC GCGCACAGAA 000080
000081 GCCGCAGGAG ACGCAGGCAC AGGGGCTGGG GAGAATCCTT GCTGGGCCCT CGCCGTCCAT GCGCCGCCTC CCTCTGCCCG 000160 000161 GTGTCTGGTG TCAGCCTCCT GCCTGGCAGA GGAACTCCAG CCCCTGCTCC CCGAAGCTCC TCCAGGCCTT CTGCTTCCCT 000240 000241 GACTCGGAAT GGGCCCGCTG TCCAGTGGGT ACAAGGCTGG TCTCCCCGCC CACTGGTGAC GACGTAAAGA TCCAGCTCAG 000320 000321 CCTGGTTTCC CCTGGTGATG ATATCTTGCG ACTCGTCTTT CTCCAGGAAA ACAGGGACCG CTCTCCGCCT ACAGGTCTCT 000400 000401 TCCACAGACC TATCTTGTTT TCACCCCCTG TCTCTTTTGG TTCAGTCTTT TCGCTCATAC AAACAAGGAA GTCGCCCCTA 000480 000481 GCCCAGCCGC GGCTCCTTAG CTGGAGGCGG GCCCGGGGGT GGAGTCAACC GCGGTGGCCA CGCCTCCTGG GAAAGGGCAG 000560 000561 GGCATGCAAA TTCGAAATGA AAGCCCGGAA ACCCCGGAAC TAGAACTGGT ATCTCTTCAA CTACCTGTGA AAACTGATGT 000640 000641 GATGAAAAGG GGAATTTGAA GGAGCCATTC CAGAAGACAG GGCGAAAACT GAAGTGCAAT CAGGGCCAAG AAAAACAGAA 000720 000721 ATAGCAGGAC CTGGAGTTGC ATAGGTTGAA TAGTTGAATA GGCTGCTCTC CTCAGCTGGC AGGAAACCTG GCAGCCTCCT 000800 000801 GGTGCCCAAG GACTGAAAAC CTTAGAAGCA CCTGGAGTTG GCAGCCTTGG CATGGTCAGG TTGGCACCTC TGGAGGTGCC 000880 000881 CAGGCTTCCC TGGCAGCATT GTGAGCAGTG GATGGTGTTG AAGGGCAGCC AGAGGAGGAA TGGAACACAT GCTCCTTGCT 000960 000961 AACCACACGG ACAAGGCCAC GTTCACAGGT ACACAAAGGC AACGCAGTTG CTCAGGTGCT TCGGTATCAC AGCCAAGACC 001040 001041 CCTTCGGGGG AAGCTAGTCG GATACTGGGA CCCACATTCC AGACTACTGA GCCGCGGTCG CGCCCTCGGC TCCGTTTCTG 001120 001121 CTCCCTCCAC CCCACGAGGA CGGGGGTGGA AGGCCACCTT CGATGGGTGC ATCCTCCACG ATGACCTGCT AACAAAGGTG 001200 001201 CATGGATTTC AGAGTCTGAT TGGCCTACAA CAGCATTTGG CTTGTGGAGA CAGTGGTTCC CTGATGAAAA ACTGCCATGA 001280 001281 TGTAAGGAAG AGCCTGTCAG AGCGAGGCTG GGGTGCTGCG TGTTGGGGAG GTGGAGGTGT GGCTTCCCGG GAGAAGCTCC 001360 001361 ACCCGCTGGC TGAGTCTGGC ACATAAACCA GTCTGTGAGG GGATGGATGT GGGTGTAATG GGGGCAATTA CAGTAGGAAG 001440 001441 GAGCCCACGT GGAGCCTGCA TTCTCTGGGA CAGGGCATTA CTGCATTCTC TGGGACAGGC TAAGGCCCAG ATCCTACCTT 001520 001521 CCCAGGTGGC TGGATGGGTC ATAGATGTAT GAACCGGTCC CCTCATTTTC TGATTGCCCT GTGCTTAACG TTTCTGTACC 001600 001601 TTTACTGAGG CTCTTTCCTC CAACTCCAGT GCCCAGACCC CCCTTCTCCT GAACATGAAT GCCTGTCCAT GGAAATTCGA 001680 001681 GTCTCTCTCT CTCACCCAGG CTGGAGTGCA GTGATGCAAT CTCAACTCAC TGCAACCTCT GCCTCCCAGG TTCAAGTGAT 001760 001761 TCTTGTGCCT CAGCCTCTGG AGTATCTAGG ATCACAGGTG CGTGCCACCA TGTCTGGCTA ATGTTTTGTA TTTATAGTAG 001840 001841 AGATGGGTTT CGACATATTG GCCAGGCTGG TCTTGATCTC CTGGCCTCAA AGTGATCTAC CCACCTGGGC CTCCCAAATT 001920 001921 GCTGGGATTA CAGTTGTGAG CCACCACACC CAGCCTGTCC CTGAAATTCT AATGAAATGT GCGATAAAGT TGTTTTGTTT 002000 002001 TTCTTTTTGT TTTCCCTTCT TGGCAAAGCC TGGTGTTTCT ATTTTAGTGG ATTTGCCTGG CACTGAGGAC TGCTATGGTG 002080 002081 GTCTTCAGAG GCTCCTGGTA TTGACTGCTT GTGAAACCGC TTTTGCAAAA TTATGACTGA GACAGTGAAA GAGATCTAAC 002160 002161 TTAACCGACC CAATCTTGCT TCTAACCTCC AAATTGTCCT TATTCATTCC TGAGCATAGC CTGAACTAAC TTTGGGAGAA 002240 002241 GCTTAGTTTA TATTTTATTT TATAGTTTAA AACAAAGATG TTAACAGCCC TTTCCCAAGG CAGACTTCCT TCTTGCCTGG 002320 002321 GGACTAGGTT GCCTTTGGAG GACTAACATT AGCCACGAGA TTAGAAATTA TGGGCTGGGC CTCGTGGCTC ACCCCTGTAA 002400 002401 TCCCAGCACT TTGGGAGGCC ACGGCAGGTA GATCACCTGA GGTCAGGAGT TCAAGACCAG CCTGGCCAAC GTGGTGAAAC 002480 002481 CCCATCTCTA CTAAAGAATA CAAAAATTAG CCGGTTATGG TGGCACATGC CTATACTGCC AGCTACTTGG GAAGCTGAGG 002560 002561 TGGGAGGATC GCTTGAACCT GGGAGGCGGC GTGGAGGTTG CAGTGAGCCA GGATCTTGCC ACTGCACTCC AGCTTGGGCG 002640 002641 ACAGAGTGAG ACTCTGTCTC AAAAAAAAAA AGTTTAGAAA TTATGCTTTA GGAGTCATGC AGCTGGAGGC TACAAGATTC 002720 002721 TGACCCTCCC TAAACTGCTC CTAAGATCAG TGCTTGAGAT ATTTTGCAGA CCCTGCACTT GATGGATCAG CTGGCACCAC 002800 002801 CCAGACTGAT TAACTGGCTC ATGTGATCTT GTGGTCCCCA CCCAGGAACT TAATCAGCAC AAGGAGACAG CTTCAACTCC 002880 002881 CTATGATTTC ATCCCTGACC AATCAGCACT CCTGGGCTCA CTGGCTTCCC CCTACCCACC AAGTTGTCCT TAAAAAGTCT 002960 002961 GCTCCCCAAA TGCTCGGGTA GACTGATTTG GGTAATAATA AAACTCCGGT CTCCCACACA GCC |
Predicted Small Protein
Name | NONHSAT053849_smProtein_1658:1894 |
Length | 79 |
Molecular weight | 8234.3089 |
Aromaticity | 0.0512820512821 |
Instability index | 64.641025641 |
Isoelectric point | 6.93072509766 |
Runs | 7 |
Runs residual | 0.0439337085679 |
Runs probability | 0.0423827188534 |
Amino acid sequence | MPVHGNSSLSLSPRLECSDAISTHCNLCLPGSSDSCASASGVSRITGACHHVWLMFCIYS RDGFRHIGQAGLDLLASK |
Secondary structure | LLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLEEEEEEEEEE LLLLLHHHHHHHHHHLLL |
PRMN | - |
PiMo | - |