NONHSAT078943
Revision as of 07:45, 13 October 2014 by 73.162.128.239
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT078943 |
Source | NONCODE4.0 |
Same with | , |
Classification | intergenic |
Length | 3067 nt |
Genomic location | chr20-:21481165..21484354 |
Exon number | 2 |
Exons | 21481165..21482316,21482440..21484354 |
Genome context | |
Sequence |
000001 AATGAAGGAT AGTTAGATGT AGAAGGTAAA ACAAAAGGAA TCAGCTTTTT GTGTCTGTGA CAGAGGATCT GATACTTGAT 000080
000081 GGTCTGACCA CAAGGCATCG CTCAGAAATC AGCAGTCCTT TTCACCCAGT GAACAGCCAG CTGCTTTGAG CAGAGAGAGC 000160
000161 TGTGGTTGGA ACTCCTGCTC TGTGACTCAG GGGAGGAGGC AGAGCTTGCC GCAGAATGCA GCCACTGCCC ACAGCCCCTG 000240
000241 TGCACTCCCT TTTCCCTTGT GCTGGCTTCC CGCTGCCAGC ATCTGCAAGT CTCTGCCTGG AGGTGAGCAG GAGGGTGCTG 000320
000321 GGCCCATTAG ACAGGTGGCT AGGGCTGGGG AGTTCATGGT CTTCACGCAG TTTCAGGTGG AGGTTGCAgg atatatggcc 000400
000401 cagcttcctg gccccaggtg gaacaaccct gagacatgtt cgccatttcc cacaggcccc cagtgggact gagccctagt 000480
000481 tgtcttcagt ggcaatcttc tcattaccac ctctcgtgtt gattcttttt cccttcactt tctcattttt ccatttcctt 000560
000561 gttggtcccc attaactact tgcacatgaa tctttgtctc agggtctgca tctgggggac cccaaaGACA CAGACACTTG 000640
000641 AGCTGTTACT CTGCCTTTAT CTTAGACCTA GCATTGGTCT CCAGAGGTCA CTCATGACTT CAGGGCCTAT TTTAATGGCC 000720
000721 TCTTTCCCCA TGTGTTCCTT GGGAATGTCC ATTGTTGATG CGTCCTTCTG TACTAAGTAG AGTTTTTGGG CTGCAAGGAG 000800
000801 GCAGTTAGCA CAGGGAACTA CGTAGTGAGG GACATGGAGA CTAAACAGCA GCTGGCCCAT CTCTCTGGCT TCTCATGGCC 000880
000881 AAGCCTGGGA CACACAGCTG CCCTCGTGCA GGTCCAGAAC TATGGCACTT GTTCCCTCAG GACCCATCCT CAAAGCCAGT 000960
000961 GACTTGGTCT TTTCCTATGA CAAAGGAGCT CCCAGAATGT GGCAGTCCCC CTTCCTAGAG GGAGAATCTG ACTGGGTCAG 001040
001041 GGAGGCACCC CCGGTTTGCA GAGCCCCCAT TGGTCAGGTT CAGTGTCCAA AACACCTGAG GGGCAAGTGG CTTCTCGAAG 001120
001121 TCCAGCCCAT ATGGTTACTT TTGACTTGGA CACTCAAGGC ACAGGGACAC ATGACACAGG CACAGGGTGT GACAACTGCA 001200
001201 TGCAGAAAGA GCCACTCTCA TTCCTCAGAA GAGGCTGTGG AGATGAGTgg ctgagccatg tgccccaaaa ttcatgttga 001280
001281 agtcctaacc cccagtatct taaaatgtga ccatatttag aaatagaacc tttaaagagc taactgggtt aaaatgaatt 001360
001361 cattagggta ggccctactt cagtatgacc ggtgtcttta gcagaagaag aaatcggtac agacacgcag aggggagggc 001440
001441 atgtgaggac atgggaagga aatgccactt gcaagccaag gagagaggcc tcagaagaaa ccagccctcc tgacaccttg 001520
001521 atcctagact tccagcctcc agaactgtaa gaaaatacat ttctgttgtt taagccacct ggtcttagag ttctatCTTG 001600
001601 AATTTCTCTC TGCTTCAGAC TTTTCTGTTT TGTTTTTTAG CCTAAATGCT ATTGAAAAGA GCAGAGAAGA TAGAGTGCAG 001680
001681 GCCCCTAGAG ACAGGATGGA ACGAAACACG CAGTGTGCAT CTGTGTCCTG CTCAGCATAC GCTCTTGGTC CCCACTGGGC 001760
001761 CATCTCGCTG CTGACTGGCA TCCCGAAGAA CAGGCAGATT GCACAGGGGC AGCTCTTTCC TCCAAAGCTG CAATGTGGTG 001840
001841 CATGAAGTCC TCAGATTTGG GAACTCATAC AGGCACTTGA GCTCAAACAT ACAGTTCTGA TCTAAGACAT CACAGGGTCA 001920
001921 TGGTGGAGCT GTTTCAGCTG CAGGTGGTAT TCCACCATTT GGTCCTCCCT GATGTCTCCT GGCACCTTTC CTTCTTCAGT 002000
002001 CTCAGCACTC TGAAGAAGCG TGCTTGTGAG CGCCGCGGGA CGATCCAGGA TGTCCAGGAT TGTGGACGAT CCTGTTCTTC 002080
002081 CCATCTGTGA GGTAAGGCAG ATTAGGAATG TCCACGTCTA GCTTGAATTT CACATCCAGC CATTGGCTGC CACCATGGCC 002160
002161 AGGGGCTTCT GCAGATGGAT CACCGTTTCC CTCCCAGGAG CAGGCAGGTG TCCAGTATCA GCCCCAGAAC ATCCCGGTCA 002240
002241 CCCATCTCCA TAGGGCTTGG ATGACATGAT GATGCTCTCT GTGCCTCTGA GGACGGCGCC TTGGGTCTTT GTACACTAAA 002320
002321 TCCCACCTTG TCCTCTCCTC TATTTGAAAC AGCCATTTCC CAAATCCCAA TCTGATGAAA TCCTGTTCTT CCTTCCCTCA 002400
002401 TGTTTCTCTA GGCCTCCTGT GGACTCCCAG AATGTAAAGT CAGCATTCCC TTGAGCCTTT TGGGGACTTA TAAGCACATG 002480
002481 GCCCAAGGTC CACCTAACTC TAGGTGTATC CTGTAGAACC AGTATTCTAA TCCTAACTTT AAAGATCTTA CAGGACCATA 002560
002561 GGGTGTGCGT AAAAAGCCTG CTGCAAATCA GAGTGCTTCA ACTGCGTATT AATGAGATGG AGGAAAGAGG CTGTGAGTGC 002640
002641 AGACAATGAG TCCCTGAGTA AATAGACTCC AGAGCCAAAG GTGTTCTCCA CACCGAAAGA ATGACTGTAC CCAGGGAAGG 002720
002721 AGCTCAGACT CACCTGGACA AACCCTGAGG GGACAGCCAT GTTCCAGTGT GGCTAGGAGG GTCTGGAGCT GGGCACGTCC 002800
002801 CCCATGTCTG CTTGATTCCC CTCTGGCCGA GTTCTACTCA GGTGAGTTGT GCTCATTCTT GCTTCCCCAG GGCCTGGATC 002880
002881 ACAgttcctc aagctttggg atacaaacga atcacctggg atttggataa aatgcagatt cgtgttcagc agccctggca 002960
002961 ttctgcattt ctcatgagct ccagggagac gctggtgctg ctggggcagg accacacttt gggtggcaag gGCTTGGCAG 003040
003041 GCAATTCCAG GTGTCACCCT AGAGATG |
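The exon coordinates and the stated transcript length can be cross-checked with a few lines of Python. This is only a minimal sketch: it assumes the coordinates are 1-based and inclusive, and the helper names `exon_lengths` and `strip_counters` are hypothetical, introduced here for illustration rather than taken from NONCODE or this wiki.

```python
import re

def exon_lengths(exons: str) -> list[int]:
    """Length of each exon in a 'start..end,start..end' string.

    Assumption: coordinates are 1-based and inclusive, so an exon
    spanning start..end covers end - start + 1 nucleotides.
    """
    lengths = []
    for part in exons.split(","):
        start, end = (int(x) for x in part.split(".."))
        lengths.append(end - start + 1)
    return lengths

def strip_counters(block: str) -> str:
    """Remove position counters and whitespace from a numbered sequence block.

    Only runs of nucleotide letters are kept; case is preserved.
    """
    return "".join(re.findall(r"[ACGTNacgtn]+", block))

# Values copied from the Basic Information table.
exons = "21481165..21482316,21482440..21484354"
lengths = exon_lengths(exons)
transcript_length = sum(lengths)

# First row of the Sequence field, used here as a small parsing example.
first_row = ("000001 AATGAAGGAT AGTTAGATGT AGAAGGTAAA ACAAAAGGAA "
             "TCAGCTTTTT GTGTCTGTGA CAGAGGATCT GATACTTGAT 000080")
first_row_sequence = strip_counters(first_row)

print(lengths, transcript_length, len(first_row_sequence))
```

Under that assumption the two exons are 1152 nt and 1915 nt long, which adds up to the 3067 nt given in the Length field. Passing the whole Sequence field to `strip_counters` gives a plain nucleotide string whose length can be compared with the same figure.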
Predicted Small Protein
Name | NONHSAT078943_smProtein_215:478 |
Length | 88 |
Molecular weight | 9590.9472 |
Aromaticity | 0.103448275862 |
Instability index | 90.8724137931 |
Isoelectric point | 11.1616821289 |
Runs | 13 |
Runs residual | 0.0111658456486 |
Runs probability | 0.0308725014608 |
Amino acid sequence | MQPLPTAPVHSLFPCAGFPLPASASLCLEVSRRVLGPLDRWLGLGSSWSSRSFRWRLQDIWPSFLAPGGTTLRHVRHFPQAPSGTEP |
Secondary structure | LLLLLLLLLLLLLLLLLLLLLLLLLLLHHHLHHHLLLLLLEELLLLLLLLHHHHHHHHHHLHHHLLLLLLEEEEELLLLLLLLLLLL |
PRMN | - |
PiMo | - |
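Some of the values in this table can be recomputed from the amino acid sequence itself. The sketch below assumes that "Aromaticity" means the relative frequency of phenylalanine, tryptophan and tyrosine (F, W, Y). The page does not say which tool produced the value, so that definition and the function name `aromaticity` are assumptions.

```python
# Amino acid sequence copied from the table, with the line-wrap space removed.
protein = (
    "MQPLPTAPVHSLFPCAGFPLPASASLCLEVSRRVLGPLDRWLGLGSSWSSRSFRWRLQDI"
    "WPSFLAPGGTTLRHVRHFPQAPSGTEP"
)

def aromaticity(seq: str) -> float:
    """Fraction of residues that are aromatic (F, W or Y).

    Assumption: this is the definition behind the table's Aromaticity row.
    """
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)

residue_count = len(protein)
aromatic_fraction = aromaticity(protein)

print(residue_count, aromatic_fraction)
```

With this definition the sequence gives 9 aromatic residues out of 87, which agrees with the listed aromaticity of 0.103448275862. The listed sequence has 87 residues, while the Length row says 88. One possible explanation is that the stop codon is counted in the length, but the page does not confirm this.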