NONHSAT100124
Revision as of 01:13, 17 October 2014 by 124.16.129.48 (talk)
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT100124 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
3198 nt |
Genomic location |
chr5-:3417266..3536208 |
Exon number |
5 |
Exons |
3417266..3419430,3421185..3421418,3422396..3422736,3531867..3531992,3535876..3536208 |
Genome context |
|
Sequence |
000001 ATGCGCGGCC TGCCGGGGGC GGCCTCTTGC AGTTCCCGCC CTGACGCAGC GCGAGATTAG AGCTCCTGCT AGTCCCCAGC 000080
000081 TGACCCTGGG CCTCAAAGGC AAGCCCTGAC CTCAGTCACA CGGGGTGGCG GTGATATGGT TTGGCTCTGT GTGCTTCACT 000160 000161 CTGCCTCACT CGCCTTCCGC CTGCTGGACG CTGCCCTGTT CCCCGCAGGC GGCCGCCGTG GAGGGCAGGC ACCGCGCTCC 000240 000241 CTGGCTGCTC CCGGGGCCTT CCAAGCACAG CCACGGCCAT GGGGCCGAAA CCCAGACCTC GAGGGGCCTG CGGGCAGAGC 000320 000321 TCGACAAGCC CAGCTGCTAC ACTGAAGGGA AAAACCTGAG GAAGCCCCTG ACCTTCTTGA ATAACCTGGA TATTGCAGAA 000400 000401 TCATGCCTGG CACACGGCAC TGGCAGAGAC GCAGAAGGGA AAGTGTCCAG CGATGAGAGA TACCGGCCAG GGGGCAAATC 000480 000481 TTCAGCTTGC AGCACTCACT TTGCAGATCC AGTTGGAATT ATTTGTGTTG ACGCTCTCTG CTCTCTCCGC TCTCCCCTGG 000560 000561 AAATGACGAG GAACTGAGTA CAGTGGACAA TGGCTTCCGT GTCCCTAGCT CCCCCCGAGC CATGAGCTGC TACATCAAAA 000640 000641 GTCTGCAGTG GGCTGCCCTT CCCATAATAT TTCAATTTAA GCACCGACTT GCTGCTGCGG AGTCATATCT CATTCATGCT 000720 000721 GCAGCCTTTG CAGGAGTCAG GGATCATTAT GGAGCAAGCA CTGAGAAAAA ACAGATTACA GCTTGGCACC GAGCAGCCAG 000800 000801 GTTGCACCCC AGATGCTTCA GGGACATGGT GCCTGCTCTG GAGAATGGGG CAGCTTCCAC ACTGCCCAGG AGCCAGGGCT 000880 000881 TCTGACCCAG GAGCGAAAGT CTGTCTCTTC CATTTCTGGG AGCTGGCAGT ATTTGCACGA CTTTCTGGGC CTCAGGCATC 000960 000961 CCATTGCCCA CCAGGAATTA CCTTCCTCCA GGATCATGGT GAAGATGACA TGAGATGCTA ATGCTTGGCC TGCGGTTGCT 001040 001041 ATTTTTAGTT CTCCACTTTC CTGGAAAGCA TGCAGTTGAA GTTCATGACG GTATTCTCTC AACGTGTGAC TCAAAGACAG 001120 001121 AGGCCGTTTC CCATCCACCG GTTCTCACAC TGCCCCATTC TGCAGCTTCA CCATCCACGC GCTAAGAAGG CAGGCCCCAA 001200 001201 AGTCTCAGCA CCCAGCCGCT GGGGTCCTCG CTCTTATGGA CAGAAAACTC AAGAATGAAT ATGTTTCCCC ATCTTGTGCC 001280 001281 CTTTTGCCTG GAAACTATAC AGAGCAGGGG TCCCTGCTGG CATTCCTTCT CTGTACTAGT ACGGCTTCGG AAAGTACTGG 001360 001361 TTCTCTGAGG AGCGCTCAGT TGCATCTGGA AGGTGCAGTG TGCACAGCGA TGCCCCTTGA CACCAAGGTG TGAGCATTTG 001440 001441 AGCTGTGCTG GAAGGTTCTC CATGGAATAG TGCCCACCCT GGAGGGTCCT GTGCCGCGCA CTTGAGGAGG CTGTGGTGGT 001520 001521 GAGGAGAGAC TCTGGAGCAA CCCACACTCC CGATAAGTCA GATAAGAGTG CCTCCTCTAG GTAAGTTTAT GTTGCAAATT 001600 001601 TAAAGAGAGG AACCAAGAAA CAGCAAGGAC ACTGAGCCAG GACTGAGAGT TTGGGTCTTA GGTGGCTTTG GGTCATTACG 001680 001681 CCCTCTCAAG TGGATTGCTC TAAGCCCTGT TTATTTGTTA ATGTGAATAG TTGGATGGAA TTGCCTTTGT TCTAAAATCT 001760 001761 TTTATTCTGC CTAAAGTCTC TGGGTCAGCA AGCACATGCC ACATGGCTTC TCTCTCTGCA TGGGAATGCA CACCTGAGTA 001840 001841 GGGAGGCTGG CCAGCCCGTG CTGCCCTGGT GGGGTGAGTG TTAGCTGGCT AGGGTTGCTG GAACAGAGGG GATTCAACCA 001920 001921 CAGGAATTTA TTCTCTCACA CCATGGAGGC CCAAAGCCCC AGATCATGGT GCTGTAGGGC CAGGCTCCCT CTGAGGGCAC 002000 002001 TAGGGAGGAC CTGTGCCCAT CCTCTCCAGC TCCTGGTGGT TCCTTGGCTC GTGGCAGCAC AGCTTGTACC TTCACGTGGC 002080 002081 ACTCTCCCTT TATGTGTGTG GATGTGTCCA ATTTTCCTTT TCTGTTGGGG CCCCAGCCGT AGTGGCTTAG AGGCTCACCT 002160 002161 ACTCTAACAG GACCTCGTCT GAACTCATGA CATCTGCAAG ACTGTGTATC CAAATAAGAG CACATTCTGA GATGCTGGGG 002240 002241 GTTAGGACTT AAGCACAGAA GATTTGTGGG GAGGTACACA GTTCAATCCA TCTGGAAGGT GGATAGCCAT GGCAGGGTGG 002320 002321 AAAGACGTGT GTGGGTATAG GCCGGAGGTG GCCTCACTGG AGAAACCTGT GCACGGCCCC ACAAGGCAGC AGCTGAGCGT 002400 002401 GGAAGGGGCT GGAGCTGCCC CGTGGTAGGG AAGGGGCCGT GTGGCCAGCA GCAGCATCTG CCTTTGTCAT CACTGGGTGG 002480 002481 GACCACGAGG GCAATGCTAC TCTTCCCCTC ACACCTTCTA CAGGCAAGTT GGCCAGGTGT CAATGCAGCC AGGTGACACG 002560 002561 GCCGCCTGGA GGGCAGGAGC AGACAGCCCC ATGGAGGCTC TAGGAGATAG ACAGGGAGGC CGCAGCTGTC CAGGAGCTGA 002640 002641 AGTGAGCAGC AGAGGGCGCC GGGGGAGATG CGCCCTAGCA CCAGGCTCTG AGTGCCCCAG CGAGCTTCCT TGCCTTGACT 002720 002721 TTCCTCACTG GGAACCCGGG CCTGGAGTCC ATATGAAGCC TCAGCCCAGG GGAGGGTCTA CCTGTCCAGG AGGCTCCCAC 002800 002801 CATCCAGCTC GCCCTGGATG GTGCCCTCAG AGGTGCCTGG GCATGGGGTC ACCTCGGAGA GTCTGGAGGG GCCTCCGGAC 002880 002881 AGTGCGGCCA CGGGCTGTCC TCTGCTTCTC AGTGCCCAAG ACAGGGAACC AGAGCACAGC ACTGAACACG TTTTAGTGCT 002960 002961 CTTTTAGTTC ATGTGCTGCT TGCTGAATTT CCCTGAATTC AGAGAAGAGC CTGACGCAGA GGAAGCATCT GAGATCTTCT 003040 003041 GTATGTTTTC TCGTCTGTGC AAAGTGTTTT CAACTCTTTC AAAGGTCGGC ATTCACTGAA TTTCAGCTCC CCTTGGCTTG 003120 003121 ATCTCTCATA CAAAGGTTCT CAAATTCCCT GAAAATATGC AATGATTTTC TCACTCAATA AACGACTTTT CTGTCACC |
Predicted Small Protein
Name | NONHSAT100124_smProtein_812:1012 |
Length | 67 |
Molecular weight | 7348.2582 |
Aromaticity | 0.0757575757576 |
Instability index | 43.4712121212 |
Isoelectric point | 6.95941162109 |
Runs | 7 |
Runs residual | 0.0445472690982 |
Runs probability | 0.0190400484518 |
Amino acid sequence | MLQGHGACSGEWGSFHTAQEPGLLTQERKSVSSISGSWQYLHDFLGLRHPIAHQELPSSR IMVKMT |
Secondary structure | LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEEELLLLHHHHHHHHLLLLLLLLLLLLLLL EEEELL |
PRMN | - |
PiMo | - |