LINC01133
Revision as of 05:29, 23 June 2016 by Chunlei Yu (talk | contribs) (Created page with "==Annotated Information== ===Approved Symbol=== LINC01133 ===Approved Name=== long intergenic non-protein coding RNA 1133 ===Previous Symbols=== _ ===Synonyms=== _ ===Chromoso...")
Contents
Annotated Information
Approved Symbol
LINC01133
Approved Name
long intergenic non-protein coding RNA 1133
Previous Symbols
_
Synonyms
_
Chromosome
1q23.2
RefSeq ID
NR_038849
OMIM ID
_
Ensembl ID
ENSG00000224259
pubmed IDs
25908174
Sequence
>gi|336391100|ref|NR_038849.1| Homo sapiens long intergenic non-protein coding RNA 1133 (LINC01133), long non-coding RNA
000001 CTGATGTAAC AGCCTTGGGA AAGAGGTTGC AGTGAAAAGC TGGTCCTGCT GTGGTGGAGA GAATGGAGGA AAGATAATAA 000080
000081 AAGGCCAAAC CTTTGCTCCA ACTTTCTCCT TAGCTTCCCT TTGGATCTGG AAAGCTGGGG ACCCACACGG CAGAGCCATG 000160
000161 GTACTGGAGG AGCCATTAAC AAAGCTTTCA ATAAACCTCT CTTTCTTGAA GTTACCTGAG AATGGATCCA TTCCCTGCAA 000240
000241 CTGAAGATTC TAAGGAACTG GGTTTCTCAG TATACAATGG GAATGGTTGG GAGGAGGTAA AGAGTAGAAG ACAGTATCAA 000320
000321 GAATCCAGAG CCCAGCACCT GTAGTCCTAA CTATTCAGAT TCCTTGAGCC CAGGAGTTTG AGTCCAGCCT GGACAACATA 000400
000401 TTGAGACCCC CATCTCTCTA AAAAAAAAGA GAAAGAAAGA AGGAAAGAAA AAAAGAAAGA AAGAAAGAAA GAAAGAAAGA 000480
000481 AAGAAAGAAA GAAAGAAAGA GAAAGAAAGA AGGAAAGAAG GAAAGAAGGA AAGAAAGAAA GAAAGAGAAA GAAAGAAAAG 000560
000561 AAGATTGTAG CTAGGGGGAG AGTAGGTGAA AAGATGAACA ACATGACCGG GAAGATTTCC TAATCTCACC ACAGCCTGGC 000640
000641 TCTACCTTAA GTCTTTAATA AAAGCTTGAC TGAAGGTACC AAGGTGTGCT GAAGTGGAAG CAAAGTTCTC CAAAGTCCAG 000720
000721 CATGGTAGAC ATCAGTGGTG GTAACCAAGG ACAGACCCCA AGGCAAGGTG AACCTCAAAA ATGGAACCTC AAGTCTATGC 000800
000801 AGTCCAGCTG CCCTCCCCAC CAGAAAGTCC TTGTTCCAGC CCAACATCAG TGCCTCTGAG TTTGTTTACT AGAAACAAAG 000880
000881 GAAGAATTTC CTTGTAAAAA TATAGACAGA GTAGTCCCTG GCTTTCTCCT CTTGCAGGAA GGATGGATTC TCCCATTCCA 000960
000961 TACCATCTTT CCCCCACACT GGCCCCAGAA ATACTTAATT CAACTATGTG AAAATAAAGA TTGTTTTTGG TTTGAGGGCA 001040
001041 TAGGGATCCA TTTATCCTTA TTCTTTATGA GGCACTAAAT TAGCTTTGTA TGTTATTAAA TGTGTCTCGT CAATGCTGTT 001120
001121 GGCATTGTTT CATTTTAAAA AAAAAAAAAA AAAA
000081 AAGGCCAAAC CTTTGCTCCA ACTTTCTCCT TAGCTTCCCT TTGGATCTGG AAAGCTGGGG ACCCACACGG CAGAGCCATG 000160
000161 GTACTGGAGG AGCCATTAAC AAAGCTTTCA ATAAACCTCT CTTTCTTGAA GTTACCTGAG AATGGATCCA TTCCCTGCAA 000240
000241 CTGAAGATTC TAAGGAACTG GGTTTCTCAG TATACAATGG GAATGGTTGG GAGGAGGTAA AGAGTAGAAG ACAGTATCAA 000320
000321 GAATCCAGAG CCCAGCACCT GTAGTCCTAA CTATTCAGAT TCCTTGAGCC CAGGAGTTTG AGTCCAGCCT GGACAACATA 000400
000401 TTGAGACCCC CATCTCTCTA AAAAAAAAGA GAAAGAAAGA AGGAAAGAAA AAAAGAAAGA AAGAAAGAAA GAAAGAAAGA 000480
000481 AAGAAAGAAA GAAAGAAAGA GAAAGAAAGA AGGAAAGAAG GAAAGAAGGA AAGAAAGAAA GAAAGAGAAA GAAAGAAAAG 000560
000561 AAGATTGTAG CTAGGGGGAG AGTAGGTGAA AAGATGAACA ACATGACCGG GAAGATTTCC TAATCTCACC ACAGCCTGGC 000640
000641 TCTACCTTAA GTCTTTAATA AAAGCTTGAC TGAAGGTACC AAGGTGTGCT GAAGTGGAAG CAAAGTTCTC CAAAGTCCAG 000720
000721 CATGGTAGAC ATCAGTGGTG GTAACCAAGG ACAGACCCCA AGGCAAGGTG AACCTCAAAA ATGGAACCTC AAGTCTATGC 000800
000801 AGTCCAGCTG CCCTCCCCAC CAGAAAGTCC TTGTTCCAGC CCAACATCAG TGCCTCTGAG TTTGTTTACT AGAAACAAAG 000880
000881 GAAGAATTTC CTTGTAAAAA TATAGACAGA GTAGTCCCTG GCTTTCTCCT CTTGCAGGAA GGATGGATTC TCCCATTCCA 000960
000961 TACCATCTTT CCCCCACACT GGCCCCAGAA ATACTTAATT CAACTATGTG AAAATAAAGA TTGTTTTTGG TTTGAGGGCA 001040
001041 TAGGGATCCA TTTATCCTTA TTCTTTATGA GGCACTAAAT TAGCTTTGTA TGTTATTAAA TGTGTCTCGT CAATGCTGTT 001120
001121 GGCATTGTTT CATTTTAAAA AAAAAAAAAA AAAA
Predicted Small Protein
Name | LINC01133_smProtein_221:343 |
Length | 40 |
Molecular weight | 4688.0242 |
Aromaticity | 0.125 |
Instability index | 74.1425 |
Isoelectric point | 5.09027099609 |
Runs | 7 |
Runs residual | 0.00302419354839 |
Runs probability | 0.0493729023141 |
Amino acid sequence | MDPFPATEDSKELGFSVYNGNGWEEVKSRRQYQESRAQHL |
Secondary structure | LLLLLLLLLLLLLLEEEELLLLLEEEELHHHHHHHHHLLL |
PRMN | - |
PiMo | - |