LINC00528
Revision as of 08:34, 23 June 2016 by Chunlei Yu
Annotated Information
Approved Symbol
LINC00528
Approved Name
long intergenic non-protein coding RNA 528
Previous Symbols
C22orf37
Synonyms
FLJ40542
Chromosome
22q11.21
RefSeq ID
NR_103718
OMIM ID
_
Ensembl ID
ENSG00000269220
PubMed IDs
14702039
Sequence
">gi|512388702|ref|NR_103718.1| Homo sapiens long intergenic non-protein coding RNA 528 (LINC00528), long non-coding RNA"
000001 GTCTTCCTGT GAAAATTCAG CCCCTCTTCC TCAGTTACCC CTTTGCAACA GAACGAGGCC GGCATTTTCT AAAGCGTCCT 000080
000081 TGAGGAGTCC AGTTCCTAAA AGATTATTTC AGAGTCCACT GTTGTGGGCT GACTTGAATG CTCTCAAAAT TCGTAAGTTG 000160
000161 AAGCCCTGAC CCCCAGGACC TCAGAACGTG ACTGTGCTTG GACACAGGGA CTTTCCAGAG GTGATTAAGG TGAAGTGAGG 000240
000241 TCACTGGATG GCCCTCCTCC TATCTGACTG GTGTCCTGAT GGGGACGCGG ACACCCACAC AGGAACGGAC CCGGGGAGAA 000320
000321 CAACACATCG TCTCTGCGCC AGGGAGAGAG GCGTCCGGGG AACCCAGCCG TGCCCACGCA TTTATCTCCG CCTTCCGGCT 000400
000401 CAGAATTGTG AGGAAACCCG CTTCTGTTGT GCAAGTCCCG GGTCTGTGGT GCTCGGCCAT GGCGCCCCCA GAACTGCATC 000480
000481 CCCACCTTCT GCTCTGAGCC ACCCATCCCC TCTGGAAGGG CTGAGCTTCT CTCCCTTTCC ACCTTCTGTT CTGAGCCACC 000560
000561 CGTCCCCTCC AGAAGGGCTG AGCTTCTCTC TCTTTCACTG CCTATGCTCT GGGAAACTTT CAGAATCCCC AGGCTGTTTC 000640
000641 TGGAATAGTC TGGGATGGTC ATTTTCGGTC CTCACAGAAC CGGGGGTTTG GAAAGTAGGT GAAGCCATAT GGGTTGCAGA 000720
000721 GAATCTGGCC CAGCCCCTCA CTAGTCCCTG TGCTTGTTAG TTATTCTTTA ATTACTTGGA TATCAACTCT GATTTAAAAC 000800
000801 ATCAATTTAA CATACAGATA ATGCTGGGAC TGCAGCTTCG CCAGGTGGCC TTCCTCTCCC GGTCTCGGGA GGCGTCACCC 000880
000881 CCATCTTGTC CCCCCAAGAT CAAGCGAAAA GGGCAAAGTT ATCTTTTTTT TTTGCAAAAT TCAAACCTTT CCCTCCTTCC 000960
000961 CAGGTCTGGG TCACTTGCGA CCCTCCTCTC AGGCTGCTTC TGACAGGCCC AAGTCACAGC CAACTTCAGG GGAGCTGTGT 001040
001041 TGGATCCCTC CAAACTCCGG GAATGATGTG ACCTCCCCGG CGCTCACCCT GGGCCTCTAG GTTAGACTCA GCCCAGACCC 001120
001121 TGGCTCACAG GGAAGCAGAG CCCATGGCTG GCCTCCACCC GCTCAGACGC CCGCACACAG GCCAGGGGAG GGCTTGGGGG 001200
001201 CAGCTCTGGG AGGGTCCCTG TTTGCGCTGT GGGTGTCCAA GGCTGGGTGT GCAGAGCTGA GGTTGCCACA CACCTGAGGG 001280
001281 ACTCTCGCAT AAGGATGGCG CAGGGCTGGA AGCTGGACTG AGACAGTGAA TGAGACAGCG CTGCCGGCTC TCTCCGCGCC 001360
001361 GTGGACTGGC TCCTAGGCAG TCTCACGAGC AAGTGTGCTG TTGCTCGTGC ACGTGTGTTT TAACTGAGAT GTGAGCTGGC 001440
001441 GCGTTTTCTC CACCCGGCTC CATCTGCTGG GTCCCTCCAC ATTGCTGCGC GTGTGCCTCC AGTGCCATGG GGTCCCCACT 001520
001521 GTGGGCAGCC CTGCATTTTG CCCAACTGCC TCCCAAACCA TGGAACCCCA GTGACCTCCA ACTTGCCGCC ATGACGAGCC 001600
001601 ACTCTCATTC ATATCCTCAA ACACGGTGCT CTGAGCCCGA ATGAGGGTTT CTTTAGAGAG ATATCTGGGG ACCAAACTGT 001680
001681 AGGGACAGAG CTCCCGTCTA AGGTGGGCCT GTGGTGAGGT CTCCCCAGTG GTACACTTTA ATCCCACAGC AGGGCTTGCA 001760
001761 GGCCGCCCTG CTCCATCCCC ACCAACACTG GGCAGGATCT AGCCTCCAAA CCTTTGCCAG TCTAATAGTA TAAAATGATA 001840
001841 GCTCGGTGCT ATTTTGGTTT TGCATTTCCA TGGTGAACAA TGAGTTTGAG AGTTTTTGTT CAAAAGTCCT TTGGGAGTTC 001920
001921 GAGACCAGAC TGGGCCACCA GGAGTTCAAG ACCAGCCCGG GCAATTCAGT GAGATGTTCA TCTCTACAAA AAATAAGAAA 002000
002001 GATAAAAAAT TAGCCAGGCA TGTTTGTGTG TGCCTGTGGT CCCAGCTCCT CAGGAGGCTG AGGTGGGAGG ATGGCTTGAC 002080
002081 CCCGGGAGGT GGAGGCTGCA GTGAGTTGTG ATCGTGTCAG TGCACGGCTA GCCTGGGCAA CAGAGTGAGA CCCCGTTTCT 002160
002161 AAAATAAAAA TAAAAAACAA AAGAGAAAAG TC
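The listing above stores the transcript in numbered rows, each holding blocks of 10 bases between a start and an end position label. The following is a minimal sketch, assuming that layout holds for every row, of how such rows could be joined into one contiguous string. The function name `parse_numbered_rows` is hypothetical, and only the first two rows of the listing are included as sample data.

```python
def parse_numbered_rows(rows):
    """Join rows of the form '<start> <10-base blocks...> [<end>]' into one string."""
    blocks = []
    length = 0
    for row in rows:
        tokens = row.split()
        if not tokens:
            continue  # skip blank rows
        # The first token is the 1-based position of the row's first base.
        start = int(tokens[0])
        if start != length + 1:
            raise ValueError(f"row starting at {start} does not follow position {length}")
        # Keep only the base blocks; position labels are all-digit tokens.
        for token in tokens[1:]:
            if not token.isdigit():
                blocks.append(token)
                length += len(token)
    return "".join(blocks)


# Sample data: the first two rows of the listing (positions 1 to 160).
SAMPLE_ROWS = [
    "000001 GTCTTCCTGT GAAAATTCAG CCCCTCTTCC TCAGTTACCC CTTTGCAACA GAACGAGGCC GGCATTTTCT AAAGCGTCCT 000080",
    "000081 TGAGGAGTCC AGTTCCTAAA AGATTATTTC AGAGTCCACT GTTGTGGGCT GACTTGAATG CTCTCAAAAT TCGTAAGTTG 000160",
]

sequence = parse_numbered_rows(SAMPLE_ROWS)
print(len(sequence))
```

The start-label check is a design choice of this sketch: it makes a skipped or repeated row fail loudly instead of silently producing a shifted sequence.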
Predicted Small Protein
Name | LINC00528_smProtein_278:496 |
Length | 72 |
Molecular weight | 7892.0345 |
Aromaticity | 0.0416666666667 |
Instability index | 41.2736111111 |
Isoelectric point | 11.1874389648 |
Runs | 10 |
Runs residual | 0.00202757502028 |
Runs probability | 0.033306836248 |
Amino acid sequence | MGTRTPTQERTRGEQHIVSAPGREASGEPSRAHAFISAFRLRIVRKPASVVQVPGLWCSAMAPPELHPHLLL |
Secondary structure | LLLLLLLLLLLLLLLEEEELLLLLLLLLLLHHHHHHHHHHHHEEELLLEEEELLLLEEELLLLLLLLLLLLL |
PRMN | - |
PiMo | - |
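The amino acid sequence in the table can be cross-checked against the transcript. The sketch below rests on two assumptions that the page does not state: that the `278:496` suffix in the protein name gives zero-based, inclusive coordinates on NR_103718.1 (stop codon included), and that the Aromaticity value is the fraction of F, W and Y residues. It uses the standard genetic code and only rows 241 to 560 of the sequence listing; all names in the code are illustrative.

```python
# Rows 241-560 of the transcript listing, copied with their block spacing.
REGION_START = 241  # 1-based position of the first base in REGION_ROWS
REGION_ROWS = """
TCACTGGATG GCCCTCCTCC TATCTGACTG GTGTCCTGAT GGGGACGCGG ACACCCACAC AGGAACGGAC CCGGGGAGAA
CAACACATCG TCTCTGCGCC AGGGAGAGAG GCGTCCGGGG AACCCAGCCG TGCCCACGCA TTTATCTCCG CCTTCCGGCT
CAGAATTGTG AGGAAACCCG CTTCTGTTGT GCAAGTCCCG GGTCTGTGGT GCTCGGCCAT GGCGCCCCCA GAACTGCATC
CCCACCTTCT GCTCTGAGCC ACCCATCCCC TCTGGAAGGG CTGAGCTTCT CTCCCTTTCC ACCTTCTGTT CTGAGCCACC
"""

# Assumed interpretation of "278:496": zero-based, inclusive on both ends.
ORF_START, ORF_END = 278, 496

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna):
    """Translate codon by codon and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the protein."""
    return sum(protein.count(residue) for residue in "FWY") / len(protein)


region = "".join(REGION_ROWS.split())
offset = REGION_START - 1  # zero-based transcript index of region[0]
orf = region[ORF_START - offset:ORF_END - offset + 1]

protein = translate(orf)
protein_aromaticity = aromaticity(protein)
print(protein)
print(len(protein), protein_aromaticity)
```

Under these assumptions the translation agrees with the Amino acid sequence, Length and Aromaticity rows of the table. The molecular weight, instability index and isoelectric point are not recomputed here because the page does not say which tool or parameters produced them.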