FAM201A
Contents
Annotated Information
Approved Symbol
FAM201A
Approved Name
family with sequence similarity 201 member A
Previous Symbols
C9orf122
Synonyms
_
Chromosome
9p12
RefSeq ID
NR_027294
OMIM ID
_
Ensembl ID
ENSG00000204860
pubmed IDs
12477932
Sequence
>gi|224591428|ref|NR_027294.1| Homo sapiens family with sequence similarity 201 member A (FAM201A), long non-coding RNA
000001 GAGTGCACCT GGCCTGAGAG CCCGCCAGGC CCTGCCCCTG CTCGGCTCCT CGTCTGCTGA AGCTCGGGAC CTCTAGCCAG 000080
000081 TGGCCCTCTG CAGCCACGGG GAATGGGGCT GAGAGCCGGT TCCCGCTGCA GGGCAGACCA CCTAGCTCAG CCGCAGCCAC 000160
000161 AGGGACATCT GGCCCCGGTT CTGAGATGTG GGGAGTGCTG GCGGGCTCGG GGGTTGCCTG GAGGCTGCTG CCTGCACACA 000240
000241 GAAGGCGGAT GCAGCTTGGG TGCCCAGGCG GGCTGGAGGA CAGCGGGCTG GGAAGCCAGG GGCCGCCGGG ACCTGGGGCT 000320
000321 GGAAACCACA TCTGCCCACA GCCGCAGCCT CCTGCACCTG TCATCCTGGC GGCGCCCTGA CGTCGGAGAA GCAGCAGGCG 000400
000401 CGGAGCTCCT GCAGCGGGCA CCGCTGCAGC AACCCGACCC GGCGCAGGCG GCAGTGGAAG GCGGCCTGCT AGCACGGCTC 000480
000481 CCGCGGCCCC AGGACCAGGG GTGCGGCCAG CACCGCCCTC ACTCCCCGCG TCTCGTGGAT ATTGCCCTCC CTGGAGGCGG 000560
000561 CTGGACCTAA GCGAGGCTCG CAGCGATCGG CCCCGCGAAG ACTCGCATGC AGGGCTCAGG ATCCCGCATT CTGGCGCCGC 000640
000641 GGCACCGCAC CTTTCAGCAG CCCTGCTCCT CTTCTTGTTG GTAAACTTTT GTCACTCTGC CATGAAAAGC ATCCAGGCGT 000720
000721 CCTTGATTTC GACATTGCGT GCACGCTCAG CCGAGCTCAT CAGACGGGCG GTTGTCCCCA GAGCCCGCGG GTAGCTTGTT 000800
000801 CCCGGGCGGC GGCTCCTGCT GGATGCAGCT CCTTGTGGGC CTACCAGGCT TGGTGGAGCC ACAGCAGACC TGTGGAGAGG 000880
000881 AAGAGTAAGG CGGGCTCTGC AGGGCAGTGG GCCGGGACCA TGAGAGAGGC GGGTCAGTCT GGGCTCCAAG CTCAGCCTCT 000960
000961 CGGATTCCCC GGGACCCACG GCTTATAATG CGCTTAAATC CCACGCCTCG CCCGAGAGAC AGCACGTCAC CGTCACCGTC 001040
001041 ACCGCCTAGC GCCCCTGACC CGCTCCCACT CCGCTGCAGC GGAGGGTGTG TGAGGGAGAG GGACGCAGGG AGGGAAAAGC 001120
001121 GTTGGGAGGG CAAACATCTT TTCATAAGCT TTTCCCCTTC TATATGCCAT CTCTGATGGG AGCCTCTTTA GATCTTTCGT 001200
001201 CCATTTACTA ATTGGGTTGT TCGATTTCTT ATTGTTGAGT TGTAAGTGGT TTTTAATGGT CTGGATGCCA GACAGGTGTT 001280
001281 TTGCAAATAT TTTCTCCGTC TGTGGCTTGT TTCTCCATTC TCTTATTTCC TTTCCCAGAG CAAAAGTTTT TAATTGTAAC 001360
001361 GACTTCATAC CAATATCTTC TTTCATGGTA GAAATTTGTC TTTTATGTAC TTTACTGTTG TATCTACAAA GTAATTGCCA 001440
001441 AACCCAAAGT TACCTAAATA TCCCTTTTGT TATTTTACAG AAGTTTTACA GCTTTTGGAT TTAAATTTAG GTCTAAAAAT 001520
001521 TGAATTCATA AAAGTAGAGA GTATAATGAT GCTTACCAGA GGCTGTGTTG CTGGGGAGAA AAATGGGAAG TTGTTCACAG 001600
001601 ATGCAAAGAT TCAGTTAAAC CGGAGAAATG CGTTTTGAAA TCTATTGCAC AGCAGGATGA CTATAGTCAA TAATAATGTA 001680
001681 CTGTATATTT AAATAACAAA AAGTAAATTT CAAATATCTA ACCACAAAAA AAAAAAAAAA A
000081 TGGCCCTCTG CAGCCACGGG GAATGGGGCT GAGAGCCGGT TCCCGCTGCA GGGCAGACCA CCTAGCTCAG CCGCAGCCAC 000160
000161 AGGGACATCT GGCCCCGGTT CTGAGATGTG GGGAGTGCTG GCGGGCTCGG GGGTTGCCTG GAGGCTGCTG CCTGCACACA 000240
000241 GAAGGCGGAT GCAGCTTGGG TGCCCAGGCG GGCTGGAGGA CAGCGGGCTG GGAAGCCAGG GGCCGCCGGG ACCTGGGGCT 000320
000321 GGAAACCACA TCTGCCCACA GCCGCAGCCT CCTGCACCTG TCATCCTGGC GGCGCCCTGA CGTCGGAGAA GCAGCAGGCG 000400
000401 CGGAGCTCCT GCAGCGGGCA CCGCTGCAGC AACCCGACCC GGCGCAGGCG GCAGTGGAAG GCGGCCTGCT AGCACGGCTC 000480
000481 CCGCGGCCCC AGGACCAGGG GTGCGGCCAG CACCGCCCTC ACTCCCCGCG TCTCGTGGAT ATTGCCCTCC CTGGAGGCGG 000560
000561 CTGGACCTAA GCGAGGCTCG CAGCGATCGG CCCCGCGAAG ACTCGCATGC AGGGCTCAGG ATCCCGCATT CTGGCGCCGC 000640
000641 GGCACCGCAC CTTTCAGCAG CCCTGCTCCT CTTCTTGTTG GTAAACTTTT GTCACTCTGC CATGAAAAGC ATCCAGGCGT 000720
000721 CCTTGATTTC GACATTGCGT GCACGCTCAG CCGAGCTCAT CAGACGGGCG GTTGTCCCCA GAGCCCGCGG GTAGCTTGTT 000800
000801 CCCGGGCGGC GGCTCCTGCT GGATGCAGCT CCTTGTGGGC CTACCAGGCT TGGTGGAGCC ACAGCAGACC TGTGGAGAGG 000880
000881 AAGAGTAAGG CGGGCTCTGC AGGGCAGTGG GCCGGGACCA TGAGAGAGGC GGGTCAGTCT GGGCTCCAAG CTCAGCCTCT 000960
000961 CGGATTCCCC GGGACCCACG GCTTATAATG CGCTTAAATC CCACGCCTCG CCCGAGAGAC AGCACGTCAC CGTCACCGTC 001040
001041 ACCGCCTAGC GCCCCTGACC CGCTCCCACT CCGCTGCAGC GGAGGGTGTG TGAGGGAGAG GGACGCAGGG AGGGAAAAGC 001120
001121 GTTGGGAGGG CAAACATCTT TTCATAAGCT TTTCCCCTTC TATATGCCAT CTCTGATGGG AGCCTCTTTA GATCTTTCGT 001200
001201 CCATTTACTA ATTGGGTTGT TCGATTTCTT ATTGTTGAGT TGTAAGTGGT TTTTAATGGT CTGGATGCCA GACAGGTGTT 001280
001281 TTGCAAATAT TTTCTCCGTC TGTGGCTTGT TTCTCCATTC TCTTATTTCC TTTCCCAGAG CAAAAGTTTT TAATTGTAAC 001360
001361 GACTTCATAC CAATATCTTC TTTCATGGTA GAAATTTGTC TTTTATGTAC TTTACTGTTG TATCTACAAA GTAATTGCCA 001440
001441 AACCCAAAGT TACCTAAATA TCCCTTTTGT TATTTTACAG AAGTTTTACA GCTTTTGGAT TTAAATTTAG GTCTAAAAAT 001520
001521 TGAATTCATA AAAGTAGAGA GTATAATGAT GCTTACCAGA GGCTGTGTTG CTGGGGAGAA AAATGGGAAG TTGTTCACAG 001600
001601 ATGCAAAGAT TCAGTTAAAC CGGAGAAATG CGTTTTGAAA TCTATTGCAC AGCAGGATGA CTATAGTCAA TAATAATGTA 001680
001681 CTGTATATTT AAATAACAAA AAGTAAATTT CAAATATCTA ACCACAAAAA AAAAAAAAAA A
Predicted Small Protein
Name | FAM201A_smProtein_185:379 |
Length | 64 |
Molecular weight | 6456.4265 |
Aromaticity | 0.03125 |
Instability index | 83.928125 |
Isoelectric point | 9.00665283203 |
Runs | 9 |
Runs residual | 0.00796875 |
Runs probability | 0.0270073995564 |
Amino acid sequence | MWGVLAGSGVAWRLLPAHRRRMQLGCPGGLEDSGLGSQGPPGPGAGNHICPQPQPPAPVI LAAP |
Secondary structure | LEEEELLLLHHHHLLHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLEE EELL |
PRMN | - |
PiMo | - |