FAM226A
Contents
Annotated Information
Approved Symbol
FAM226A
Approved Name
family with sequence similarity 226 member A (non-protein coding)
Previous Symbols
CXorf50, NCRNA00246, NCRNA00246A, LINC00246A
Synonyms
MGC34827
Chromosome
Xq13
RefSeq ID
NR_026595
OMIM ID
_
Ensembl ID
_
pubmed IDs
12477932
Sequence
>gi|532525026|ref|NR_026595.2| Homo sapiens family with sequence similarity 226 member A (non-protein coding) (FAM226A), long non-coding RNA
000001 GGGCGGCGGT CTCCGACTCA AGACCCTGGG GCTCCGGGTC TTCTTAGCAG CCGCCGCAGC AGCCGCGGCG ACGTCACTGT 000080
000081 CCTCTCGGCC TGGCACACAG CGCTCCGGCC GCCGAATGCC CGTGGACGCG CATCTCTTCC CAGAGTTCCT GCTTCCTGGG 000160
000161 CCAGCCAAAG CCGCAGGTGA GAACGTCCGC GGTAGAGGTG ACACCCAGGG CCTGCGGGTC TCAGAGGCCC GGTGGCCTGC 000240
000241 TCCTCTCCTC CTTGCCCAAG AAACCCCCAC GGGGGCGCCT GCAGCGGGGA CACCAAGTGC TCCCCGGAGC CCTGGGAGCA 000320
000321 CTTCTTCTGA GGCATGGAGC CCCTGCTCAG GTGCTTCTGA AAGTTCCTTT GTGGACCCCA GAAACCCCTA GGCAACTGTC 000400
000401 TGCCCCGCCC CGCGCCCCCG CCGCCCCCCG CCCCCGCAGT ACCCCTCCGG CCCCCGACAT CCCTGATGCG CATGCCCAGG 000480
000481 GGCCCTGTGA GCTAGAAAGT AGCTGAGCTG GTGCAGTACC TGCTGGTTAA GAACCGGAAG ACGGTGGCGA TCAAAAGGGC 000560
000561 AGACATGCTG AAGTATGTCA TCAAAAGGTA CAGGAGCTTC CTCCCTGAGA TTTTCAAGAA AGCCTCTGAC CTCCCCGAGT 000640
000641 TAGTCTTTGG GTTCTATCTG AAGGAACTTG ATCCAGCAGA GCACTCCTAT GTCTTGATCA GAAAAATCGA TCCTGCCCTG 000720
000721 GTTTGGGGCC TGACAGGCGA CCAGGGCACA CCAAAGACCC GGCTCCTGAT GATTACTCTG GACTCGATCT TCATGCAGGC 000800
000801 CAGCTGTGTC CCCGAGGAGG TGGTCTGGGA GGTGTTGAGG GTGTTGGAGG CACATTTCGT CTAAAAAGCA TTTCGTCTTT 000880
000881 GGGGAGTCCA TGAAGCTCAT CACCAAAGCT AGTGTGCAGC AGGAGTATCT GGTGCACAAA TAGGTGTCCC ACAGCAATCC 000960
000961 CACGCTCTAG GTATTCTTGT GGGGGCTCTC AAAGGAAACA AGACAGATGG AAGTCCCGGA GTTTGTGGCC AAAGTGAATG 001040
001041 ACACCCACCC CAGTTCCTTT CCGTGGCAGT AAATGAGGCA TTGAGAGAAG AGGAGGAGAT ACCCCGTGCC CGAGATGGCA 001120
001121 GCTGCCGTTG GTGACGTGAC AAGTGCCAGT GTTAGTGCCA GTTCCAGTTC CAGGGCCTAT GCCATGGCTG AAGCAAGCAT 001200
001201 CAGCACCAGC ACCAATGCAA GTGCCAGTTG CCAGTGCCAG GGCTAGAGCC ATGGCTGGAG CAAGTATCAG TACCTGGGAC 001280
001281 AGTGCCAGTG TGGTTGCAAT CTCAGGCTAG TGTCAGGGAC TCCTCCTGCC AGCAGTGAAG GCTGGGGCTG AGTCTTCACT 001360
001361 TTGTTTTGCT GTGGGTAGTC AAAAGGGCCC CACAGCAGTG GGTGCTGGGG TCCTGACTTT TCAAGAGTCG AGGGGTAGAG 001440
001441 TGGGGTTAGG GAGAACCTGC CGCCCATGGT ATCTGTGTTC CAGTTCTATT TGTCTTTCTC AGTGATTTAG CTTTCAATTT 001520
001521 GCATACTGCA AAGTTTTGTT TGCCTTAATT AACTTTCTTT TATAATGATG ATCATTTTAC AGGAAATAAA CTGGTTAAAA 001600
001601 CTACATGATG ACAGAATTAT ATCAGAGTTG AAACAAACGC CATACCTAAG CATTTTTTTC AAAATCCTTT GTTCCATAGA 001680
001681 CACTTGATTG AGTACTTAAG TTGAACATCT AGGTCTATGA ATGACGTTGG TCAAATGTTT TATTGTTCTC TGTTTCGGTT 001760
001761 TTAGCAGGAG AGATTTGCTG TTTCATAAAA GAAATTGGGA GAGTATATCA TTTTATGCCT GTAACTTATT ATAGCATTGG 001840
001841 AATAAGCTGT TCTTTGGAGG TTTGAGAGAC TTTACCAGTA CAATCATTCC CCCCCTCCCC CAAATATAAA AAGATAAAAT 001920
001921 AAAAAGCCGG TCAGTGTCTG TTGCACAAAA TTACAACCGC TCTCTGCTTG TATTTGCCTA GTTCTCCAGA ATGTAGGGAA 002000
002001 AAATAAAAAT TCAATGAATT AGA
000081 CCTCTCGGCC TGGCACACAG CGCTCCGGCC GCCGAATGCC CGTGGACGCG CATCTCTTCC CAGAGTTCCT GCTTCCTGGG 000160
000161 CCAGCCAAAG CCGCAGGTGA GAACGTCCGC GGTAGAGGTG ACACCCAGGG CCTGCGGGTC TCAGAGGCCC GGTGGCCTGC 000240
000241 TCCTCTCCTC CTTGCCCAAG AAACCCCCAC GGGGGCGCCT GCAGCGGGGA CACCAAGTGC TCCCCGGAGC CCTGGGAGCA 000320
000321 CTTCTTCTGA GGCATGGAGC CCCTGCTCAG GTGCTTCTGA AAGTTCCTTT GTGGACCCCA GAAACCCCTA GGCAACTGTC 000400
000401 TGCCCCGCCC CGCGCCCCCG CCGCCCCCCG CCCCCGCAGT ACCCCTCCGG CCCCCGACAT CCCTGATGCG CATGCCCAGG 000480
000481 GGCCCTGTGA GCTAGAAAGT AGCTGAGCTG GTGCAGTACC TGCTGGTTAA GAACCGGAAG ACGGTGGCGA TCAAAAGGGC 000560
000561 AGACATGCTG AAGTATGTCA TCAAAAGGTA CAGGAGCTTC CTCCCTGAGA TTTTCAAGAA AGCCTCTGAC CTCCCCGAGT 000640
000641 TAGTCTTTGG GTTCTATCTG AAGGAACTTG ATCCAGCAGA GCACTCCTAT GTCTTGATCA GAAAAATCGA TCCTGCCCTG 000720
000721 GTTTGGGGCC TGACAGGCGA CCAGGGCACA CCAAAGACCC GGCTCCTGAT GATTACTCTG GACTCGATCT TCATGCAGGC 000800
000801 CAGCTGTGTC CCCGAGGAGG TGGTCTGGGA GGTGTTGAGG GTGTTGGAGG CACATTTCGT CTAAAAAGCA TTTCGTCTTT 000880
000881 GGGGAGTCCA TGAAGCTCAT CACCAAAGCT AGTGTGCAGC AGGAGTATCT GGTGCACAAA TAGGTGTCCC ACAGCAATCC 000960
000961 CACGCTCTAG GTATTCTTGT GGGGGCTCTC AAAGGAAACA AGACAGATGG AAGTCCCGGA GTTTGTGGCC AAAGTGAATG 001040
001041 ACACCCACCC CAGTTCCTTT CCGTGGCAGT AAATGAGGCA TTGAGAGAAG AGGAGGAGAT ACCCCGTGCC CGAGATGGCA 001120
001121 GCTGCCGTTG GTGACGTGAC AAGTGCCAGT GTTAGTGCCA GTTCCAGTTC CAGGGCCTAT GCCATGGCTG AAGCAAGCAT 001200
001201 CAGCACCAGC ACCAATGCAA GTGCCAGTTG CCAGTGCCAG GGCTAGAGCC ATGGCTGGAG CAAGTATCAG TACCTGGGAC 001280
001281 AGTGCCAGTG TGGTTGCAAT CTCAGGCTAG TGTCAGGGAC TCCTCCTGCC AGCAGTGAAG GCTGGGGCTG AGTCTTCACT 001360
001361 TTGTTTTGCT GTGGGTAGTC AAAAGGGCCC CACAGCAGTG GGTGCTGGGG TCCTGACTTT TCAAGAGTCG AGGGGTAGAG 001440
001441 TGGGGTTAGG GAGAACCTGC CGCCCATGGT ATCTGTGTTC CAGTTCTATT TGTCTTTCTC AGTGATTTAG CTTTCAATTT 001520
001521 GCATACTGCA AAGTTTTGTT TGCCTTAATT AACTTTCTTT TATAATGATG ATCATTTTAC AGGAAATAAA CTGGTTAAAA 001600
001601 CTACATGATG ACAGAATTAT ATCAGAGTTG AAACAAACGC CATACCTAAG CATTTTTTTC AAAATCCTTT GTTCCATAGA 001680
001681 CACTTGATTG AGTACTTAAG TTGAACATCT AGGTCTATGA ATGACGTTGG TCAAATGTTT TATTGTTCTC TGTTTCGGTT 001760
001761 TTAGCAGGAG AGATTTGCTG TTTCATAAAA GAAATTGGGA GAGTATATCA TTTTATGCCT GTAACTTATT ATAGCATTGG 001840
001841 AATAAGCTGT TCTTTGGAGG TTTGAGAGAC TTTACCAGTA CAATCATTCC CCCCCTCCCC CAAATATAAA AAGATAAAAT 001920
001921 AAAAAGCCGG TCAGTGTCTG TTGCACAAAA TTACAACCGC TCTCTGCTTG TATTTGCCTA GTTCTCCAGA ATGTAGGGAA 002000
002001 AAATAAAAAT TCAATGAATT AGA
Predicted Small Protein
Name | FAM226A_smProtein_1178:1309 |
Length | 43 |
Molecular weight | 4326.9747 |
Aromaticity | 0.046511627907 |
Instability index | 44.723255814 |
Isoelectric point | 10.8344116211 |
Runs | 7 |
Runs residual | 0.0054873268879 |
Runs probability | 0.044893633129 |
Amino acid sequence | MPWLKQASAPAPMQVPVASARARAMAGASISTWDSASVVAISG |
Secondary structure | LLHHHHLLLLLLLLLLHHHHHHHHHHLLLLLLLLLLEEEEELL |
PRMN | - |
PiMo | - |