FAM226A

From LncRNAWiki
Revision as of 05:42, 23 June 2016 by Lin Liu (talk | contribs) (Created page with "==Annotated Information== ===Approved Symbol=== FAM226A ===Approved Name=== family with sequence similarity 226 member A (non-protein coding) ===Previous Symbols=== CXorf50, N...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Annotated Information

Approved Symbol

FAM226A

Approved Name

family with sequence similarity 226 member A (non-protein coding)

Previous Symbols

CXorf50, NCRNA00246, NCRNA00246A, LINC00246A

Synonyms

MGC34827

Chromosome

Xq13

RefSeq ID

NR_026595

OMIM ID

_

Ensembl ID

_

pubmed IDs

12477932

Sequence

>gi|532525026|ref|NR_026595.2| Homo sapiens family with sequence similarity 226 member A (non-protein coding) (FAM226A), long non-coding RNA

000001 GGGCGGCGGT CTCCGACTCA AGACCCTGGG GCTCCGGGTC TTCTTAGCAG CCGCCGCAGC AGCCGCGGCG ACGTCACTGT 000080
000081 CCTCTCGGCC TGGCACACAG CGCTCCGGCC GCCGAATGCC CGTGGACGCG CATCTCTTCC CAGAGTTCCT GCTTCCTGGG 000160
000161 CCAGCCAAAG CCGCAGGTGA GAACGTCCGC GGTAGAGGTG ACACCCAGGG CCTGCGGGTC TCAGAGGCCC GGTGGCCTGC 000240
000241 TCCTCTCCTC CTTGCCCAAG AAACCCCCAC GGGGGCGCCT GCAGCGGGGA CACCAAGTGC TCCCCGGAGC CCTGGGAGCA 000320
000321 CTTCTTCTGA GGCATGGAGC CCCTGCTCAG GTGCTTCTGA AAGTTCCTTT GTGGACCCCA GAAACCCCTA GGCAACTGTC 000400
000401 TGCCCCGCCC CGCGCCCCCG CCGCCCCCCG CCCCCGCAGT ACCCCTCCGG CCCCCGACAT CCCTGATGCG CATGCCCAGG 000480
000481 GGCCCTGTGA GCTAGAAAGT AGCTGAGCTG GTGCAGTACC TGCTGGTTAA GAACCGGAAG ACGGTGGCGA TCAAAAGGGC 000560
000561 AGACATGCTG AAGTATGTCA TCAAAAGGTA CAGGAGCTTC CTCCCTGAGA TTTTCAAGAA AGCCTCTGAC CTCCCCGAGT 000640
000641 TAGTCTTTGG GTTCTATCTG AAGGAACTTG ATCCAGCAGA GCACTCCTAT GTCTTGATCA GAAAAATCGA TCCTGCCCTG 000720
000721 GTTTGGGGCC TGACAGGCGA CCAGGGCACA CCAAAGACCC GGCTCCTGAT GATTACTCTG GACTCGATCT TCATGCAGGC 000800
000801 CAGCTGTGTC CCCGAGGAGG TGGTCTGGGA GGTGTTGAGG GTGTTGGAGG CACATTTCGT CTAAAAAGCA TTTCGTCTTT 000880
000881 GGGGAGTCCA TGAAGCTCAT CACCAAAGCT AGTGTGCAGC AGGAGTATCT GGTGCACAAA TAGGTGTCCC ACAGCAATCC 000960
000961 CACGCTCTAG GTATTCTTGT GGGGGCTCTC AAAGGAAACA AGACAGATGG AAGTCCCGGA GTTTGTGGCC AAAGTGAATG 001040
001041 ACACCCACCC CAGTTCCTTT CCGTGGCAGT AAATGAGGCA TTGAGAGAAG AGGAGGAGAT ACCCCGTGCC CGAGATGGCA 001120
001121 GCTGCCGTTG GTGACGTGAC AAGTGCCAGT GTTAGTGCCA GTTCCAGTTC CAGGGCCTAT GCCATGGCTG AAGCAAGCAT 001200
001201 CAGCACCAGC ACCAATGCAA GTGCCAGTTG CCAGTGCCAG GGCTAGAGCC ATGGCTGGAG CAAGTATCAG TACCTGGGAC 001280
001281 AGTGCCAGTG TGGTTGCAAT CTCAGGCTAG TGTCAGGGAC TCCTCCTGCC AGCAGTGAAG GCTGGGGCTG AGTCTTCACT 001360
001361 TTGTTTTGCT GTGGGTAGTC AAAAGGGCCC CACAGCAGTG GGTGCTGGGG TCCTGACTTT TCAAGAGTCG AGGGGTAGAG 001440
001441 TGGGGTTAGG GAGAACCTGC CGCCCATGGT ATCTGTGTTC CAGTTCTATT TGTCTTTCTC AGTGATTTAG CTTTCAATTT 001520
001521 GCATACTGCA AAGTTTTGTT TGCCTTAATT AACTTTCTTT TATAATGATG ATCATTTTAC AGGAAATAAA CTGGTTAAAA 001600
001601 CTACATGATG ACAGAATTAT ATCAGAGTTG AAACAAACGC CATACCTAAG CATTTTTTTC AAAATCCTTT GTTCCATAGA 001680
001681 CACTTGATTG AGTACTTAAG TTGAACATCT AGGTCTATGA ATGACGTTGG TCAAATGTTT TATTGTTCTC TGTTTCGGTT 001760
001761 TTAGCAGGAG AGATTTGCTG TTTCATAAAA GAAATTGGGA GAGTATATCA TTTTATGCCT GTAACTTATT ATAGCATTGG 001840
001841 AATAAGCTGT TCTTTGGAGG TTTGAGAGAC TTTACCAGTA CAATCATTCC CCCCCTCCCC CAAATATAAA AAGATAAAAT 001920
001921 AAAAAGCCGG TCAGTGTCTG TTGCACAAAA TTACAACCGC TCTCTGCTTG TATTTGCCTA GTTCTCCAGA ATGTAGGGAA 002000
002001 AAATAAAAAT TCAATGAATT AGA

Predicted Small Protein

Name FAM226A_smProtein_1178:1309
Length 43
Molecular weight 4326.9747
Aromaticity 0.046511627907
Instability index 44.723255814
Isoelectric point 10.8344116211
Runs 7
Runs residual 0.0054873268879
Runs probability 0.044893633129
Amino acid sequence MPWLKQASAPAPMQVPVASARARAMAGASISTWDSASVVAISG
Secondary structure LLHHHHLLLLLLLLLLHHHHHHHHHHLLLLLLLLLLEEEEELL
PRMN -
PiMo -