Difference between revisions of "Os03g0307800"
Fig. 4. (A) Phylogenetic tree of SET domain-containing proteins from different organisms, based on the alignment of amino acid sequences of the SET domain. The proteins are grouped into subgroups I–IV following the classification of Jenuwein et al. (1998), which depends on homology in the SET domain (from reference [2]).
Fig. 4. (B) Alignment of the deduced amino acid sequence of OsiEZ1 with amino acid sequences of other plant SET domain-containing proteins belonging to subgroup I. Names of the domains are given at the top. Fully and partially conserved amino acid residues are highlighted with black and grey boxes, respectively. Numerals indicate the positions of amino acids (from reference [2]).
Revision as of 17:04, 6 June 2014
Annotated Information
Function
A novel SET-domain-containing gene, OsSET1, was isolated from rice (Oryza sativa L.). Its deduced protein consists of 895 amino acids. OsSET1 has a high degree of structural similarity to other SET-domain-containing genes such as CLF in higher plants and E(z) in animals [1].
SET domains are conserved amino acid sequences present in chromosomal proteins that contribute to the epigenetic control of gene expression by altering the regional organization of chromatin structure. SET domain proteins are divided into four subgroups, named after their Drosophila members: enhancer of zeste (E(Z)), trithorax (TRX), absent small or homeotic 1 (ASH1) and suppressor of variegation (SU(VAR)3–9). Homologs of all four classes have been characterized in yeast, mammals and plants.
We report here the isolation and characterization of a rice (Oryza sativa L. subspecies indica) cDNA, OsiEZ1, as a monocot member of this family. The OsiEZ1 cDNA is 3133 bp long with an ORF of 2799 bp, and the predicted amino acid sequence (895 residues) corresponds to a protein of ca. 98 kDa. All the characteristic domains known to be conserved in E(Z) homologs (subgroup I) of SET domain-containing proteins are present in OsiEZ1. In the rice genome, the 7499 bp OsiEZ1 sequence is split into 17 exons interrupted by 16 introns. Southern analysis indicates that OsiEZ1 is present as a single copy in the rice genome.
Expression studies revealed that the OsiEZ1 transcript level was highest in rice flowers, almost undetectable in developing seeds 1–2 days post-fertilization, and significantly increased in young seeds 3–5 days post-fertilization. The OsiEZ1 transcript was barely detectable in mature zygotic embryos, but its levels were significantly higher in callus derived from rice scutellum, in somatic embryos and in young seedlings. The OsiEZ1/GUS recombinant protein was confined to the nucleus in living cells of particle-bombarded onion peels. Expression of OsiEZ1 complemented a set1Δ Saccharomyces cerevisiae mutant that is impaired in telomeric silencing.
We suggest that the nuclear-localized OsiEZ1 has a role in regulating various aspects of plant development, and this control is most likely brought about by repressing the activity of downstream regulatory genes [2].
Expression
So far, 14 rice SET-domain-containing genes can be found in the SMART database. In contrast to the two putative OsCLFs (AP005813; AP003044), the other 11 putative SET-containing genes have low sequence similarity to OsSET1, even in the SET domain. Detailed sequence analysis revealed that the OsSET1 gene has all known conserved regions, e.g. SET-N and SET-C in the SET domain, but lacks post-SET (Fig. 1). Like other plant SET-domain-containing genes such as CLF and MEZ1-3, OsSET1 has only a cysteine-rich region and no pre-SET domain. Based on these sequence characteristics, OsSET1 could be grouped into the SET1 family.
The expression pattern of OsSET1 was similar to that of the SET-domain-containing genes investigated in Arabidopsis and maize in that it lacked organ specificity (data not shown). A transient expression assay revealed that a fusion protein of OsSET1 and green fluorescent protein (GFP) was located in the nuclei (Fig. 2). This is also similar to other SET-containing proteins such as E(z) and CLF. To investigate the function of the OsSET1 gene, a series of transgenic Arabidopsis and rice lines was constructed. Among them, about 53.8% of the transgenic Arabidopsis plants over-expressing the SET domain showed altered shoot development (Fig. 3B) as well as large cotyledons (Fig. 3A, B). No tunica-corpus structure was observed in sections of the shoot apex of transgenic plants with abnormal shoots (Fig. 3D). Further investigation of the function of OsSET1 is still being undertaken [1].
To isolate SET-domain genes from rice, a conserved SET-domain sequence was first amplified by RT-PCR using degenerate primers designed from the published sequences of CLF, E(z) and MEA (CEM1: 5′-TCTGA(TC)T(TC)(TCG)(AC)(TC)GG(TAC)TGGGGTGC-3′; CEM2: 5′-GC(AT)(TC)C(TAC)TCTGG(TC)(CT)C(AG)TA(GCT)C(AGT)GTA-3′). A 344 bp PCR product was cloned into the pGEM-T Easy vector (Promega). After the PCR product was confirmed as a SET domain by sequencing, it was used as a probe to screen a cDNA library constructed from young panicles. A full-length cDNA containing the SET domain was obtained by conducting 5′-RACE after the library screening. This cDNA is 2957 bp long and contains an ORF that encodes a putative protein of 895 amino acids with a calculated molecular mass of 99.8 kDa. The gene was designated OsSET1 (GenBank accession number AF407010). It maps to chromosome 3 of the rice genome, on contig 1300 (http://www.softberry.com/berry.phtml?topic=gfind&prg=FGENESH; GenBank accession number AAAA01003815). Interestingly, FGENESH 1.1 predicted five genes on this contig and split the OsSET1 cDNA sequence between predicted genes four and five. The OsSET1 cDNA sequence corrects this prediction. According to the rice genome sequence data, OsSET1 contains 17 exons (data not shown) [1].
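The bracketed positions in the CEM1 and CEM2 primers denote alternative bases, so each primer stands for a pool of concrete oligonucleotides. The following is a minimal Python sketch of how that notation can be read. The function names (`parse_degenerate`, `degeneracy`, `expand`) are hypothetical helpers introduced here for illustration and do not come from reference [1]; the sketch assumes that every bracket lists the bases allowed at a single position.

```python
import re
from itertools import product
from math import prod

# Primer strings as given in the text (whitespace is ignored when parsing).
CEM1 = "TCTGA(TC)T(TC)(TCG)(AC)(TC)GG(TAC)TGGGGTGC"
CEM2 = "GC(AT)(TC)C(TAC)TCTGG(TC)(CT)C(AG)TA(GCT)C(AGT)GTA"


def parse_degenerate(primer):
    """Return one string of allowed bases per primer position."""
    cleaned = re.sub(r"\s+", "", primer.upper())
    tokens = re.findall(r"\(([ACGT]+)\)|([ACGT])", cleaned)
    # Make sure the pattern consumed every character of the input.
    consumed = sum(len(group) + 2 if group else 1 for group, _ in tokens)
    if consumed != len(cleaned):
        raise ValueError("unexpected character in primer")
    return [group or single for group, single in tokens]


def degeneracy(primer):
    """Number of distinct oligonucleotides represented by the primer."""
    return prod(len(position) for position in parse_degenerate(primer))


def expand(primer):
    """Yield every concrete sequence encoded by the degenerate primer."""
    for combo in product(*parse_degenerate(primer)):
        yield "".join(combo)


if __name__ == "__main__":
    for name, primer in (("CEM1", CEM1), ("CEM2", CEM2)):
        print(name, len(parse_degenerate(primer)), "nt,",
              degeneracy(primer), "variants")
```

Under this reading, the degeneracy of a primer is simply the product of the number of alternatives at each position, which is why a handful of ambiguous sites already yields a sizeable primer pool.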
Evolution
Phylogenetic analysis performed with DNASTAR MegAlign 4.03, based on homology in the SET domain, classifies SET domain-containing proteins into different subgroups and shows that OsiEZ1 belongs to subgroup I, which also includes AtMEA, AtCLF, OsCLF, AtEZA1 and AtPcG (Fig. 4A). The multiple sequence alignment of these proteins in conserved regions was performed with Gene Runner version 3.04 and DNASTAR MegAlign 4.03, and is shown in Fig. 4B [2].
Key of sequence designations: AAB80647, AAK28967, CAB41104, AAC61820, AAD15582, AAK28966, AAD55657, AAD26896, AtEZA, AtCLF1, AtPCG, AtCLF, CAA71599, AtMEDEA, AtMEALIKE, AAC23419, CAB75815, AAC23419, AAC34358 and AAF04434, all from Arabidopsis thaliana; NtSET1, from Nicotiana tabacum; CLR4, from Schizosaccharomyces pombe; G9a and ENX1, both from Homo sapiens; SET1 and SET2, both from S. cerevisiae; TRX, DME(Z), DMEZ2MM, DMEZA2 and DMEZH2, all from Drosophila melanogaster; CEMES2, from C. elegans [2].
Labs working on this gene
Please input related labs here.
References
Please input cited references here.
Structured Information
| Gene Name | Os03g0307800 |
|---|---|
| Description | SET domain-containing protein |
| Version | NM_001056434.1 GI:115452596 GeneID:4332612 |
| Length | 7498 bp |
| Definition | Oryza sativa Japonica Group Os03g0307800, complete gene. |
| Source | Oryza sativa Japonica Group ORGANISM Oryza sativa Japonica Group Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BEP clade; Ehrhartoideae; Oryzeae; Oryza. |
| Chromosome | |
| Location | Chromosome 3:11003001..11010498 |
| Sequence Coding Region | 11003231..11003419,11003494..11003571,11003663..11003791,11003880..11003927,11004018..11004108 |
| Expression | |
| Genome Context | <gbrowseImage1> name=NC_008396:11003001..11010498 source=RiceChromosome03 preset=GeneLocation </gbrowseImage1> |
| Gene Structure | <gbrowseImage2> name=NC_008396:11003001..11010498 source=RiceChromosome03 preset=GeneLocation </gbrowseImage2> |
| Coding Sequence |
<cdnaseq>atggcgtcgtcctcgtccaaggcctccgattcctcttcccaacgccccaagcggccggatcaagggccgtcgggcaaggacgcggccggcctcgtcgcgctgcacgggaagctggcgcagctgaagcgtcaggtccaatccacccgcctcgccgcgatcaaggagagggtggaggcgaacaggaaggcgctgcaggtgcacacgtgcgctctgttcgacgtcgccgcggcggcggaggtggcgtcgcggggcgccgagggcggcaacgcgctgtcgcggggcgcggcggaggggcaccgcaggttcgtggggtgggactcggcgagcgggccgggggagagggagttggtgcatgtgcaggaggagaatctggtcgccgggacgctcgtgctctcgagctctggtggtagcggcgcctcgcataggaccgtcgtgcagcttgtgaagctgcctgtggtcgacaagattccgccctacaccacgtggatcttcctggacaaaaaccaaagaatggccgatgatcagtcagttggtaggaggagaatttactacgacccaattgtcaatgaggctctgatctgcagtgaaagtgacgatgatgttccagagccagaggaagagaaacatgttttcacagaaggagaagatcagctaatatggaaagctactcaagatcatgggttaagtcgagaggttttaaatgtcctctgccagtttgttgatgcaactccttcagaaattgaggaaagatcagaagttctttttgagaaatatgagaagcagtctcaatcttcttacaagacagatttgcaactttttcttgacaagaccatggatgtggctttagattcttttgataatctcttctgtcggagatgtttggtttttgattgccgtctccatgggtgctcccagaacttggtattccctagcgagaagcaaccatatggtcatgaacttgatgaaaacaagagaccgtgtggcgatcagtgctaccttcgaaggagagaagtatatcaagatacgtgcaatgatgaccgaaatgcttgtacaacatataatatggattcaagatcttcctcactcaaagttagtgctaccatattgtctgaatcagaagattcaaacagagatgaagataacatcaaatccacttctattgttgaaaccagcagatcaaaaataactaattctgaatatgctgacaaaagtgtgacaccacctcctggagatgcttctgaaactgaaaatgtgtcccctgacatgcccctaagaactttaggcaggcgtaagatttcaaagcatgcctccaagtctaacgatcattcacctgataaaaggcagaagatatatagctcaccgtttccttttgcaatgagtgtactgaacaagcaatctgttccagaaattggtgagacatgtccagattccatagaatctgcagttgatcaacttccaagtctggatgaccctaacaagaaaatttctaccaaagatatgtgtgctggaagcacaactaacactactgaaaatacattacgagataataataataatttgttcatctccaacaaggagcactctatttctcattggagtgctttagagagagatttgtacttgaagggaattgagatatttgggaaaaacagttgtctcatagctagaaacctattgtctggcctgaagacctgcatggaagtggccagctacatgtacaacaatggtgcggcaatggcaaagagacctctatctggtaaatccattttaggtgactttgcagaggctgaacaaggttacatggagcaagatttggtggcaaggacaagaatctgtcgtcgtaagggccgagctcgaaagctcaaatacacttggaagtctgcagggcatccaactgtaagaaaaagaatcggtgatggaaagcaatggtacactcagtataacccatgtgggtgtcagcaaatgtgtggcaaagattgcgcctgtgtgga
aaatggaacttgttgcgagaagtactgcgggtgctcaaagagctgcaaaaataggtttagaggatgtcattgcgcaaaaagtcaatgcagaagcagacagtgcccgtgttttgctgccagtcgtgaatgtgatccagatgtttgcagaaactgctgggtgagctgcggagatggctcactaggtgagccactggcaagaggtgatggctatcagtgtggaaacatgaaactcctcttaaaacaacaacaacgtatattgcttggaaaatctgatgttgcgggttggggtgcattcattaagaacccagtaaatagaaatgattaccttggtgaatacactggcgaattgatttctcatagagaagcagataagcgtggcaaaatatatgatcgagcaaattcatcgttcctatttgatttaaatgagcagtatgtactggatgcttatcgcaagggggataaactgaagtttgcaaatcactcgtcgaatcctaactgctatgcgaaggttatgttggtggctggcgatcatcgagttggtatctatgcaaaggaccgcattgaggctagcgaggaactcttttatgattaccgctatggacctgaccaagccccagcttgggctaggagaccggaagggtcaaaaaaggatgaagcatctgtctctcaccaccgagcgcacaaagttgctagatag</cdnaseq> |
| Protein Sequence |
<aaseq>MASSSSKASDSSSQRPKRPDQGPSGKDAAGLVALHGKLAQLKRQ VQSTRLAAIKERVEANRKALQVHTCALFDVAAAAEVASRGAEGGNALSRGAAEGHRRF VGWDSASGPGERELVHVQEENLVAGTLVLSSSGGSGASHRTVVQLVKLPVVDKIPPYT TWIFLDKNQRMADDQSVGRRRIYYDPIVNEALICSESDDDVPEPEEEKHVFTEGEDQL IWKATQDHGLSREVLNVLCQFVDATPSEIEERSEVLFEKYEKQSQSSYKTDLQLFLDK TMDVALDSFDNLFCRRCLVFDCRLHGCSQNLVFPSEKQPYGHELDENKRPCGDQCYLR RREVYQDTCNDDRNACTTYNMDSRSSSLKVSATILSESEDSNRDEDNIKSTSIVETSR SKITNSEYADKSVTPPPGDASETENVSPDMPLRTLGRRKISKHASKSNDHSPDKRQKI YSSPFPFAMSVLNKQSVPEIGETCPDSIESAVDQLPSLDDPNKKISTKDMCAGSTTNT TENTLRDNNNNLFISNKEHSISHWSALERDLYLKGIEIFGKNSCLIARNLLSGLKTCM EVASYMYNNGAAMAKRPLSGKSILGDFAEAEQGYMEQDLVARTRICRRKGRARKLKYT WKSAGHPTVRKRIGDGKQWYTQYNPCGCQQMCGKDCACVENGTCCEKYCGCSKSCKNR FRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDGSLGEPLARGDGYQCGNMK LLLKQQQRILLGKSDVAGWGAFIKNPVNRNDYLGEYTGELISHREADKRGKIYDRANS SFLFDLNEQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKDRIEASEE LFYDYRYGPDQAPAWARRPEGSKKDEASVSHHRAHKVAR</aaseq> |
| Gene Sequence |
<dnaseqindica>7080..7268#6928..7005#6708..6836#6572..6619#6391..6481#6076..6207#5638..5855#4453..4603#3715..4364#2938..3009#2739..2786#2482..2619#2279..2363#1502..1660#586..922#353..464#215..265#cggcggggtctcatccgattggaaacagattgggaagggggagggggtaggaatacgtggcgtcggcagtattaggtagagagagaaaccctttccatcctttgtctcttagccccgaaggagagagaaaaatcagaaaaaaaaaaccctccgcgtgtgggggaagcagagctccggacgctggcgccgctcgcgccaccgcacccgcaccgccatggcgtcgtcctcgtccaaggcctccgattcctcttcccaacgccccaaggtgcgcgccctcgcctcagtctcgactccccgcgcccgattccccactccgatctccttccccaatctgatcggcctcgctgtgcagcggccggatcaagggccgtcgggcaaggacgcggccggcctcgtcgcgctgcacgggaagctggcgcagctgaagcgtcaggtccaatccacccgcctcgccgcgatcaaggcaagcgcccgcccgccgatcacgcatctctacagtttcagctcgcagggggaagtgtgagtagtcggttggattttgctcaatctggcgtggattggtttggtttgtggtttttgagcaggagagggtggaggcgaacaggaaggcgctgcaggtgcacacgtgcgctctgttcgacgtcgccgcggcggcggaggtggcgtcgcggggcgccgagggcggcaacgcgctgtcgcggggcgcggcggaggggcaccgcaggttcgtggggtgggactcggcgagcgggccgggggagagggagttggtgcatgtgcaggaggagaatctggtcgccgggacgctcgtgctctcgagctctggtggtagcggcgcctcgcataggaccgtcgtgcagcttgtgaagctgcctgtggtcgacaagattccgccctacaccacgtggatcttcctggacaagtataatgcttctacacgctcattgttccaatcagatcacttgtcgctaggttcttgaacttgcagtttaagttttacaagagttcaggagaagttggtatattgccatgtaatacttcttaaataaaaagtagggtatgccatgtaatgcacctagatatgatattgtggaaaatatactactcaaaattcacttatgatgcatgaagcaaaattgaactatttacccatacaagccatagaaaccagacaacattttcagacagatattgcaaacttgggactagtagaaatatttggaatataagtgttagctcaaacattacactcaggtagtggtgttcatgtgtttttatgtttgctgatgagcgagagcactagtttgtgttaggatgatggtcagtttgaaaagattcagctacttgaacgatgtgtcatgctacttatcttcagtttatttgcttgattaattccaaagtggtatttattttctcatatacgtgctgcattggtgtaacaaagaatacttgatccctgttgtattcgatattttactctggtttaaacaacacagcgcagaaaccaaagaatggccgatgatcagtcagttggtaggaggagaatttactacgacccaattgtcaatgaggctctgatctgcagtgaaagtgacgatgatgttccagagccagaggaagagaaacatgttttcacagaaggagaagatcagctaatatggtaaaaacctggacattcaagaattctctctttttttttcattgaatatgctgtgatttttcccaatcatgatatctttctgttgcattttcttttcaataaaatgattcagatctaagttaaacatggaagaaggcccagagcc
atgaacttacattaacataacaaaccatccgtgcaaagttagttgcgctgtgcagttgttttactcagtctgcatgattaatccacctgttttctaccttaaggatcgtgcatactctaagagcttccaaccttcatgtaccctgttaattctgaaatgtaggtggaaggtaatgtgttgtattaaatcatctttttaggatggatcaacggatgccattttaatttaaccaatgcattattggtgaaatactggctggtttgactttacacttccagttgtgacaataatctgtaacccaagggcacccttcttttatttaccttttacacaatataatcagaataacaatttctattataaattatactggtaaaaatcggtaatcatcccaatactatccgctaccttcagaataaataaacatttttatatgtaaatttatgtctatctgatcttgcttaactatacaggaaagctactcaagatcatgggttaagtcgagaggttttaaatgtcctctgccagtttgttgatgcaactccttcagaaattgaggttgcccatttctcttttatgaacattttagttcgtcattactattttttttaactctgcagctgttttttattatcctacaatgtgatcatgactcttattccgatcaaaatactaggaaagatcagaagttctttttgagaaatatgagaagcagtctcaatcttcttacaagacagatttgcaactttttcttgacaagaccatggatgtggctttagattcttttgataatctcttctgtcggagatgtttggtatattatacttctggcattctgttgcttacagttcttttttgaattccatgttttaaacccctttgttcaactacattcgcaaggtctgaattacttgttgatatctatttattcaggtttttgattgccgtctccatgggtgctcccagaacttggtattccctgtaagtcataagttatttcctgttggactcttgtaaagttttgatctttctactttgctcatcctatcaatgaatggaggctgttgtcctgccatttggtccaaagtgaaattaaaaaaataccgctaactactaaccttttgcttaccagagcgagaagcaaccatatggtcatgaacttgatgaaaacaagagaccgtgtggcgatcagtgctaccttcgagtatgtgccccctcaagttttgcagttatatcaattaataaagttgtttatgattcctgaatttgcataatttgccagtccccccctgtgggaatctaagtttcactggtgtggaatattgagattgtattaaaaaaacttcgccaagatgtgtagttaatataagttttgccatcccagcagtgttgttcaaagaaatttcatagcctaggctttggttaaccgggtgtgagccttgaccagacctacatatacttctacaatattaaagcaataaattaggagtatattatttttctaatgaaaaatctaattaactttatgttgaagatatactattagatctgttgtcttgcttgttggacttcaggacccaaaagtagaccctccatacagtacatacactgttatgaactatggatcgtacaagtggcatcagagctaacaccatggcttctgtgggaagtactttttaaactatggtctcatttttctcctaaactccttcactcccattcggccatgctatctgatgtaacttttagtccgtatgcttatcaacattttcattgacccttttaatgggtttgattgatgcaacttctttagtttatttgtctattgaagcgagcctatgaattattctgagaattgcagcaattgctgcattgttgctaagactgattttgattaaaaaattcacagaggagagaagtatatcaagatacgtgcaatgatgaccgaaatgcttgtacaacatataatatggattcaagatcttcctcactcaaagtta
gtgctaccatattgtctgaatcagaagattcaaacagagatgaagataacatcaaatccacttctattgttgaaaccagcagatcaaaaataactaattctgaatatgctgacaaaagtgtgacaccacctcctggagatgcttctgaaactgaaaatgtgtcccctgacatgcccctaagaactttaggcaggcgtaagatttcaaagcatgcctccaagtctaacgatcattcacctgataaaaggcagaagatatatagctcaccgtttccttttgcaatgagtgtactgaacaagcaatctgttccagaaattggtgagacatgtccagattccatagaatctgcagttgatcaacttccaagtctggatgaccctaacaagaaaatttctaccaaagatatgtgtgctggaagcacaactaacactactgaaaatacattacgagataataataataatttgttcatctccaacaaggagcactctatttctcattggagtgctttagagagagatttgtacttgaagggaattgagatatttgggaaaaacaggtaattcttttatatttcccatgctatttgaaaatattgtttattgaaaaacgtaatttaattttaagttcataatggtatgccacagttgtctcatagctagaaacctattgtctggcctgaagacctgcatggaagtggccagctacatgtacaacaatggtgcggcaatggcaaagagacctctatctggtaaatccattttaggtgactttgcagaggctgaacaaggttacatggtacgtattgactggcctcgttaggttatgcttttatccagatcaaagaagtcaaaagaatctataactttggtagttatttttttctgaatgttttcagcacgtctgcattatagtggctgtatttggttgttgttcatcctcctgatatttagttctgtagtcgttatattatattaaatttctttcggcacccacaataaagatggacatggttctttttttggaaacaagcgtattgttcctaaaactggctagtgctagaattgtaggttgtctcatgttttcatatattaggtggttaggggaaattatattttattgtgaagtatagacaaattccaccggtcctataggtttgccttccttttttgcttctaagcacaccatctataacaatgtacaaaaaggagaaaaactgacaattcaatactccctccatctacttttgatagtcatattttcaaatctgaaaaatttatttttgataggcatatttcaatccaacaacatatcctcttaatgactttctcggatttaatgcgtgactctccattcttccacacaagattggctacatgggcatcgagaaatgtaaatattaatgaatcgcttgtttacgaggaatgactagtagcatgtttcaatagatgataagtagaattacttatccttggtctgtgtgccaagatgaaatatgactatcaaaagtagatggagggagtatttcatagtatttacagaaaacaaattgatctatggctagtatatgagtgcatgccaccgttttccatttgaaatcaagcctgttacttgttactactgtttacaatcgaaaaaccagttttcctgccttgtgccttcacagtttagttaagataatctcagtgctactgcacagttaaaagccaaaacacaaacatattttggcttcatttacaactgctatttttactccagtaacctaattgttgtcccagtttcaatcattattgttactccagtgacacaattatttactcatctctttgcaggagcaagatttggtggcaaggacaagaatctgtcgtcgtaagggccgagctcgaaagctcaaatacacttggaagtctgcagggcatccaactgtaagaaaaagaatcggtgatggaaagcaatggtacactcagtataacccatgtgggtgtcagcaaatgtgtggc
aaagattgcgcctgtgtggaaaatggaacttgttgcgagaagtactgcgggtatgtcaagaaattttttattttccctttttgttgagtagaaatctgatttttcctttacttctcttatagaaatacttgtatgaacctgtgttgagctgtttgataaagaaatcaaattaaatacggtgtcaattcacttgtgcatttctgttttatgatcatgttagtgtaaagcaaaagcttgacgctcacttgttttaattattatcatcaacaggtgctcaaagagctgcaaaaataggtttagaggatgtcattgcgcaaaaagtcaatgcagaagcagacagtgcccgtgttttgctgccagtcgtgaatgtgatccagatgtttgcagaaactgctgggtgaggtaatcagcatgttgtgcacttaaggaccctcttgccctcgtattcaaggcaatacccatgctttgggatagtcttttaacagaaactaacaaaaatattgattggctatcatgtgctaatttattcagatttattttgtttaattaattacacgttacacgcatggcgtttattctcttcagctgcggagatggctcactaggtgagccactggcaagaggtgatggctatcagtgtggaaacatgaaactcctcttaaaacaacaacaacgtgtgagagctctagtcttgtattgcttgtttatctttccacatcattgtgctgtactatcttaactgcgttgtttgttgatcatggatcagatattgcttggaaaatctgatgttgcgggttggggtgcattcattaaggttagatcaatgaaaaagcacctttggatatgatgcattttgcattttggctaattccttttttttctttctttcttgcaacatgtagaacccagtaaatagaaatgattaccttggtgaatacactggcgaattgatttctcatagagaagcagataagcgtggcaaaatatatgatcgagcaaattcatcgttcctatttgatttaaatgagcaggtgtgaactaggtcttcatttctggccagttggagtgtcctgggtttgttaaaagtgaaacaggtaacctttatttgatgtgttcttgcagtatgtactggatgcttatcgcaagggggataaactgaagtttgcaaatcactcgtcgaatcctaactgctatgcgaaggtaatgacttatttttactgaatattcatcatgctctgttgcctttctcgttctgattccgtctgcccgtccaggttatgttggtggctggcgatcatcgagttggtatctatgcaaaggaccgcattgaggctagcgaggaactcttttatgattaccgctatggacctgaccaagccccagcttgggctaggagaccggaagggtcaaaaaaggatgaagcatctgtctctcaccaccgagcgcacaaagttgctagatagtccaacagcagctccagatgataatatcaactgtaaattataccgtcattgaaacacatagttcaatcctagtccattatacggccaatcgttggcataataagcatattctatattccttagttccttggtaaataaactgagatatcgagtatgcgaataaaagaaaaataaggcactgtaagtttatttgtacaaagtttggaatttatgctatgtatagttttgcc</dnaseqindica> |
| External Link(s) |
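The structured fields above can be cross-checked against one another. The sketch below, in Python, sums the `#`-separated ranges that precede the nucleotides in the Gene Sequence field and translates the first codons of the Coding Sequence field with the standard genetic code. Reading those ranges as exon coordinates within the gene sequence is an assumption made here, not something the page states, and the helper names (`exon_lengths`, `translate`) are hypothetical.

```python
import re

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Range prefix copied from the Gene Sequence field
# (assumed here to list exon coordinates).
EXON_FIELD = (
    "7080..7268#6928..7005#6708..6836#6572..6619#6391..6481#6076..6207#"
    "5638..5855#4453..4603#3715..4364#2938..3009#2739..2786#2482..2619#"
    "2279..2363#1502..1660#586..922#353..464#215..265#"
)

# First 17 codons copied from the Coding Sequence field.
CDS_START = "atggcgtcgtcctcgtccaaggcctccgattcctcttcccaacgccccaag"


def exon_lengths(field):
    """Length of each start..end range (coordinates are inclusive)."""
    return [int(end) - int(start) + 1
            for start, end in re.findall(r"(\d+)\.\.(\d+)", field)]


def translate(cds):
    """Translate complete codons of a DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    lengths = exon_lengths(EXON_FIELD)
    print("ranges:", len(lengths), "total length:", sum(lengths), "bp")
    print("first residues:", translate(CDS_START))
```

If the assumption holds, the number of ranges should match the 17 exons reported in the text, and their summed length should equal three times the 895 residues plus one stop codon.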