INSIGHT REVIEW NATUREIVol 447 24 May 2007 doi: 10.1038/nature05917 Epigenetic inheritance in plants lanR. henderson'& steven e. jacobsen' The function of plant genomes depends on chromatin marks such as the methylation of dna and the post- translational modification of histones. Techniques for studying model plants such as Arabidopsis thaliana have enabled researchers to begin to uncover the pathways that establish and maintain chromatin modifications, scale. Small RNAs seem to be important in determining the distribution of chromatin modifications, ano o and genomic studies are allowing the mapping of modifications such as dNa methylation on a genom RNA might also underlie the complex epigenetic interactions that occur between homologous sequences. Plants use these epigenetic silencing mechanisms extensively to control development and parent -of-origin imprinted gene expression. akaryotic genomes are covalently modified with a diverse set of chro- highly transcribed and constitutively expressed By contrast, genes matin marks, which are present on both the DNA and the associated with methylated promoters had lower expression levels and frequent mary DNA sequence, they are frequently heritable through cell division, methylation is in contrast to that observed in mammalian genomes, sometimes for multiple generations, and can thus often be classified as which are often densely methylated but have hypomethylated CG islands pigenetic marks. These conserved epigenetic marks have been found in gene promoters. It will be important to describe the methylome of to influence many aspects of gene expression and chromosome biology, other repeat-rich plant genomes, such as those of the grasses, to test the and they have characteristic genomic distributions generality of the patterns observed in A thaliana. Here, we review the The size of eukaryotic genomes v tensively and does not cor- emerging and prominent role of RNA in epigenetic inheritance in plants relate with gene number. This is often because of the presence of large and how such mechanisms are used to control development. amounts of non-gene sequences, which can include pseudogenes, transposable elements, integrated viruses and simple repeats' At the Mediating silencing with RNA hromosomal level, genomes are organized into euchromatin, which is A central question in understanding the epigenetic regulation of gene-rich, and heterochromatin, which is repeat-rich Heterochromatin genomes is how sequences are recognized or avoided as targets for is defined by three main properties: greater compaction than other silencing. There is an increasing appreciation that siRNAs, whic genomic regions during interphase, lower accessibility than other are generated by the RNA interference(RNai)pathway, can provide regions to transcription and recombination machinery, and the for- sequence specificity to guide epigenetic modifications in a diverse range mation of structured nucleosome arrays"(see page 399). The defining of eukaryotes. Well-studied examples include transcriptional silen characteristics of heterochromatin depend on epigenetic information, cing in yeast(see page 399), cytosine methylation in plants. and including post-translational modification of histones and methylation nome rearrangements in ciliates. RNA-directed DNA methylation of cytosine bases in DNA". The silencing of transposable-element was discovered in tobacco, in which genomic sequences homologous uences within heterochromatin is probably a genome-defence strat- to infectious RNA viroids were found to become cytosine methy gy. However, heterochromatin can also have important roles during ated. Subsequently, the expression of double-stranded RNA (dsRNA) chromosomal segregation, and transposons and epigenetic silencing in plants was shown to generate siRNAs and cause dense cytosine ave been shown to both modulate gene expression and contribute to methylation of homologous DNA in all sequence contexts. This is cis-regulatory sequences". Plant systems have been a rich source for the reflected by the high coincidence of endogenous siRNA clusters with study of epigenetic inheritance, and examples of important discoveries methylated sequences and repeats in A. thaliana.215.20 include transposable elements, Paramutation, small interfering RNAs All known de novo DNA methylation in A. thaliana is carried out (siRNAs) and RNA-directed DNA methylation by DOMAINS REARRANGED METHYLTRANSFERASE 2(DRM2), Genomic resources for studying the model plant Arabidopsis thaliana which is a homologue of the mammalian DNA methyltransferase 3 have begun to provide insight into the epigenetic landscape' of this (DNMT3)enzymes(Fig 2b). DRM2 can be targeted to a sequence organism.A thaliana has a compact-130-megabase(Mb)genome, by siRNAs generated from the expression of either direct or inverted although it contains considerable amounts of heterochromatin, which is repeats. Plants encode multiple homologues of the RNAi-machinery repeat-rich and largely located in the centromeric and pericentromeric components, some of which are specialized for function in RNA regions(Fig. 1). High-resolution mapping of cytosine methylation directed DNA methylation. 26. The endoribonuclease DICER-LIKE3 by using whole-genome microarrays has confirmed previous reports, (DCL3)generates 24-nucleotide siRNAs, which are loaded into the PA nowing that this modification co-localizes with repeat sequences and and PIWI-domain-containing protein ARGONAUTE 4(AGO4) with the centromeric regions2.5. Fewer than 5% of expressed genes (Fig 2a). These AGO4-associated siRNAs are proposed to guide the were shown to have methylated promoters, although about one-third cytosine-methyltransferase activity of DRM2 (refs 26-31). The mecha of genes were methylated in their open reading frame.. The signif- nism by which siRNAs target epigenetic modifications is poorly under ance of methylation in the body of a gene is not fully understood, stood and could involve either DNA-RNA or RNA-RNA hybridization but such methylation was found to correlate with genes that are both events. Interestingly, epigenetic modifications guided by AGo4 in Department of Molecular, Cell and Developmental Biology, Howard Hughes Medical Institute, University of California, Los Angeles, California 90095, USA. @2007 Nature Publishing Group
Eukaryotic genomes are covalently modified with a diverse set of chromatin marks, which are present on both the DNA and the associated histones (see page 407). Although these changes do not alter the primary DNA sequence, they are frequently heritable through cell division, sometimes for multiple generations, and can thus often be classified as epigenetic marks. These conserved epigenetic marks have been found to influence many aspects of gene expression and chromosome biology, and they have characteristic genomic distributions. The size of eukaryotic genomes varies extensively and does not correlate with gene number1 . This is often because of the presence of large amounts of non-gene sequences, which can include pseudogenes, transposable elements, integrated viruses and simple repeats1 . At the chromosomal level, genomes are organized into euchromatin, which is gene-rich, and heterochromatin, which is repeat-rich2 . Heterochromatin is defined by three main properties: greater compaction than other genomic regions during interphase, lower accessibility than other regions to transcription and recombination machinery, and the formation of structured nucleosome arrays2 (see page 399). The defining characteristics of heterochromatin depend on epigenetic information, including post-translational modification of histones and methylation of cytosine bases in DNA2,3. The silencing of transposable-element sequences within heterochromatin is probably a genome-defence strategy. However, heterochromatin can also have important roles during chromosomal segregation4 , and transposons and epigenetic silencing have been shown to both modulate gene expression and contribute to cis-regulatory sequences5,6. Plant systems have been a rich source for the study of epigenetic inheritance, and examples of important discoveries include transposable elements7 , paramutation8 , small interfering RNAs (siRNAs)9 and RNA-directed DNA methylation10. Genomic resources for studying the model plant Arabidopsis thaliana have begun to provide insight into the epigenetic ‘landscape’ of this organism11,12. A. thaliana has a compact ~130-megabase (Mb) genome, although it contains considerable amounts of heterochromatin, which is repeat-rich and largely located in the centromeric and pericentromeric regions13,14 (Fig. 1). High-resolution mapping of cytosine methylation by using whole-genome microarrays has confirmed previous reports, showing that this modification co-localizes with repeat sequences and with the centromeric regions11,12,15. Fewer than 5% of expressed genes were shown to have methylated promoters, although about one-third of genes were methylated in their open reading frame11,12. The significance of methylation in the body of a gene is not fully understood, but such methylation was found to correlate with genes that are both highly transcribed and constitutively expressed11,12. By contrast, genes with methylated promoters had lower expression levels and frequently had tissue-specific expression patterns11,12. This distribution of cytosine methylation is in contrast to that observed in mammalian genomes, which are often densely methylated but have hypomethylated CG islands in gene promoters3 . It will be important to describe the ‘methylome’ of other repeat-rich plant genomes, such as those of the grasses, to test the generality of the patterns observed in A. thaliana. Here, we review the emerging and prominent role of RNA in epigenetic inheritance in plants and how such mechanisms are used to control development. Mediating silencing with RNA A central question in understanding the epigenetic regulation of genomes is how sequences are recognized or avoided as targets for silencing. There is an increasing appreciation that siRNAs, which are generated by the RNA interference (RNAi) pathway, can provide sequence specificity to guide epigenetic modifications in a diverse range of eukaryotes. Well-studied examples include transcriptional silencing in yeast16 (see page 399), cytosine methylation in plants10,17 and genome rearrangements in ciliates18. RNA-directed DNA methylation was discovered in tobacco, in which genomic sequences homologous to infectious RNA viroids were found to become cytosine methylated10. Subsequently, the expression of double-stranded RNA (dsRNA) in plants was shown to generate siRNAs and cause dense cytosine methylation of homologous DNA in all sequence contexts19. This is reflected by the high coincidence of endogenous siRNA clusters with methylated sequences and repeats in A. thaliana11,12,15,20. All known de novo DNA methylation in A. thaliana is carried out by DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2), which is a homologue of the mammalian DNA methyltransferase 3 (DNMT3) enzymes21–24 (Fig. 2b). DRM2 can be targeted to a sequence by siRNAs generated from the expression of either direct or inverted repeats23,24. Plants encode multiple homologues of the RNAi-machinery components, some of which are specialized for function in RNAdirected DNA methylation25,26. The endoribonuclease DICER-LIKE 3 (DCL3) generates 24-nucleotide siRNAs, which are loaded into the PAZand PIWI-domain-containing protein ARGONAUTE 4 (AGO4)26–31 (Fig. 2a). These AGO4-associated siRNAs are proposed to guide the cytosine-methyltransferase activity of DRM2 (refs 26–31). The mechanism by which siRNAs target epigenetic modifications is poorly understood and could involve either DNA–RNA or RNA–RNA hybridization events. Interestingly, epigenetic modifications guided by AGO4 in Epigenetic inheritance in plants Ian R. Henderson1 & Steven E. Jacobsen1 The function of plant genomes depends on chromatin marks such as the methylation of DNA and the posttranslational modification of histones. Techniques for studying model plants such as Arabidopsis thaliana have enabled researchers to begin to uncover the pathways that establish and maintain chromatin modifications, and genomic studies are allowing the mapping of modifications such as DNA methylation on a genome-wide scale. Small RNAs seem to be important in determining the distribution of chromatin modifications, and RNA might also underlie the complex epigenetic interactions that occur between homologous sequences. Plants use these epigenetic silencing mechanisms extensively to control development and parent-of-origin imprinted gene expression. 1 Department of Molecular, Cell and Developmental Biology, Howard Hughes Medical Institute, University of California, Los Angeles, California 90095, USA. 418 INSIGHT REVIEW NATURE|Vol 447|24 May 2007|doi:10.1038/nature05917
NATUREIVol 447 24 May 2007 INSIGHT REVIEW A. thaliana have been shown to depend partly on the RNaseH(slicer) revert to an active state. This gives rise to the concept of the epigenetic talytic activity of AGo4(ref. 30). This could be taken as support for allele(epiallele), which is defined as an allele that shows a heritable di INA-RNA hybridization having an important role in the targeting of ference in expression as a consequence of epigenetic modifications and epigenetic modifications not changes in DNA sequence. For example, hypermethylated(silent) The accumulation of siRNAs associated with RNA-directed epialleles of SUPERMAn (which is involved in floral development) DNA methylation in A thaliana often depends on RNA-DEPEND- known as clark kent are stable during many generations of inbreeding, ENT RNA POLYMERASE 2 (RDR2)and the plant-specific protein but they can revert to an unmethylated (active)state at a frequency of NUCLEAR RNA POLYMERASE IV A(also known as NUCLEAR -3%per generation". Another notable characteristic of certain epialleles RNA POLYMERASED 1A; NRPDlA), which are involved in a putative is their ability to influence other homologous sequences both in cis and amplification pathway 32-3(Fig 2a). Together, RDR2 and NRPDIA might generate dsRNA substrates for DCL3 to process into siRNAs, Genes although how these proteins are recruited to target loci is unknown. everal loci also show dependence on AGo4 and DRM2 for siRNA 35 accumulation, suggesting that there might be a feedback loop between ional silencing and siRNA generation2 426 NRPDIA functions in a complex with NRPD2. A variant of this 20 required for RNA-directed DNA methylation but participates less fre- 9 15/ NRPD complex, which contains NRPDIB instead of NRPDlA, is also quently in siRNA accumulation.(Fig 2a). One possible function for the NRPDIB-containing complex is to generate a target transcript that can hybridize with siRNA-loaded AGO4-containing complexes Indeed, AGO4 has been observed to bind directly to NRPDiB.The#100,000 Repeats SWI-SNF-family chromatin-remodelling protein DEFECTIVE IN g RNA-DIRECTED DNA METHYLATION 1(DRDI)is also required for RNA-directed DNA methylation and could function to facilitate access of DRM2 to target DNA. Recently, several proteins in the RNA-directed 60.000 DNA-methylation pathway have been found to localize bodies, including the Cajal body, which is a centre for the processing and modification of many non-coding RNAs2 .Localization to these bodies 20.000 might be required for the efficient loading of AGO4-containing com- plexes with siRNA before these complexes travel to the nucleoplasm and, together with DRM2, direct RNA-directed DNA methylation. 200250 Plants show extensive methylation of cytosine bases in the CG, CNG Cytosine methylation ce contexts. By contrast, most cytosine methylation in mammals is found in the CG sequence context. CG methylation is maintained DNMTI in plants and mammals, respectively(Fig. 2 b). DNMTI 40,001 has a catalytic preference for hemimethylated substrates, providing an attractive model for the efficient maintenance of CG methylation after DNA replication and during cell division. Most non-CG methylation in lants is maintained redundantly by dRM2 and the plant-specific protein 2体 200 CHROMOMETHYLASE 3(CMT3)(Fig 2b); however, some loci siRNA show residual non-CG methylation in drmI drm2 cmt3 triple mutants, which might be maintained by METl(ref. 25). Non-CG methylation 2500 differs from CG methylation, because it seems to require an active maintenance signal after DNA replication. At some loci, siRNAs seem g2000 to provide this signal, acting through dRM2 activity: for example, at the 1.500 MEA-ISRlocus(MEDEA INTERSTITIAL SUBTELOMERIC REPEATS 31,000 locus, an array of seven tandem repeats located downstream of the MEDEA gene), the repeats lose all non-CG methylation in drm2 mutants 2500 and in several RNAi-pathway mutants such as ago4 and rdr2 (refs 23, 37) By contrast, other loci-for le. the sIne-class 150200250 ArSNI-completely lose non-CG methylation only in drmI drm2 cm Distance along chromosome (100 kb riple mutants. At AtSNI, CMT3 contributes to the maintenance of both Centromere CNG methylation and asymmetrical( CHH)methylation. The activ ity of CMT3 largely depends on the main methyltransferase for H3K A thaliana chromosome e lysine residue at position 9 of histone H3)-SU(VAR)3-9 HOM OLOGUE 4(SUVH4; also known as KRYPTONITE)-showing that Figure 1I The epigenetic"landscapeof A thaliana. The relative abundance histone methylation is also an important al for the maintenance of tive importance of the RNAi pathway and histone methylation for the siRNAs (doned siRNAs Per 100 kb; ref 20) is shown for the lengthor A thaliana chromosome 1, which is -30 Mb. Numbers on the x axis represent 100-kb windows along the chromosome. A diagram of Communication of silent information chromosome 1 is also shown, with white bars indicating euchromatic arms, grey bars indicating pericentromeric heterochromatin and the black bar Epigenetically silent expression states can show remarkable stability indicating the centromeric core(Figure courtesy of X.Zhang, University throughout mitosis and meiosis, although they can retain the ability to of California, Los Angeles) @2007 Nature Publishing Group
A. thaliana have been shown to depend partly on the RNaseH (‘slicer’) catalytic activity of AGO4 (ref. 30). This could be taken as support for RNA–RNA hybridization having an important role in the targeting of epigenetic modifications. The accumulation of siRNAs associated with RNA-directed DNA methylation in A. thaliana often depends on RNA-DEPENDENT RNA POLYMERASE 2 (RDR2) and the plant-specific protein NUCLEAR RNA POLYMERASE IV A (also known as NUCLEAR RNA POLYMERASE D 1A; NRPD1A), which are involved in a putative amplification pathway26,32–35 (Fig. 2a). Together, RDR2 and NRPD1A might generate dsRNA substrates for DCL3 to process into siRNAs, although how these proteins are recruited to target loci is unknown. Several loci also show dependence on AGO4 and DRM2 for siRNA accumulation, suggesting that there might be a feedback loop between transcriptional silencing and siRNA generation24,26. NRPD1A functions in a complex with NRPD2. A variant of this NRPD complex, which contains NRPD1B instead of NRPD1A, is also required for RNA-directed DNA methylation but participates less frequently in siRNA accumulation33,35 (Fig. 2a). One possible function for the NRPD1B-containing complex is to generate a target transcript that can hybridize with siRNA-loaded AGO4-containing complexes. Indeed, AGO4 has been observed to bind directly to NRPD1B28. The SWI–SNF-family chromatin-remodelling protein DEFECTIVE IN RNA-DIRECTED DNA METHYLATION 1 (DRD1) is also required for RNA-directed DNA methylation and could function to facilitate access of DRM2 to target DNA27,36. Recently, several proteins in the RNA-directed DNA-methylation pathway have been found to localize to distinct nuclear bodies, including the Cajal body, which is a centre for the processing and modification of many non-coding RNAs28,29. Localization to these bodies might be required for the efficient loading of AGO4-containing complexes with siRNA before these complexes travel to the nucleoplasm and, together with DRM2, direct RNA-directed DNA methylation. Plants show extensive methylation of cytosine bases in the CG, CNG (where N denotes any nucleotide) and CHH (where H denotes A, C or T) sequence contexts37. By contrast, most cytosine methylation in mammals is found in the CG sequence context3,38. CG methylation is maintained by the homologous proteins METHYLTRANSFERASE 1 (MET1) and DNMT1 in plants and mammals, respectively39,40 (Fig. 2b). DNMT1 has a catalytic preference for hemimethylated substrates, providing an attractive model for the efficient maintenance of CG methylation after DNA replication and during cell division38. Most non-CG methylation in plants is maintained redundantly by DRM2 and the plant-specific protein CHROMOMETHYLASE 3 (CMT3)23,37 (Fig. 2b); however, some loci show residual non-CG methylation in drm1 drm2 cmt3 triple mutants, which might be maintained by MET1 (ref. 25). Non-CG methylation differs from CG methylation, because it seems to require an active maint enance signal after DNA replication. At some loci, siRNAs seem to provide this signal, acting through DRM2 activity: for example, at the MEA-ISR locus (MEDEA INTERSTITIAL SUBTELOMERIC REPEATS locus, an array of seven tandem repeats located downstream of the MEDEA gene), the repeats lose all non-CG methylation in drm2 mutants and in several RNAi-pathway mutants such as ago4 and rdr2 (refs 23, 37). By contrast, other loci — for example, the SINE-class retrotransposon AtSN1 — completely lose non-CG methylation only in drm1 drm2 cmt3 triple mutants. At AtSN1, CMT3 contributes to the maintenance of both CNG methylation and asymmetrical (CHH) methylation. The activity of CMT3 largely depends on the main methyltransferase for H3K9 (the lysine residue at position 9 of histone H3) — SU(VAR)3-9 HOMOLOGUE 4 (SUVH4; also known as KRYPTONITE) — showing that histone methylation is also an important signal for the maintenance of non-CG methylation41,42. At present, the factors that determine the relative importance of the RNAi pathway and histone methylation for the maintenance of non-CG methylation at different loci remain unclear. Communication of silent information Epigenetically silent expression states can show remarkable stability throughout mitosis and meiosis, although they can retain the ability to revert to an active state2 . This gives rise to the concept of the epigenetic allele (epiallele), which is defined as an allele that shows a heritable difference in expression as a consequence of epigenetic modifications and not changes in DNA sequence. For example, hypermethylated (silent) epialleles of SUPERMAN (which is involved in floral development) known as clark kent are stable during many generations of inbreeding, but they can revert to an unmethylated (active) state at a frequency of ~3% per generation43. Another notable characteristic of certain epialleles is their ability to influence other homologous sequences both in cis and siRNA 500 1,000 1,500 2,000 2,500 3,000 0 20,000 40,000 60,000 80,000 5 10 15 20 25 30 35 40 0 0 50 300 100 150 200 250 0 0 50 300 100 150 200 250 0 250 50 300 100 150 200 0 0 250 50 300 100 150 200 20,000 40,000 60,000 80,000 100,000 Repeats Genes No. of genes per 100 kb No. of repeat bases per 100 kb No. of methylated bases per 100 kb No. of siRNAs per 100 kb Cytosine methylation A. thaliana chromosome 1 Centromere Distance along chromosome (100 kb) Figure 1 | The epigenetic ‘landscape’ of A. thaliana. The relative abundance of genes (number of annotated genes11), repeats (repeat bases per 100 kb; ref. 11), cytosine methylation (methylated bases per 100 kb; ref. 11) and siRNAs (cloned siRNAs per 100 kb; ref. 20) is shown for the length of A. thaliana chromosome 1, which is ~30 Mb. Numbers on the x axis represent 100-kb windows along the chromosome. A diagram of chromosome 1 is also shown, with white bars indicating euchromatic arms, grey bars indicating pericentromeric heterochromatin and the black bar indicating the centromeric core. (Figure courtesy of X. Zhang, University of California, Los Angeles.) 419 NATURE|Vol 447|24 May 2007 INSIGHT REVIEW
INSIGHT REVIEW NATURE Vol 447 24 May 2007 in trans?. One example is paramutation, which was discovered in plants and is defined as allelic interactions that cause a meiotically heritable change in the expression of one of the alleles. Trans-phenomer NRPDZ to paramutation have also been described in mammals, including at a 八八八八八 chimaeric version of the mouse Rasgrfl (Ras protein-specific guanine SSRNA dsRNA nucleotide-releasing factor 1)locus that contained the imprinting con- trol region from the insulin-like growth factor 2 receptor gene One of the best-studied paramutation systems is the maize(Zea mays)locus bl, which encodes a transcription factor that is required DNA for accumulation of the pigment anthocyanin. The paramutagenic epiallele B, which light pigm siRNA DRM2 NRPD1B low frequency from its paramutable parent allele B-I, which causes dark pigmentation. B'epialleles convert B-lalleles to B'epialleles when \\siRNA. heterozygous with 100% penetrance, and the newly created paramutated B'epialleles can pass on their silent state in subsequent crosses(Fig 3) B' epialleles are transcribed at one-twentieth to one-tenth the rate of DRM2 AGO. B-I alleles but have identical gene sequences".Fine-structure recom- bination mapping of alleles resulting from a cross between individuals with paramutagenic alleles and those with neutral alleles(which can- not participate in paramutation) enabled the sequences required for paramutation to be defined; these sequences are present as an array of 7 tandem 853-base repeats, which is located -100 kilobases(kb) upstream ofb(refs 45, 46). The sequences are present as a single copy in neutral alleles. Recombinant alleles with three repeats show partial A tha METT aramutational ability, whereas alleles with seven repeats are fully active in paramutation". These repeats were also shown to have a closed H sapiens DNMT1 chromatin structure and more cytosine methylation in B'epialleles thar BAR in B-I alleles. However, for B, cytosine methylation was found to b A thaliana DRM2 Zinc finger established after the silent state, so it is unlikely to be the cause.There H Cytosine methyltransferase are several models of trans-communication between alleles, including physical pairing of alleles and transmission of an RNA signal. A model UBA domain for paramutagenic interactions being mediated by siRNA is supported H sapiens DNMT3B PWWP domain by the finding that a genetic suppressor of paramutation, mediator of paramutation(mop1), encodes the maize orthologue of the RNA- A thaliana CMT3 dependent RNa polymerase RDR2(refs 47, 48). So far, siRNAs homol- gous to the tandem repeats upstream of B have not been reported, Figure 2 RNA-directed DNA methylation. a, Putative pathway for rNA. although such repeats are commonly associated with small RNAs directed DNA methylation in A thaliana. Target loci(in this case tandemly The mopI gene is also required for silencing transgenes and Mutator repeated sequences; coloured arrows)recruit an RNA polymerase I\ like transposons, indicating that RNA-dependent RNA polymerases and of NRPDlA and NrPD2 through an unknow siRNAs have a role in heterochromatic silencing in monocotyledonous mechanism, and this results in the ion of as -stranded RNa plants 0. The detailed relationships between siRNAS,chromatin struc ssRNA)species. This ssRNA is converted to double-stranded RNA sRNA)by the RNA-dependent RNA polymerase RDR2. The dsrNA ture at the repeats upst of B, and the ability is then processed into 24-nucleotide siRNAs by DCL3. The siRNAs are states will be intriguing to determine. ubsequently loaded into the PAZ- and PIwl-domain-containing prote The A thaliana gene FWA has similarities to maize bl in that it AGo4, which associates with another form of the RNa polymerase lv silencing of expres has tandem repeats upstream that, when methylated, cause heritable complex, NRPDIB-NRPD2 AGO4 that is programmed 'with siRNAs silencing of expression. Stably hypomethylated fwa-1 epialleles have can then locate homologous genomic sequences and guide the protein been found to be generated spontaneously and in metI mutant back DRM2 which has de n sine methyltransferase activity. Targeting causing overexpression of the transcription factor FWA DRM2 to DNA sequences also involves the SWl-SNF-family chromatin and a dominant late-flowering phenotype. In contrast to B' epialleles, tein dRDl. The NRPDIB-NrPD nerate a target transcript (ssRNA) to which the AGO4-associated sirNas presence of one another in heterozygotes.49,51.However, introduc- drm2 mutants and agod mutants, it is possible that DNA methylation (blue tion of unmethylated transgenic copies of FWA by Agrobacterium circles)also stimulates siRNA generation and reinforces silencing. b, DNA tumefaciens-mediated transformation leads to efficient de novo silen- methyltransferase structure and function. Plant and mammalian genomes DRM2 and the RNa-directed DNA-methylation RNAi pathway ing of the incoming transgene, in a process that depends on both encode homologous cytosine methyltransferases, of which there ar lants and two in mammals. A thaliana meti and homo Fig 3). Intriguingly, an unmethylated FWA transgene obtained after sapiens(human) DNMTI both function to maintain CG methylation after transformation into a drm2 mutant does not become remethylated DNA replication, through a preference for hemimethylated substrates after outcrossing to wild-type A thaliana. This finding suggests ology(Bah) hat, during the transformation process, there is a'surveillance' win- domains of unknown function. De novo DNA methylation is carried out dow when the incoming FWA transgene is competent to be silenced. by the homologous proteins DRM2(in A thaliana) and DNMT3A and DNMT3B(both in H. sapiens). Despite s thol winos these proteinate during transformation, but introduction of FWA into DRM2/drm2 A tumefaciens targets the female gametophyte(which is haploid) cytosine methyltransferase domain are ordered differently in DRM2 and heterozygotes revealed that the silencing window must be present after the dnmt3 proteins. Plants also have another class of methyltransfera fertilization" Structure-function analysis of an FWA transgene showed which is not found in mammals. CMT3 functions together with DRM2 to that the upstream tandem repeats are necessary and sufficient for trans- maintain non-CG methylation PwWP, Pro-Trp-Trp-Pro motif; formation-dependent silencing and were also found to produce homol- ogous siRNA. Interestingly, the efficiency by which an incoming @2007 Nature Publishing Group
in trans2 . One example is paramutation, which was discovered in plants and is defined as allelic interactions that cause a meiotically heritable change in the expression of one of the alleles8 . Trans-phenomena similar to paramutation have also been described in mammals, including at a chimaeric version of the mouse Rasgrf1 (Ras protein-specific guaninenucleotide-releasing factor 1) locus that contained the imprinting control region from the insulin-like growth factor 2 receptor gene44. One of the best-studied paramutation systems is the maize (Zea mays) locus b1, which encodes a transcription factor that is required for accumulation of the pigment anthocyanin8 . The paramutagenic epiallele Bʹ, which causes light pigmentation, arises spontaneously at a low frequency from its paramutable parent allele B-I, which causes dark pigmentation45. Bʹ epialleles convert B-I alleles to Bʹ epialleles when heterozygous with 100% penetrance, and the newly created paramutated Bʹ epialleles can pass on their silent state in subsequent crosses45 (Fig. 3). Bʹ epialleles are transcribed at one-twentieth to one-tenth the rate of B-I alleles but have identical gene sequences45,46. Fine-structure recombination mapping of alleles resulting from a cross between individuals with paramutagenic alleles and those with neutral alleles (which cannot participate in paramutation) enabled the sequences required for paramutation to be defined; these sequences are present as an array of 7 tandem 853-base repeats, which is located ~100 kilobases (kb) upstream of b1 (refs 45, 46). The sequences are present as a single copy in neutral alleles. Recombinant alleles with three repeats show partial paramutational ability, whereas alleles with seven repeats are fully active in paramutation45,46. These repeats were also shown to have a closed chromatin structure and more cytosine methylation in Bʹ epialleles than in B-I alleles46. However, for Bʹ, cytosine methylation was found to be established after the silent state, so it is unlikely to be the cause46. There are several models of trans-communication between alleles, including physical pairing of alleles and transmission of an RNA signal. A model for paramutagenic interactions being mediated by siRNA is supported by the finding that a genetic suppressor of paramutation, mediator of paramutation1 (mop1), encodes the maize orthologue of the RNAdependent RNA polymerase RDR2 (refs 47, 48). So far, siRNAs homologous to the tandem repeats upstream of Bʹ have not been reported, although such repeats are commonly associated with small RNAs20,49. The mop1 gene is also required for silencing transgenes and Mutatorlike transposons, indicating that RNA-dependent RNA polymerases and siRNAs have a role in heterochromatic silencing in monocotyledonous plants50. The detailed relationships between siRNAs, chromatin structure at the repeats upstream of Bʹ, and the ability to transfer epigenetic states will be intriguing to determine. The A. thaliana gene FWA has similarities to maize b1 in that it has tandem repeats upstream that, when methylated, cause heritable silencing of expression51. Stably hypomethylated fwa-1 epialleles have been found to be generated spontaneously and in met1 mutant backgrounds39,40,51, causing overexpression of the transcription factor FWA and a dominant late-flowering phenotype51. In contrast to Bʹ epialleles, methylated and unmethylated fwa epialleles are not influenced by the presence of one another in heterozygotes23,49,51. However, introduction of unmethylated transgenic copies of FWA by Agrobacterium tumefaciens-mediated transformation leads to efficient de novo silencing of the incoming transgene, in a process that depends on both DRM2 and the RNA-directed DNA-methylation RNAi pathway22,23 (Fig. 3). Intriguingly, an unmethylated FWA transgene obtained after transformation into a drm2 mutant does not become remethylated after outcrossing to wild-type A. thaliana22,23. This finding suggests that, during the transformation process, there is a ‘surveillance’ window when the incoming FWA transgene is competent to be silenced. A. tumefaciens targets the female gametophyte (which is haploid) during transformation, but introduction of FWA into DRM2/drm2 heterozygotes revealed that the silencing window must be present after fertilization49. Structure–function analysis of an FWA transgene showed that the upstream tandem repeats are necessary and sufficient for transformation-dependent silencing and were also found to produce homologous siRNA49. Interestingly, the efficiency by which an incoming a b NRPD2 DRM2 DRM2 NRPD2 NRPD1B AGO4 AGO4 DCL3 AGO4 DRD1 NRPD1A RDR2 ssRNA siRNAs siRNA ssRNA dsRNA DNA A. thaliana DRM2 H. sapiens DNMT1 H. sapiens DNMT3A A. thaliana MET1 H. sapiens DNMT3B A. thaliana CMT3 BAH domain Cytosine methyltransferase UBA domain PWWP domain Zinc finger Chromodomain Figure 2 | RNA-directed DNA methylation. a, Putative pathway for RNAdirected DNA methylation in A. thaliana. Target loci (in this case tandemly repeated sequences; coloured arrows) recruit an RNA polymerase IV complex consisting of NRPD1A and NRPD2 through an unknown mechanism, and this results in the generation of a single-stranded RNA (ssRNA) species. This ssRNA is converted to double-stranded RNA (dsRNA) by the RNA-dependent RNA polymerase RDR2. The dsRNA is then processed into 24-nucleotide siRNAs by DCL3. The siRNAs are subsequently loaded into the PAZ- and PIWI-domain-containing protein AGO4, which associates with another form of the RNA polymerase IV complex, NRPD1B–NRPD2. AGO4 that is ‘programmed’ with siRNAs can then locate homologous genomic sequences and guide the protein DRM2, which has de novo cytosine methyltransferase activity. Targeting of DRM2 to DNA sequences also involves the SWI–SNF-family chromatinremodelling protein DRD1. The NRPD1B–NRPD2 complex might generate a target transcript (ssRNA) to which the AGO4-associated siRNAs can hybridize. Given that siRNAs homologous to some loci are absent in drm2 mutants and ago4 mutants, it is possible that DNA methylation (blue circles) also stimulates siRNA generation and reinforces silencing. b, DNA methyltransferase structure and function. Plant and mammalian genomes encode homologous cytosine methyltransferases, of which there are three classes in plants and two in mammals. A. thaliana MET1 and Homo sapiens (human) DNMT1 both function to maintain CG methylation after DNA replication, through a preference for hemimethylated substrates, and both have amino-terminal bromo-adjacent homology (BAH) domains of unknown function. De novo DNA methylation is carried out by the homologous proteins DRM2 (in A. thaliana) and DNMT3A and DNMT3B (both in H. sapiens). Despite their homology, these proteins have distinct N-terminal domains, and the catalytic motifs present in the cytosine methyltransferase domain are ordered differently in DRM2 and the DNMT3 proteins. Plants also have another class of methyltransferase, which is not found in mammals. CMT3 functions together with DRM2 to maintain non-CG methylation. PWWP, Pro-Trp-Trp-Pro motif; UBA, ubiquitin associated. 420 INSIGHT REVIEW NATURE|Vol 447|24 May 2007
NATUREIVol 447 24 May 2007 INSIGHT REVIEW FWA transgene is silenced can be influenced by the methylation state with fwa-I(ref. 49).Hence, recruitment of siRNA machinery to a locus of endogenous FWA". Whereas introduction of an FWA transgene is not always sufficient for RNA-directed DNA methylation and prob into a background in which the endogenous FWA gene is methylated ably also requires modifications of chromatin leads to extremely efficient silencing of the transgene, transformation Maintenance of silencing at FWA depends mainly on CG methylation into the fwa-1 background, which contains an unmethylated endogen- because metl alleles generate hypomethylated fwa-l epialleles at a high ous gene, leads to inefficient methylation and silencing of the Fwa frequency.. Although the tandem repeats upstream of FWA are als transgene(Fig 3). Furthermore, an introduced transgene can occa- methylated at non-CG sequences, loss of this methylation in drmI drm2 sionally cause silencing of the unmethylated fwa-1 endogenous gene". cmt3 triple mutants does not cause reactivation and late flowering' These results reveal extensive communication between the transgenic Genome-wide analysis of cytosine methylation and transcription in and endogenous FWA gene copies during transformation, and this drmI drm2 cmt3 triple mutants has identified genes with methylated communication depends on the DNA methylation state of the endogen- promoters, the expression of which depends strongly on DRM-and ous gene. Surprisingly, these differences between fwa-I epialleles are CMT3-mediated non-CG methylation. These methylated genes might SiRNAs accumulate equally in plants with wild-type FWA and those triple mutants, which include misshapenleaotypes of drmI drm2cmt3 not accounted for by siRNA production, because the repeat-derived be responsible for the developmental phene 0% Spontaneous conversion 99999 Crossing B"with B-4 Heterozygote 八八八八 八 Wild-type TFWA endogenous gene (NRPD1A)(NRPD1B)(DRD1 A tumefaciens-mediated transformation 99 Transgene Figure 3 Trans-epiallele interactions at b1 and FWA. a, Paramutation at is methylated at cytosine bases in a pair of tandem repeats in its the bl locus in maize. The B-I allele(pink) of the bl gene in maize has an promoter, silencing its expr Mutations that decrease dna pstream tandem-repeat region(coloured arrows)and spontaneously hylation give rise to hypomethylated fwa-I epialleles(blue), t e more heavily methylated at cytosine bases in the repeat region and are late flowering. Introduction of an unmethylated FWA trangene py of B-I by crossing of maize plants, the B-1 allele is paramutated results in efficient methylation and silencing of the incoming transge to a silenced b' state with 100% penetrance Trans-communication depends on DRM2, AGO4, DCL3, RDR2, NRPDIA, etween epialleles requires MoPl, the maize homologue of A. thaliana NRPDIB and DRDl. By contrast, transformation of an fwa-1 background DR2, suggesting that siRNA-mediated silencing might be involved in the results in inefficient silencing of the transgene, indicating that the nversion of B-I to B: b, De novo silencing of FWA transgenes in wild methylation state of endogenous FWA is important for transgene type and fwa-1 A thaliana. The FWA gene in wild-type A thaliana(pi 4 @2007 Nature Publishing Group
FWA transgene is silenced can be influenced by the methylation state of endogenous FWA49. Whereas introduction of an FWA transgene into a background in which the endogenous FWA gene is methylated leads to extremely efficient silencing of the transgene, transformation into the fwa-1 background, which contains an unmethylated endogenous gene, leads to inefficient methylation and silencing of the FWA transgene49 (Fig. 3). Furthermore, an introduced transgene can occasionally cause silencing of the unmethylated fwa-1 endogenous gene49. These results reveal extensive communication between the transgenic and endogenous FWA gene copies during transformation, and this communication depends on the DNA methylation state of the endogenous gene. Surprisingly, these differences between fwa-1 epialleles are not accounted for by siRNA production, because the repeat-derived siRNAs accumulate equally in plants with wild-type FWA and those with fwa-1 (ref. 49). Hence, recruitment of siRNA machinery to a locus is not always sufficient for RNA-directed DNA methylation and probably also requires modifications of chromatin. Maintenance of silencing at FWA depends mainly on CG methylation, because met1 alleles generate hypomethylated fwa-1 epialleles at a high frequency39,40. Although the tandem repeats upstream of FWA are also methylated at non-CG sequences, loss of this methylation in drm1 drm2 cmt3 triple mutants does not cause reactivation and late flowering37. Genome-wide analysis of cytosine methylation and transcription in drm1 drm2 cmt3 triple mutants has identified genes with methylated promoters, the expression of which depends strongly on DRM- and CMT3-mediated non-CG methylation11. These methylated genes might be responsible for the developmental phenotypes of drm1 drm2 cmt3 triple mutants, which include misshapen leaves and reduced stature27,37. a b Heterozygote B-I ~10% Spontaneous conversion Paramutation B’ Crossing B’ with B-l B’ Wild-type endogenous gene FWA Transgene FWA fwa-1 FWA endogenous gene FWA Transgene fwa-1 FWA endogenous gene A. tumefaciens-mediated transformation Wild-type endogenous gene Transgene B-I B’ B’ FWA FWA DRM2 NRPD1B DCL3 NRPD1A DRD1 AGO4 RDR2 MOP1 Figure 3 | Trans-epiallele interactions at b1 and FWA. a, Paramutation at the b1 locus in maize. The B-I allele (pink) of the b1 gene in maize has an upstream tandem-repeat region (coloured arrows) and spontaneously gives rise to silenced Bʹ epialleles (blue) at a low frequency. Bʹ epialleles are more heavily methylated at cytosine bases in the repeat region and are less frequently transcribed. When the Bʹ epiallele is brought together with a new copy of B-I by crossing of maize plants, the B-I allele is paramutated to a silenced Bʹ state with 100% penetrance. Trans-communication between epialleles requires MOP1, the maize homologue of A. thaliana RDR2, suggesting that siRNA-mediated silencing might be involved in the conversion of B-I to Bʹ. b, De novo silencing of FWA transgenes in wildtype and fwa-1 A. thaliana. The FWA gene in wild-type A. thaliana (pink) is methylated at cytosine bases in a pair of tandem repeats in its promoter, silencing its expression. Mutations that decrease DNA methylation give rise to hypomethylated fwa-1 epialleles (blue), which overexpress the transcription factor FWA, thereby causing late flowering. Introduction of an unmethylated FWA transgene (green) by A. tumefaciens-mediated transformation of wild-type plants results in efficient methylation and silencing of the incoming transgene. This process depends on DRM2, AGO4, DCL3, RDR2, NRPD1A, NRPD1B and DRD1. By contrast, transformation of an fwa-1 background results in inefficient silencing of the transgene, indicating that the methylation state of endogenous FWA is important for transgene silencing. 421 NATURE|Vol 447|24 May 2007 INSIGHT REVIEW
INSIGHT REVIEW NATURE Vol 447 24 May 2007 In contrast to the independently segregating epialleles that arise in DNA glycosylase-lyase DEMETER(DME), which can directly excise backcrossing drmI drm2 cmt3 triple mutants to wild-type plants or differentiating extra-embryonic tissue, this mechanism does not neces- introducing either DRM2 or CMT3 by transformation immediately state remethylation of FWA. This is in contrast to mammals, in which rescues these morphological phenotypes". This finding suggests that demethylation of imprinted genes occurs in primordial germ cells(the non-CG methylation can be more easily re-established, possibly allowing cells that ultimately generate the germ line) and is followed by germline- flexible regulation of genes. However, it is unclear how commonly this specific remethylation and silencing(see page 425). Other imprinted type of regulation is used, because few examples of DNA-methylation- genes such as MEA and FERTILIZATION-INDEPENDENT SEED 2 also regulated plant genes have been described. have cytosine-methylated regions in their promoters that are associated with maternally restricted expression.However, only for FWA has Silencing through time and development it been shown that differential methylation of particular sequences is The life cycles of plants differ from those of animals in that the prod- required for the regulation of imprinting ss. ucts of meiosis undergo mitotic proliferation to form multicellular Cytosine demethylation is also likely to have an important role in gametophytes(that is, the embryo sac and the pollen in flowering the control of silencing in situations other than gametophytic genera The embryo sac(female) contains an egg cell, which is haploid, tion and imprinting. DMe belongs to a small A. thaliana gene family is fertilized by a sperm nucleus, which is also haploid, to form a that includes the somatically expressed gene REPRESSOR OF SILEN- embryo. A second sperm nucleus fertilizes the central cell, which CING I(ROSI)..Mutations in ROSI l en shown to increase is diploid, to form triploid endosperm, an extra-embryonic tissue that RNA-directed DNA methylation, and ROSI has been shown to func- endosperm show parent-of-origin-dependent monoallelic expression, ies have defined a long-sought cytosine demethylation pathway, and or imprinting, which is important for proper seed development. For they raise many interesting questions. For example, to what extent are example, in A thaliana, the tandem repeats of maternal FWA alleles are genomic methylation patterns balanced by the targeting of de novo specifically demethylated in the central cell and the endosperm, lead DNA methyltransferases and DNA glycosylases? Furthermore, there ng to expression of FWA in these tissues. Demethylation and activa- are indications of a similar mechanism for cytosine demethylation in tion of FWA depend on maternal expression of the gene encoding the vertebrates Adult plant Vegetative c Flowering Ovary Anther Germination Flower FLC Embryo sac Mitosis Pollen Figure 4 I PeG-protein-mediated silencing throughout the A thaliana be induced by other cues. d, During flower development, the anthers cycle. The activation state of the PcG protein target FLC is illustrated d ovaries are sites of meiotic differentiation, giving rise to haploid throughout the plant life cycle. a, FLC is transcriptionally active in seeds cells known as microspores and megaspores, respectively. e, These and seedlings, preventing the plant from flowering and prolonging meiotic products undergo mitotic proliferation to form the multicellular vegetative development. b, Exposure to a long period of cold(that embryo sac and pollen gametophytes. f, PcG-protein-mediated vernalization)results in the expression of VIN3(red), which initiates repression at FLC is removed during an undefined resetting proce repression of FLC transcription, and the binding of the PcG protein VRN2, g, Then, the pollen contributes sperm nuclei to the embr ac, and these well as VRNI and LHPl(blue). In this process, chromatin at FLC is fertilize the haploid egg cell and diploid central cell (not shown), formi pigenetically modified by the trimethylation of H3K27. c, After warmer nbr n anew seed. in which flc is temperatures return, FLC repression is maintained, allowing flowering to re-expressed @2007 Nature Publishing Group
In contrast to the independently segregating epialleles that arise in met1 mutants (as a result of the stable loss of CG methylation)39,40,51, backcrossing drm1 drm2 cmt3 triple mutants to wild-type plants or reintroducing either DRM2 or CMT3 by transformation immediately rescues these morphological phenotypes27. This finding suggests that non-CG methylation can be more easily re-established, possibly allowing flexible regulation of genes. However, it is unclear how commonly this type of regulation is used, because few examples of DNA-methylationregulated plant genes have been described. Silencing through time and development The life cycles of plants differ from those of animals in that the products of meiosis undergo mitotic proliferation to form multicellular gametophytes (that is, the embryo sac and the pollen in flowering plants). The embryo sac (female) contains an egg cell, which is haploid, and this is fertilized by a sperm nucleus, which is also haploid, to form a diploid embryo. A second sperm nucleus fertilizes the central cell, which is diploid, to form triploid endosperm, an extra-embryonic tissue that has a supportive role during embryogenesis. The central cell and the endosperm show parent-of-origin-dependent monoallelic expression, or imprinting, which is important for proper seed development52. For example, in A. thaliana, the tandem repeats of maternal FWA alleles are specifically demethylated in the central cell and the endosperm, leading to expression of FWA in these tissues53. Demethylation and activation of FWA depend on maternal expression of the gene encoding the DNA glycosylase–lyase DEMETER (DME), which can directly excise the base 5-methylcytosine54–56. Because the endosperm is a terminally differentiating extra-embryonic tissue, this mechanism does not necessitate remethylation of FWA53. This is in contrast to mammals, in which demethylation of imprinted genes occurs in primordial germ cells (the cells that ultimately generate the germ line) and is followed by germlinespecific remethylation and silencing (see page 425). Other imprinted genes such as MEA and FERTILIZATION-INDEPENDENT SEED 2 also have cytosine-methylated regions in their promoters that are associated with maternally restricted expression55,57. However, only for FWA has it been shown that differential methylation of particular sequences is required for the regulation of imprinting53,58. Cytosine demethylation is also likely to have an important role in the control of silencing in situations other than gametophytic generation and imprinting. DME belongs to a small A. thaliana gene family that includes the somatically expressed gene REPRESSOR OF SILENCING 1 (ROS1) 54,59. Mutations in ROS1 have been shown to increase RNA-directed DNA methylation, and ROS1 has been shown to function as a cytosine demethylase56,59,60. Together, these exciting discoveries have defined a long-sought cytosine demethylation pathway, and they raise many interesting questions. For example, to what extent are genomic methylation patterns balanced by the targeting of de novo DNA methyltransferases and DNA glycosylases? Furthermore, there are indications of a similar mechanism for cytosine demethylation in vertebrates61,62. Flowering Flower Vegetative development Adult plant Germination Fertilization Seed Seedling Megaspore Embryo sac Pollen Microspore Resetting Meiosis Mitosis Mitosis Anther FLC VIN3 FLC FLC Ovary Vernalization FLC FLC × FLC × × a b c d e f g VRN2 VRN2 LHP1 LHP1 VRN1 VRN1 LHP1 VRN1 VRN2 Figure 4 | PcG-protein-mediated silencing throughout the A. thaliana life cycle. The activation state of the PcG protein target FLC is illustrated throughout the plant life cycle. a, FLC is transcriptionally active in seeds and seedlings, preventing the plant from flowering and prolonging vegetative development. b, Exposure to a long period of cold (that is, vernalization) results in the expression of VIN3 (red), which initiates repression of FLC transcription, and the binding of the PcG protein VRN2, as well as VRN1 and LHP1 (blue). In this process, chromatin at FLC is epigenetically modified by the trimethylation of H3K27. c, After warmer temperatures return, FLC repression is maintained, allowing flowering to be induced by other cues. d, During flower development, the anthers and ovaries are sites of meiotic differentiation, giving rise to haploid cells known as microspores and megaspores, respectively. e, These meiotic products undergo mitotic proliferation to form the multicellular embryo sac and pollen gametophytes. f, PcG-protein-mediated repression at FLC is removed during an undefined resetting process. g, Then, the pollen contributes sperm nuclei to the embryo sac, and these fertilize the haploid egg cell and diploid central cell (not shown), forming the embryo and endosperm (respectively) in a new seed, in which FLC is re-expressed. 422 INSIGHT REVIEW NATURE|Vol 447|24 May 2007
NATUREIVol 447 24 May 2007 INSIGHT REVIEW Other examples of imprinted genes are maize fertilization-independent The mechanism by which the vernalization-specific PcG-protein endospermI (fiel)and fie2, which show monoallelic expression from the complex is recruited to FLC is not well understood but is known to maternal allele during endosperm development. This is reflected by the require the PHD-finger-domain-containing protein VERNALIZATION promoters of the silent paternal alleles having differentially methylated INSENSITIVE 3(VIN3). Because VIN3 expression is induced after regions(DMRs),b. Analysis of DMR methylation of fie alleles in sperm, cold treatment, this protein might be a component of the signalli egg and central cells showed interesting differences in the mechanism pathway that recruits PcG-protein-mediated repression to FLO for imprinting fiel and fie2(ref. 64). The DMR fiel is heavily methyl-( Fig. 4). Recently, the A. thaliana homologue of D. melanogaster ated in all three cell types, but the maternal alleles in the central cell Heterochromatin protein 1(HP1)-LIKE HETEROCHROMATIN hich contribute to theendosperm) become specifically demethylated, PROTEIN 1(LHPl; also known as TFL2)-was found to be required sembling the imprinting mechanism described for A thaliana FWA. for the maintenance of FLC silencing after vernalization.. LHPI By contrast, the DMR of fie 2 is unmethylated in all gametes, although becomes associated with the silenced FLClocus, a process that depends the paternal allele becomes methylated de novo in the endosperm. on an intronic sequence element. The role of LHPl in the repression Furthermore, the fie2 DMR also showed extensive non-CG methylation, of PcG-protein-regulated genes differs markedly from the main role of which is consistent with a DRM2-type-mediated RNA-directed DNA animal HPl in heterochromatic silencing (see page 399). The dNA methylation process". A further instance of potential gene regulation by binding protein VRNI is also required for the maintenance of FLC de novo DNA methylation is provided by the Brassica rapa SPIl locus, silencing and associates with mitotic chromosomes.7. Interestingly, hich encodes a pollen self-incompatibility determinant. The B rapa VRNI is absent from meiotic chromosomes of developing pollen".One self-incompatibility phenotype is controlled by dominance relation- speculation is that this absence is associated with the resetting of FLC ships between S-haplotypes, and recessive SPIl alleles were found to be expression, which leads to a requirement for vernalization, at the start specifically methylated de novo and silenced in the anther tapetal tis- of each generation. Indeed, all PcG-protein-mediated silencing might sues. It will be interesting to determine the prevalence of such instances be reset at some point during meiosis or gametogenesis, through an of tissue-specific gene regulation by DNA methylation unknown mechanism(Fig 4). In addition to the gametophytic tissues being an important location for the establishment of imprinted gene expression, they also maintain Conclusions pre-existing patterns of cytosine methylation. Evidence that silencing Plants continue to be excellent systems for the study of epigenetics, and ortant during g vided by null metI their silencing mechanisms have marked similarities to those of mam alleles in A thaliana, which produce hypomethylated epialleles even mals. An advantage of using plants is that they are tolerant of genor when the individual is heterozygous for the null allele. This is caused by stresses, such as large losses of DNA methylation and changes in chro loss of cytosine methylation in the gametophytes of metI mutants, aloss mosome number. The elegant genetic tools available for organisms such that is greater when metI is inherited through the female gametophyte as maize and A thaliana are facilitating the dissection of epigenetic than the male. This difference is probably accounted for by the female control. Recent advances such as the development of whole-genome gametophyte(that is, the embryo sac)undergoing one more postmeiotic microarrays and high-throughput sequencing are allowing the gen- round of DNA replication before fertilization than the male gametophyte eration of large-scale data sets for epigenetic modifications and small RNAs that are extending our view to a genome-wide scale. Together, a different epigenetic system used to de ally silence genes these approaches should enable major advances in our understand during plant life cycles involves Polycomb group(PcG)proteins .a ing of epigenetics to be made using plar for examp onserved complex known as Polycomb repressive complex 2(PRC2) specific chromatin modifications are established and maintained, functions to maintain patterns of gene repression in both plants and how they influence one another, and the extent to which they are used there are several PRC2 complexes, with overlapping subunit composi- for fields as diverse as cancer biology, development and evolution. tions, specialized for distinct developmental roles. For example, the PcG proteins have an important role in the regulation of imprinted 1 Gregory, T R The C-walue enigma in plants and animals: a review of parallels and an appeal gene expression. A thaliana MEA, which is a homologue of Drosophila 2. HalL, I.M. &Grewal S 1. in RNAi A Guide to Gene Silencing (ed. Hannon, G J)205-232 Enhancer of zeste, shows maternally ( Cold Spring Harbor Laboratory Press, Woodbury, 2003 sion. An important component of MEA imprinting is repression of in, B E, Meissner, A Lander, E.S. The epigenome Cell 128, 669-681(2006). the paternal MEA allele in the endosperm, and this process has been 4. ard, P etal Requirement of heterochromatin for cohesion at centromeres. Science 294,2539-2542(2001) found to involve MEA autoregulation, using H3K27 trimethylation 68. 9. 5. Bejerano, G et al. A distal enhancerand an ultraconserved exon are derived from a novel Interestingly, t the mammalian PcG protein EED (embryonic ectoderm 6. Liu. i, He, Y, Amasino.. Chen. x siR As targeting nitronic tR 25 287t 02004) control of imprinted gene expression ing elements to transposons: Barbara McClintock and the Another well-understood example of PcG-protein-mediated regula- 8. Chandler, V.L.& Stam. M Chromati tion in plants involves silencing of the floral-repressor gene FLOWER paramutation. Nature Rev Genet 5, 532-544 (2004) ING LOCUS C(FLC) during the vernalization response in A. thaliana"- 9. Hamilton A 1. Baulcombe, D. CA species of smallantisense RNA in posttranscriptional (Fig. 4).Expression of FLC, which encodes a MADS-box-containing 10 Wassenege MP Himes sriedel l& sangeh.l of the plant tolong periods of cold( ernalization)".In nat9mMm5分理上小mhn resolution mapping and functional analysis of DNA 甲 ng conditions. After the cold signal has been removed. FLCsilencin12国的面是5图 is stable". Mutations in the VERNALIZATION 2(VRN2)gene, which encodes a homologue of the D. melanogaster PcG protein Suppressor 13. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana Nature 408, of zeste 12, cause late flowering after vernalization as a result of high 14. Fransz, P. E et al. High-resolution physical mapping in Arabidopsis thaliana and tomato by levels of FLCexpression?". Interestingly, vrn2 mutants can silence FLC 15. Lippman, Zet fluorescence in situ hybridization to extended DNA fibres Plant1.9, 421-430(19 expression during the cold but fail to maintain this repression after the transposable elements in heterochromatin and epigenetic control. cold signal has been removed". VRN2 is also required for acquisition of 16. Volpe, T A et al. Regulationof heterochromatic silencing and histone H3 lysine-9 H3K27 dimethylation and trimethylation at FLC during vernalization consistent with the known functions of PRC2 in maintaining patterns NA methylation in Arabidopsis. Proc Natl Acad. Sci. USA 99(suppL 4), 16499-16506 of gene repression 423 @2007 Nature Publishing Group
Other examples of imprinted genes are maize fertilization-independent endosperm1 (fie1) and fie2, which show monoallelic expression from the maternal allele during endosperm development. This is reflected by the promoters of the silent paternal alleles having differentially methylated regions (DMRs)63,64. Analysis of DMR methylation of fie alleles in sperm, egg and central cells showed interesting differences in the mechanism for imprinting fie1 and fie2 (ref. 64). The DMR of fie1 is heavily methylated in all three cell types, but the maternal alleles in the central cell (which contribute to the endosperm) become specifically demethylated, resembling the imprinting mechanism described for A. thaliana FWA64. By contrast, the DMR of fie2 is unmethylated in all gametes, although the paternal allele becomes methylated de novo in the endosperm. Furthermore, the fie2 DMR also showed extensive non-CG methylation, which is consistent with a DRM2-type-mediated RNA-directed DNA methylation process64. A further instance of potential gene regulation by de novo DNA methylation is provided by the Brassica rapa SP11 locus, which encodes a pollen self-incompatibility determinant65. The B. rapa self-incompatibility phenotype is controlled by dominance relationships between S-haplotypes, and recessive SP11 alleles were found to be specifically methylated de novo and silenced in the anther tapetal tissues65. It will be interesting to determine the prevalence of such instances of tissue-specific gene regulation by DNA methylation. In addition to the gametophytic tissues being an important location for the establishment of imprinted gene expression, they also maintain pre-existing patterns of cytosine methylation. Evidence that silencing is important during gametophytic generation is provided by null met1 alleles in A. thaliana, which produce hypomethylated epialleles even when the individual is heterozygous for the null allele40. This is caused by loss of cytosine methylation in the gametophytes of met1 mutants, a loss that is greater when met1 is inherited through the female gametophyte than the male40. This difference is probably accounted for by the female gametophyte (that is, the embryo sac) undergoing one more postmeiotic round of DNA replication before fertilization than the male gametophyte (that is, the pollen)40. A different epigenetic system used to developmentally silence genes during plant life cycles involves Polycomb group (PcG) proteins66. A conserved complex known as Polycomb repressive complex 2 (PRC2) functions to maintain patterns of gene repression in both plants and animals, using H3K27 methylation66 (see page 425). However, in plants, there are several PRC2 complexes, with overlapping subunit compositions, specialized for distinct developmental roles66. For example, the PcG proteins have an important role in the regulation of imprinted gene expression. A. thaliana MEA, which is a homologue of Drosophila melanogaster Enhancer of zeste, shows maternally imprinted expression67. An important component of MEA imprinting is repression of the paternal MEA allele in the endosperm, and this process has been found to involve MEA autoregulation, using H3K27 trimethylation55,68,69. Interestingly, the mammalian PcG protein EED (embryonic ectoderm development) has also been shown to have an important role in the control of imprinted gene expression70. Another well-understood example of PcG-protein-mediated regulation in plants involves silencing of the floral-repressor gene FLOWERING LOCUS C (FLC) during the vernalization response in A. thaliana71–73 (Fig. 4). Expression of FLC, which encodes a MADS-box-containing transcription factor, delays flowering and can be silenced by exposure of the plant to long periods of cold (that is, vernalization)71–73. In nature, this cold treatment occurs in winter and leads to flowering in favourable spring conditions. After the cold signal has been removed, FLC silencing is stable71–73. Mutations in the VERNALIZATION 2 (VRN2) gene, which encodes a homologue of the D. melanogaster PcG protein Suppressor of zeste 12, cause late flowering after vernalization as a result of high levels of FLC expression72. Interestingly, vrn2 mutants can silence FLC expression during the cold but fail to maintain this repression after the cold signal has been removed72. VRN2 is also required for acquisition of H3K27 dimethylation and trimethylation at FLC during vernalization, consistent with the known functions of PRC2 in maintaining patterns of gene repression71,73,74. The mechanism by which the vernalization-specific PcG-protein complex is recruited to FLC is not well understood but is known to require the PHD-finger-domain-containing protein VERNALIZATION INSENSITIVE 3 (VIN3)73. Because VIN3 expression is induced after cold treatment, this protein might be a component of the signalling pathway that recruits PcG-protein-mediated repression to FLC73 (Fig. 4). Recently, the A. thaliana homologue of D. melanogaster Heterochromatin protein 1 (HP1) — LIKE HETEROCHROMATIN PROTEIN 1 (LHP1; also known as TFL2) — was found to be required for the maintenance of FLC silencing after vernalization75,76. LHP1 becomes associated with the silenced FLC locus, a process that depends on an intronic sequence element76. The role of LHP1 in the repression of PcG-protein-regulated genes differs markedly from the main role of animal HP1 in heterochromatic silencing (see page 399). The DNAbinding protein VRN1 is also required for the maintenance of FLC silencing and associates with mitotic chromosomes75,77. Interestingly, VRN1 is absent from meiotic chromosomes of developing pollen75. One speculation is that this absence is associated with the resetting of FLC expression, which leads to a requirement for vernalization, at the start of each generation. Indeed, all PcG-protein-mediated silencing might be reset at some point during meiosis or gametogenesis, through an unknown mechanism (Fig. 4). Conclusions Plants continue to be excellent systems for the study of epigenetics, and their silencing mechanisms have marked similarities to those of mammals. An advantage of using plants is that they are tolerant of genome stresses, such as large losses of DNA methylation and changes in chromosome number. The elegant genetic tools available for organisms such as maize and A. thaliana are facilitating the dissection of epigenetic control. Recent advances such as the development of whole-genome microarrays and high-throughput sequencing are allowing the generation of large-scale data sets for epigenetic modifications and small RNAs that are extending our view to a genome-wide scale. Together, these approaches should enable major advances in our understanding of epigenetics to be made using plant systems: for example, how specific chromatin modifications are established and maintained, how they influence one another, and the extent to which they are used throughout the genome. This work should provide important insight for fields as diverse as cancer biology, development and evolution. ■ 1. Gregory, T. R. The C-value enigma in plants and animals: a review of parallels and an appeal for partnership. Ann. Bot. (Lond.) 95, 133–146 (2005). 2. Hall, I. M. & Grewal, S. I. in RNAi: A Guide to Gene Silencing (ed. Hannon, G. J.) 205–232 (Cold Spring Harbor Laboratory Press, Woodbury, 2003). 3. Bernstein, B. E., Meissner, A. & Lander, E. S. The epigenome. Cell128, 669–681 (2006). 4. Bernard, P. et al. Requirement of heterochromatin for cohesion at centromeres. Science 294, 2539–2542 (2001). 5. Bejerano, G. et al. A distal enhancer and an ultraconserved exon are derived from a novel retroposon. Nature 441, 87–90 (2006). 6. Liu, J., He, Y., Amasino, R. & Chen, X. siRNAs targeting an intronic transposon in the regulation of natural flowering behavior in Arabidopsis. Genes Dev.18, 2873–2878 (2004). 7. Comfort, N. C. From controlling elements to transposons: Barbara McClintock and the Nobel Prize. Trends Biochem. Sci. 26, 454–457 (2001). 8. Chandler, V. L. & Stam, M. Chromatin conversations: mechanisms and implications of paramutation. Nature Rev. Genet. 5, 532–544 (2004). 9. Hamilton, A. J. & Baulcombe, D. C. A species of small antisense RNA in posttranscriptional gene silencing in plants. Science 286, 950–952 (1999). 10. Wassenegger, M., Heimes, S., Riedel, L. & Sanger, H. L. RNA-directed de novo methylation of genomic sequences in plants. Cell 76, 567–576 (1994). 11. Zhang, X. et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in Arabidopsis. Cell126, 1189–1201 (2006). 12. Zilberman, D., Gehring, M., Tran, R. K., Ballinger, T. & Henikoff, S. Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nature Genet. 39, 61–69 (2007). 13. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000). 14. Fransz, P. F. et al. High-resolution physical mapping in Arabidopsis thaliana and tomato by fluorescence in situ hybridization to extended DNA fibres. Plant J. 9, 421–430 (1996). 15. Lippman, Z. et al. Role of transposable elements in heterochromatin and epigenetic control. Nature 430, 471–476 (2004). 16. Volpe, T. A. et al. Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi. Science 297, 1833–1837 (2002). 17. Aufsatz, W., Mette, M. F., van der Winden, J., Matzke, A. J. & Matzke, M. RNA-directed DNA methylation in Arabidopsis. Proc. Natl Acad. Sci. USA 99 (suppl. 4), 16499–16506 (2002). 423 NATURE|Vol 447|24 May 2007 INSIGHT REVIEW
INSIGHT REVIEW NATURE Vol 447 24 May 2007 8. Mochizuki, K, Fine, N A, Fujisawa, T. Gorowsky, M. A. Analysis of late ates small RNas in genome rearrangement in Tetrahymena Cell 110, 689-699 function epigenetic alleles of a homeodomain gene mol Cell 6, 791-802(2000) 52. Gehring, M, Choi, Y. Fischer, R L Imprinting and seed development. Plant Cell 16, M, Matzke, AJ& Kooter, I M RNA: guiding gene silencing. Science 293, 53. Kinoshita, Tet al. One-way control of FWA imprinting in Arabidopsis endosperm by DNA 20. Lu, Cet al Elucidation of the small RNA component of the transcriptome. Science 309 ethylation. Science 303, 521-523(2004). 54. Choi,Yet al. DEMETER, a dna glycosylase domain protein, is 21. Cao, Xetal Role of the DRM and cmt3 methyltransferases in RNA-directed DNA methylation. Curt BioL. 13, 2212-2217(200 55. Gehring. M. et al DEMETER DNA glycosylase establishes MedEa polycomb gene self- 22. Cao, X& Jacobsen, S.E. Role of the Arabidopsis DRM methyltransferases in de novo DNA 56. Morales-Ruiz, Tet al. DEMETER and REPRESs/124,495-506(2006) printing by allele-specific demethylation S. W. et al RNA silencing genes control de nowo DNA methylation. Science 303, 1336 52. ng 24. Zilberman, D et al Role of Arabidopsis ARGONAUTE4 in RNA-directed DNA methylation the arabidopsis life cycle is essential for parental imprinting. Plant Cell 18, 1360-1372 iggered by inverted repeats. Curr Biol. 14, 1214-1220(2004). 25. 58. Kinoshita, Y et al Control of FwA gene silencing in Arabidopsis thaliana by SINe-related direct repeats. Plant 149, 38-45(2007). inscriptional gene silencing in Arabidopsis, enco odesa 26. Xie, Z etal. Genetic and functional diversification of small rNa pathways in plants. o.2,e104(2004) 0. Agius, F. Kapoor, A& Zhu, J K Role of the Arabidopsis dna glycosylase/lyase ROSI in ) Genet. 2, e83(2006 61. Barreto, G et al. Gadd45a promotes epigenetic gene activation by repair mediated DNA Cajal bodies in Arabidopsis thaliana Cell 126, 93-106(2006) 62. Jost, IP, Siegmann, M, Sun, L&Leung, R Mechanisms of DNA demethylation in chicker 30. Qi Yet al. Dis lytic roles of ARGONAUTE4 in RNA-directed 63. Danilevskaya, O.N. et. Duplicated fie genes in maize: expression pattern and imprinting NA methylation Nature 443, 1008-1012(2006) suggest distinct functions. Plant Cell 15, 425-438(2003) 31. Zilberman, D, Cao, X& Jacobsen, S E ARGONAUTE4 control of locus-specific siRNA 64. Gutierrez-Marcos, J.F. et al. Epigenetic asymmetry of imprinted genes in plant gametes cumulation and dNa and histone methylation Science 299, 716-719(2003). 32. Herr, AJ, Jensen, M. B, Dalmay, T& balcomb RNA polymerase IV directs 65. Shiba, H et al Dominance relationships between self-incompatibility alleles controlled by encing of endogenous DNA. Science 308, 118-120(2005 ubunits required for RNA-directed DNA 66. Kohler, C. Grossniklaus, U. Epigenetic inheritance of expression states in plant evelopment: the role of Polycomb group proteins. Cum Opin. Cell BioL. 14, 773-779 mediates siRNA and dna 67. Kinoshita, T- Yadegari, R, Harada, I. Goldberg, R.B. Fische printing of the 35. Pontier, D et al. Reinforcement of silencing at transposons and highly repeated sequences MEDEA polyc Plant Ce111945-1952(1999) quires the concerted action of two distinct RNa polymerases IV in Arabidopsis Genes ev1920 nnte d eN A Imthvement otp baiti e boz-osmotogemodeling protein DRDl in 37. Cao, X&Jacobsen, S E Locus-specific control of asymmetric and CpNpg methylation 69. Jullien, P.E, Katz, A, Oliva, M, Ohad, N& Berger, F. Polycomb group complexes self- gulate imprinting of the Polycomb group gene MEDEA in Arabidopsis. Curr Biol. 16, the dRM and Cmt3 methyltransferase genes. Proc Natl Acad. Sci USA99(suppL 4) SMG以H对mm20M上 omery, N D, de Villena, F.P.& Magnuson, T Genome imprin理 ankel, M. W. etal Arabidopsis mETi cytosine methyltransferase mutants. Genetics 163, 71. Bastow, R etal vernalization requires epigenetic silencing of FLC by histone methylation. Nature427164-167(2004) 40. Saze, H, Mittelsten Scheid, O. Paszkowski, I. Maintenance of Cp 72. Gendall, A R, Levy, Y Y Wilson, A. Dean, C The VERNALIZATION 2 gene mediates the 41. Jackson, I P, Lindroth, A M, Cao, X& Jacobsen, S. E Control of CpNpG DNA methylation finger protein VIN3 Nature 427, 159-164(200 the KRYPtonite histone H3 methyltransferase. Nature 416, 556-560(2002). Sung, S, Schmitz, R.J.& Amasino, R. M. A PHD finger protein involved in both the vernalization and photoperiod pathways in Arabidopsis. Genes Dev. 20, 3244-3248 ablishment of DNA methylation. EMBO1 21, 6842-6852(2002). Hypermethylated SUPERMANepigenetic alleles in 75. JSetal. LHP1, the Arabidopis OCHROMATIN PROTEINI is Arabidopsis Science 277, 1100-1103(1997) required for epigenetic silencing of FLC. Proc Natl Acad. Sci. USA 103, 5012-5017(2006). quires LIKE HETEROCHROMATIN PROTEIN 1. Nature Genet. 38, 706-710 (2006). 45. Stam, M. et al. The regulatory regions required for B'paramutation and expression are 77. Levy, Y.Y, Mesnage, S, Mylne, J$, Gendall, A.R.& Dean, C. Multiple cated far upstream of the maize b1 transcribed sequences. Genetics 162, 917-930 VRNI in vernalization and flowering time controL Science 297, 243-246(2002). 46. Stam, M, Belele, C, Dorweiler, J.E.& Chandler, V L Differential chromatin structure within Acknowled Chan, C Fei Li, K. Niakan, M. Ong and all members of the Jacobsen laboratory for useful comments and discussion. We apologize to 4 colleagues whose research we did not have space to discuss. l.R. H was supported 8(x ndent RNa polymerase is required for paramutation in a long-term fellowship from the European molecular Biology Organization, a 48. Woodhouse, M.R, Freeling, M. & Lisch, D Initiation, establishment, and maintenance of pecial Fellow grant from The Leukemia Lymphoma Society, and a grant from the eritable Mu DR transposon silencing in maize are mediated by distinct factors. PLoS Biol. National Institutes of Health. SE J is an investigator of the Howard Hughes Medical hang X, Bernatavichute, Y sen, S.E. Two-step recruitment of repeats. PLoS Biol. 4, e363(2 Author Information Reprints and permissions information is available at ler, V..A npg. nature. com/reprintsandpermissions. The authors declare no competing nspac nylation and silencing. financial interests. Correspondence should be addressed to S.EJ jacobsen@ucla. edu) 424 @2007 Nature Publishing Group
18. Mochizuki, K., Fine, N. A., Fujisawa, T. & Gorovsky, M. A. Analysis of a piwi-related gene implicates small RNAs in genome rearrangement in Tetrahymena. Cell110, 689–699 (2002). 19. Matzke, M., Matzke, A. J. & Kooter, J. M. RNA: guiding gene silencing. Science 293, 1080–1083 (2001). 20. Lu, C. et al. Elucidation of the small RNA component of the transcriptome. Science 309, 1567–1569 (2005). 21. Cao, X. et al. Role of the DRM and CMT3 methyltransferases in RNA-directed DNA methylation. Curr. Biol.13, 2212–2217 (2003). 22. Cao, X. & Jacobsen, S. E. Role of the Arabidopsis DRM methyltransferases in de novo DNA methylation and gene silencing. Curr. Biol.12, 1138–1144 (2002). 23. Chan, S. W. et al. RNA silencing genes control de novo DNA methylation. Science 303, 1336 (2004). 24. Zilberman, D. et al. Role of ArabidopsisARGONAUTE4 in RNA-directed DNA methylation triggered by inverted repeats. Curr. Biol.14, 1214–1220 (2004). 25. Henderson, I. R. et al. Dissecting Arabidopsis thaliana DICER function in small RNA processing, gene silencing and DNA methylation patterning. Nature Genet. 38, 721–725 (2006). 26. Xie, Z. et al. Genetic and functional diversification of small RNA pathways in plants. PLoS Biol. 2, e104 (2004). 27. Chan, S. W. et al. RNAi, DRD1, and histone methylation actively target developmentally important non-CG DNA methylation in Arabidopsis. PLoS Genet. 2, e83 (2006). 28. Li, C. F. et al. An ARGONAUTE4-containing nuclear processing center colocalized with Cajal bodies in Arabidopsis thaliana. Cell126, 93–106 (2006). 29. Pontes, O. et al. The Arabidopsis chromatin-modifying nuclear siRNA pathway involves a nucleolar RNA processing center. Cell126, 79–92 (2006). 30. Qi, Y. et al. Distinct catalytic and non-catalytic roles of ARGONAUTE4 in RNA-directed DNA methylation. Nature 443, 1008–1012 (2006). 31. Zilberman, D., Cao, X. & Jacobsen, S. E. ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science 299, 716–719 (2003). 32. Herr, A. J., Jensen, M. B., Dalmay, T. & Baulcombe, D. C. RNA polymerase IV directs silencing of endogenous DNA. Science 308, 118–120 (2005). 33. Kanno, T. et al. Atypical RNA polymerase subunits required for RNA-directed DNA methylation. Nature Genet. 37, 761–765 (2005). 34. Onodera, Y. et al. Plant nuclear RNA polymerase IV mediates siRNA and DNA methylation-dependent heterochromatin formation. Cell120, 613–622 (2005). 35. Pontier, D. et al. Reinforcement of silencing at transposons and highly repeated sequences requires the concerted action of two distinct RNA polymerases IV in Arabidopsis. Genes Dev.19, 2030–2040 (2005). 36. Kanno, T. et al. Involvement of putative SNF2 chromatin remodeling protein DRD1 in RNAdirected DNA methylation. Curr. Biol.14, 801–805 (2004). 37. Cao, X. & Jacobsen, S. E. Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc. Natl Acad. Sci. USA 99 (suppl. 4), 16491–16498 (2002). 38. Goll, M. G. & Bestor, T. H. Eukaryotic cytosine methyltransferases. Annu. Rev. Biochem. 74, 481–514 (2005). 39. Kankel, M. W. et al. Arabidopsis MET1 cytosine methyltransferase mutants. Genetics163, 1109–1122 (2003). 40. Saze, H., Mittelsten Scheid, O. & Paszkowski, J. Maintenance of CpG methylation is essential for epigenetic inheritance during plant gametogenesis. Nature Genet. 34, 65–69 (2003). 41. Jackson, J. P., Lindroth, A. M., Cao, X. & Jacobsen, S. E. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature 416, 556–560 (2002). 42. Malagnac, F., Bartee, L. & Bender, J. An Arabidopsis SET domain protein required for maintenance but not establishment of DNA methylation. EMBO J. 21, 6842–6852 (2002). 43. Jacobsen, S. E. & Meyerowitz, E. M. Hypermethylated SUPERMAN epigenetic alleles in Arabidopsis. Science 277, 1100–1103 (1997). 44. Herman, H. et al. Trans allele methylation and paramutation-like effects in mice. Nature Genet. 34, 199–202 (2003). 45. Stam, M. et al. The regulatory regions required for Bʹ paramutation and expression are located far upstream of the maize b1 transcribed sequences. Genetics162, 917–930 (2002). 46. Stam, M., Belele, C., Dorweiler, J. E. & Chandler, V. L. Differential chromatin structure within a tandem array 100 kb upstream of the maize b1 locus is associated with paramutation. Genes Dev.16, 1906–1918 (2002). 47. Alleman, M. et al. An RNA-dependent RNA polymerase is required for paramutation in maize. Nature 442, 295–298 (2006). 48. Woodhouse, M. R., Freeling, M. & Lisch, D. Initiation, establishment, and maintenance of heritable MuDR transposon silencing in maize are mediated by distinct factors. PLoS Biol. 4, e339 (2006). 49. Chan, S. W.-L., Zhang, X., Bernatavichute, Y. V. & Jacobsen, S. E. Two-step recruitment of RNA-directed DNA methylation to tandem repeats. PLoS Biol. 4, e363 (2006). 50. Lisch, D., Carey, C. C., Dorweiler, J. E. & Chandler, V. L. A mutation that prevents paramutation in maize also reverses Mutator transposon methylation and silencing. Proc. Natl Acad. Sci. USA 99, 6130–6135 (2002). 51. Soppe, W. J. et al. The late flowering phenotype of fwa mutants is caused by gain-offunction epigenetic alleles of a homeodomain gene. Mol. Cell 6, 791–802 (2000). 52. Gehring, M., Choi, Y. & Fischer, R. L. Imprinting and seed development. Plant Cell16, S203–S213 (2004). 53. Kinoshita, T. et al. One-way control of FWA imprinting in Arabidopsis endosperm by DNA methylation. Science 303, 521–523 (2004). 54. Choi, Y. et al. DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in Arabidopsis. Cell110, 33–42 (2002). 55. Gehring, M. et al. DEMETER DNA glycosylase establishes MEDEA polycomb gene selfimprinting by allele-specific demethylation. Cell124, 495–506 (2006). 56. Morales-Ruiz, T. et al. DEMETER and REPRESSOR OF SILENCING 1 encode 5- methylcytosine DNA glycosylases. Proc. Natl Acad. Sci.USA 103, 6853–6858 (2006). 57. Jullien, P. E., Kinoshita, T., Ohad, N. & Berger, F. Maintenance of DNA methylation during the Arabidopsis life cycle is essential for parental imprinting. Plant Cell18, 1360–1372 (2006). 58. Kinoshita, Y. et al. Control of FWA gene silencing in Arabidopsis thaliana by SINE-related direct repeats. Plant J. 49, 38–45 (2007). 59. Gong, Z. et al. ROS1, a repressor of transcriptional gene silencing in Arabidopsis, encodes a DNA glycosylase/lyase. Cell111, 803–814 (2002). 60. Agius, F., Kapoor, A. & Zhu, J. K. Role of the Arabidopsis DNA glycosylase/lyase ROS1 in active DNA demethylation. Proc. Natl Acad. Sci. USA 103, 11796–11801 (2006). 61. Barreto, G. et al. Gadd45a promotes epigenetic gene activation by repair-mediated DNA demethylation. Nature 445, 671–675 (2007). 62. Jost, J. P., Siegmann, M., Sun, L. & Leung, R. Mechanisms of DNA demethylation in chicken embryos. Purification and properties of a 5-methylcytosine-DNA glycosylase. J. Biol. Chem. 270, 9734–9739 (1995). 63. Danilevskaya, O. N. et al. Duplicated fie genes in maize: expression pattern and imprinting suggest distinct functions. Plant Cell15, 425–438 (2003). 64. Gutierrez-Marcos, J. F. et al. Epigenetic asymmetry of imprinted genes in plant gametes. Nature Genet. 38, 876–878 (2006). 65. Shiba, H. et al. Dominance relationships between self-incompatibility alleles controlled by DNA methylation. Nature Genet. 38, 297–299 (2006). 66. Kohler, C. & Grossniklaus, U. Epigenetic inheritance of expression states in plant development: the role of Polycomb group proteins. Curr. Opin. Cell Biol.14, 773–779 (2002). 67. Kinoshita, T., Yadegari, R., Harada, J. J., Goldberg, R. B. & Fischer, R. L. Imprinting of the MEDEA polycomb gene in the Arabidopsis endosperm. Plant Cell11, 1945–1952 (1999). 68. Baroux, C., Gagliardini, V., Page, D. R. & Grossniklaus, U. Dynamic regulatory interactions of Polycomb group genes: MEDEA autoregulation is required for imprinted gene expression in Arabidopsis. Genes Dev. 20, 1081–1086 (2006). 69. Jullien, P. E., Katz, A., Oliva, M., Ohad, N. & Berger, F. Polycomb group complexes selfregulate imprinting of the Polycomb group gene MEDEA in Arabidopsis. Curr. Biol.16, 486–492 (2006). 70. Mager, J., Montgomery, N. D., de Villena, F. P. & Magnuson, T. Genome imprinting regulated by the mouse Polycomb group protein Eed. Nature Genet. 33, 502–507 (2003). 71. Bastow, R. et al. Vernalization requires epigenetic silencing of FLC by histone methylation. Nature 427, 164–167 (2004). 72. Gendall, A. R., Levy, Y. Y., Wilson, A. & Dean, C. The VERNALIZATION 2 gene mediates the epigenetic regulation of vernalization in Arabidopsis. Cell107, 525–535 (2001). 73. Sung, S. & Amasino, R. M. Vernalization in Arabidopsis thaliana is mediated by the PHD finger protein VIN3. Nature 427, 159–164 (2004). 74. Sung, S., Schmitz, R. J. & Amasino, R. M. A PHD finger protein involved in both the vernalization and photoperiod pathways in Arabidopsis. Genes Dev. 20, 3244–3248 (2006). 75. Mylne, J. S. et al. LHP1, the Arabidopsis homologue of HETEROCHROMATIN PROTEIN1, is required for epigenetic silencing of FLC. Proc. Natl Acad. Sci. USA 103, 5012–5017 (2006). 76. Sung, S. et al. Epigenetic maintenance of the vernalized state in Arabidopsis thaliana requires LIKE HETEROCHROMATIN PROTEIN 1. Nature Genet. 38, 706–710 (2006). 77. Levy, Y. Y., Mesnage, S., Mylne, J. S., Gendall, A. R. & Dean, C. Multiple roles of Arabidopsis VRN1 in vernalization and flowering time control. Science 297, 243–246 (2002). AcknowledgementsWe thank S. Chan, C. Fei Li, K. Niakan, M. Ong and all members of the Jacobsen laboratory for useful comments and discussion. We apologize to colleagues whose research we did not have space to discuss. I.R.H. was supported by a long-term fellowship from the European Molecular Biology Organization, a Special Fellow grant from The Leukemia & Lymphoma Society, and a grant from the National Institutes of Health. S.E.J is an investigator of the Howard Hughes Medical Institute. Author Information Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests. Correspondence should be addressed to S.E.J. (jacobsen@ucla.edu). 424 INSIGHT REVIEW NATURE|Vol 447|24 May 2007