Tuesday, June 14, 2011

DNA N6-adenine methyltransferases in eukaryotes

The N6 methylated adenine base (N6A) in DNA is widely present in prokaryotes but relatively uncommon in eukaryotes. However, right from the earliest days, DNA N6A methylation  has been  identified in several eukaryotic lineages including ciliates, chlorophyte algae and dinoflagellates. In some species 0.5-10% of the adenines in the genome may be modified. Ciliates in particular have been studied as a model for the regulatory role of adenine methylation (Click here to read articles on the same).  Until recently, the identity of the eukaryotic adenine methyltransferases was a mystery. A study in plants, reported a potential adenine methyltransferase, but it turns out to be an RNA methylase of the TRM11 family.


The N6A methylases and the related N4C methylases can be distinguished by a characteristic sequence signature in the active site (loop after strand-4) of the form [NDS]PP[YFW] (see structure above). However, this signature is present in both RNA and DNA adenine methylases and several RNA methylases are highly conserved across life.  The classical DNA N6A methylases (N6A-MTase or Dam), appear to have emerged from the HemK-RsmC-RsmD clade of RNA methylases (see figure from our previous study). Most prokaryotic N6A DNA methylases are found in R–M systems, which have been widely disseminated via lateral transfer across distantly related lineages, but in some instances such as in the E. coli Dam protein and in the Caulobacter CcrM, they have been taken up for host function. N4C is another amine group base methylation seen in DNA, but sequence analysis reveals that they have emerged from the N6A methylases on multiple occasions independently.


In a recent study, we now report at least six distinct families of DNA N6-adenine methylases (N6A-MTase or Dam) in eukaryotes. Several of these are specified by different types of mobile elements. For example the Dictyostelium DIRS-1-like retrotransposon element encodes a N6A methylase. This element is widely disseminated across eukaryotes and expanded in several distantly-related organisms. However, except certain chlorophyte algae the methylases in most other genomes are inactive. This suggests that in most other species the methylase domain is likely to only function as a DNA-binding regulatory protein instead of a methylase. Another family found only in Trichomonas  includes N6A-DNA methylases that are often fused to phage structural proteins. The CrRem1-like LTR-containing retrotransposons of Chlamydomonas encode a polyprotein with the  methylase fused to C-terminal aspartyl protease and reverse transcriptase domains. Another predicted mobile element seen in chlorophytes and certain chythrid fungi contains a Dam methylase that is fused to a ParB-like HTH domain. The pervasive presence of Dam-like methylases associated with distinct groups of transposons suggests that they might act in cis to control their own gene expression and mobility through methylation of specific adenines within themselves or in their vicinity. 


We also report eukaryotic N6A methylases that appear to be cellular enzymes with a role in chromatin organization. One of these, found across chlorophyte algae is a multidomain protein with the N6A methylase domain fused to one or more N-terminal BMB/PWWP and C-terminal PHD-X/ZF-CW domains. Additionally, they often contain PHD finger domains N-terminal to the methylase domain. The accessory domains function as adaptors that read the histone code. The architectural syntax suggests that these enzymes localize to specific regions of chromatin where the histones bear modified marks such as trimethylated lysines, which is followed by localized N6A or N4C methylation.


Most of these eukaryotic methylases are obviously related to the DNA N6A-methylases of mobile elements of prokaryotes, suggesting their derivation from prokaryotic mobile elements. The situation in ciliates, which are a model system for studying the regulatory role of N6A methylation in eukaryotes is quite different and interesting. Sequence searches failed to reveal any obvious DNA N6-MTase candidates. However, ciliates  have a distinctive paralogous version of the Ime4-like family of adenine methylases, which is fused to N-terminal ZZ Zn-fingers, a domain of the treble-clef fold also found in chromatin proteins such as Ada2 and CBP/p300. Given that all ciliates studied to date show substantial N6mA in DNA, and have no other candidate methylases to catalyze this reaction, we suggest that these ZZ-domain containing methylases indeed perform this function. Orthologous methylases of this ciliate version are found in amoeboflagellates like Naegleria and the rhodophyte alga Cyanidioschyzon, suggesting a possibly wider distribution for this form of adenine methylation across eukaryotes. 


The Ime4/MunI-like methylases have several interesting aspects. They are circularly permuted. Bacterial versions are part of Restriction-modification systems, suggesting that these are DNA methylases. In contrast, the classical eukaryotic IME/MT-A70 methylases, which are derived from bacterial methylases, are RNA methylases. Although ciliates have the regular IME4-like version, the presence of this distinct paralog and its architecture suggests a return to DNA methylation in these species!!


You can read about this in our detailed study on DNA methylation-demethylation systems (Click here).