New Tools for Screening for Enivronmentally Influenced Epigenetic Disease Risk Factors

We are studying how epigenetic modifications, changes to DNA structure that affect gene expression, without any DNA sequence changes, translate environmental exposures to disease risk. We have designed a new tool that will allow rapid screening to identify genes affected in any disease or exposure.
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

bioRxiv
bioRxiv bioRxiv

Creation and Validation of the First Infinium DNA Methylation Array for the Human Imprintome

Background Differentially methylated imprint control regions (ICRs) regulate the monoallelic expression of imprinted genes. Their epigenetic dysregulation by environmental exposures throughout life results in the formation of common chronic diseases. Unfortunately, existing Infinium methylation arrays lack the ability to profile these regions adequately. Whole genome bisulfite sequencing (WGBS) is the unique method able to profile these regions, but it is very expensive and it requires not only a high coverage but it is also computationally intensive to assess those regions. Findings To address this deficiency, we developed a custom methylation array containing 22,819 probes. Among them, 9,757 probes map to 1,088 out of the 1,488 candidate ICRs recently described. To assess the performance of the array, we created matched samples processed with the Human Imprintome array and WGBS, which is the current standard method for assessing the methylation of the Human Imprintome. We compared the methylation levels from the shared CpG sites and obtained a mean R2 = 0.569. We also created matched samples processed with the Human Imprintome array and the Infinium Methylation EPIC v2 array and obtained a mean R2 = 0.796. Furthermore, replication experiments demonstrated high reliability (ICC: 0.799-0.945). Conclusions Our custom array will be useful for replicable and accurate assessment, mechanistic insight, and targeted investigation of ICRs. This tool should accelerate the discovery of ICRs associated with a wide range of diseases and exposures, and advance our understanding of genomic imprinting and its relevance in development and disease formation throughout the life course. ### Competing Interest Statement NC, VBD, RS, HW, and TLM are employees of TruDiagnostic. The other authors declare no conflict of interest.

Background

Over the past 100 years, the primary causes of death in the developed world have shifted from infectious diseases to non-infectious diseases; in the United States in 1900, influenza and tuberculosis were the top two causes of death, whereas in 2018, heart disease and cancer were the top two causes of death.  In part, these shifts are due to the reduction of infectious diseases by improved medical technology, so that a larger proportion of the population is living long enough to develop age-related diseases.  However, demographic shifts towards an older population do not fully account for increased incidence of some diseases, such as metabolic syndrome, liver disease, and certain cancers.  These diseases have had dramatic increases in the percentage of people affected over the last 30 years.  There  are also diseases with disparities depending on sex and ethnic background, with differences in age of onset, rate of progression, and overall severity.

While there are genetic and environmental factors associated as risks for these diseases, they are not definitive that the disease will occur, and are only found in a subset of affected people.  A better understanding of what the risks are, and how genes and environment (nature and nurture) interact to cause disease, will give us new tools for early detection and intervention, and new possibilities for treatment.

Epigenetics

Epigenetic mechanisms control how DNA is packaged, by directly methylating DNA bases and modifying histone proteins that DNA wraps around, with the effect of regulating gene expression.  Epigenetic factors have an essential role in embryonic development, by guiding the specialization of cell types, and regulating the correct growth of tissues and organs.  However, this epigenetic regulation is sensitive to environmental exposures, and can change how an animal or person grows, based on the environment they live in.  Because epigenetic marks are actual chemical changes to DNA and proteins, they can be long-term recordings of what someone has lived through, and even be passed on to their children.  Like an inherited mutation, inherited epigenetic marks can create changes in offspring, but unlike mutations that that change the actual DNA sequence, epigenetic marks are flexible, and can be created and removed, so that their effects can be created and reversed. Therefore, determining how specific genes are controlled by epigenetic marks, and how the environment can affect these epigenetic marks, offers possibilities to understanding what raises and lowers the risks of complex non-contagious diseases, and how exposures from the modern world and lifestyle can be increasing the incidence of these diseases (Figure 1).

Figure 1 – Epigenetics as a mechanism for translating environmental exposures to outcomes.

Imprinting and Imprint Control Regions

We are particularly interested in the set of epigenetic markers that make up “Imprint Control Regions” (ICRs), which regulate “Imprinted genes” – genes which are expressed from only one of the two allelic copies.  ICRs are methylated on only one allele, determined by which parent the allele was inherited from, subsequently regulating the activation/silencing of the two alleles of target genes based on the parent of origin (Figure 2).  Imprinted genes are particularly important in development, as most of the known imprinted genes are developmental regulatory genes, with the set of genes expressed from the father’s copies tending to promote growth, and the set of genes expressed from the mother’s copies tending to restrict growth.  Keeping this imprinting under strict control is important because these genes are developmental regulators that must have specific expression amounts for proper growth.  Developmental disorders such as Prader-Willi and Angelman syndromes are due to complete loss of control of specific imprinted genes, and it is suspected that slight alterations in imprinted gene expression affect the development of disorders later in life.  ICRs are also a potentially powerful diagnostic tool, as their methylation is established at the single-cell stage of development, and maintained consistently in all subsequent cell types.  Therefore, any perturbations to ICR methylation that happen in development should be detectable in blood DNA, as a proxy for cell types inaccessible for screening in people with no diagnosed illness (i.e. brain, liver, heart).

Figure 2 – Parent of origin specific allele methylation at an ICR regulating imprinted allele specific gene expression.

Using whole genome sequencing methods to quantitate methylation, we have identified 1488 regions in the human genome that have methylation patterns consistent with what an ICR is expected to look like (1)(Figure 2).  With these potential ICRs identified, our goal became to identify how the methylation is altered in complex diseases, and in association with specific environmental exposures.  This requires the screening of large case-control sets, in order to make definitive decisions about which ICRs are significantly affected.  The problem is, that sequencing the number of samples (hundreds) required is not time or cost effective, and also requires large amounts of data storage and processing.  Additionally, once the aberrantly methylated regions are identified, and we wish to put this information into use in diagnostic screening, which needs a straightforward method that can be applied and analyzed in a clinical setting.

Custom ICR Methylation Array

To make ICR screening more accessible and economical, we partnered with TruDiagnostic to design a custom Illumina methylation array, targeting the set of human ICRs, that can be broadly implemented for identifying aberrant methylation, and early detection of disease risk.  The array generated can measure methylation at over 22,000 DNA sites, covering 1088 (74%) of the 1488 ICRs we have identified, in comparison to the ~10% of ICRs covered by previous methylation arrays.  Using this array on a case-control set of Alzheimer samples, using both autopsy brain tissue, and blood from living patients, we have determined that the array methylation measurements correlate well to the whole genome methylation sequencing.  The ICR methylation measured by the arrays is highly replicable upon repeated sample analysis, and showed the expected intermediate level of methylation expected for ICRs with monoalleleic methylation (2).  These arrays are now being implemented in case-control studies to identify ICRs with altered methylation, pointing the way to new diagnostic tools, and the underlying disease mechanisms that can be targets for treatment.

References

  1.  Jima DD, Skaar DA, Planchart A, Motsinger-Reif A, Cevik SE, Park SS, Cowley M, Wright F, House J, Liu A, Jirtle RL, Hoyo C. Genomic map of candidate human imprint control regions: the imprintome. Epigenetics. 2022 Dec;17(13):1920-1943. doi: 10.1080/15592294.2022.2091815. Epub 2022 Jul 4. PMID: 35786392; PMCID: PMC9665137.
  2. Carreras-Gallo N, Dwaraka VB, Jima DD, Skaar DA, Mendez TL, Planchart A, Zhou W, Jirtle RL, Smith R, Hoyo C. Creation and Validation of the First Infinium DNA Methylation Array for the Human Imprintome. bioRxiv [Preprint]. 2024 Jan 16:2024.01.15.575646. doi: 10.1101/2024.01.15.575646. PMID: 38293193; PMCID: PMC10827131.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in