The genetic code, long understood to be comprised of four bases, may be more complex than previously thought. Researchers are increasingly recognizing the importance of epigenetic markers – modifications to DNA that don’t change the sequence itself but influence gene expression – and a new sequencing method promises a more complete picture of this intricate system. This breakthrough allows scientists to read both the genetic sequence and its epigenetic modifications, specifically differentiating between 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC), in a single pass.
Understanding these epigenetic changes is crucial, as 5mC plays a key role in silencing genes and defining cell identities. However, accurately detecting 5mC has been challenging due to its similarity to 5hmC, a related modification involved in gene activation. Existing methods often struggled to distinguish between the two, creating a “chemical blind spot,” according to researchers. Now, a team at the University of Pennsylvania has developed a technique called integrated sequencing to overcome this hurdle.
Integrated Sequencing: A New Approach to Decoding the Epigenome
The new method, detailed in the Journal of the American Chemical Society (DOI: 10.1021/jacs.5c18450), involves creating a DNA “hairpin duplex.” This process copies each short DNA sequence, converting the cytosine, 5mC, and 5hmC on the new strand into an analog that resists deamination – a chemical process that alters the base. On the original DNA strand, researchers then selectively deaminate either unmodified cytosine or both unmodified cytosine and 5mC. By sequencing both strands of this duplex, they can simultaneously recover the complete DNA sequence and identify the epigenetic markers present.
This approach addresses a limitation of previous methods. Earlier techniques relied on enzymes called deaminases, which convert cytosine to uracil but don’t affect 5hmC. While useful for identifying 5hmC, this process sacrificed genetic information by reducing the effective genetic code to three letters. “You lose genetic information to gain the epigenetic information,” explained chemist Shankar Balasubramanian to Chemical & Engineering News in 2022.
Building on Past Discoveries
The development of this new technique builds on decades of research into epigenetic modifications. In 2009, researchers discovered that 5mC could be oxidized into 5hmC by a specific enzyme, a finding that hinted at a more complex role for these modifications than previously understood. However, early methods like bisulfite sequencing couldn’t differentiate between 5mC and 5hmC. As noted in a 2009 Genome Research study (PMC3807530), accurately mapping these modified bases at a genome-wide scale was a significant challenge until the advent of next-generation sequencing technologies.
Christian Loo, a graduate student who worked on the integrated sequencing method, emphasized the power of simultaneously capturing both sequence and modification information. “There are methods where you can computationally overlay different profiles, but if you have a method that can actually directly link information, that’s incredibly powerful,” he said.
Potential Applications in Cancer Diagnostics
The researchers envision a range of applications for their new method, particularly in cell-free cancer diagnostics. These tests aim to detect rare mutant DNA molecules from cancer cells amidst a large background of healthy cell DNA. The epigenome of these mutant molecules could provide valuable clues about the cancer’s origin. Chunxiao Song, who recently published a method for differentiating 5mC and 5hmC in single-cell DNA (DOI: 10.1186/s13059-025-03708-1), described the integrated sequencing method as “a useful addition to and complement of existing reported and commercial copy-strand–based approaches for DNA modification analysis.”
As research continues, this new sequencing technique promises to unlock a deeper understanding of the epigenome and its role in health and disease. The ability to accurately read both the genetic code and its epigenetic modifications represents a significant step forward in the field of genomics.
Disclaimer: This article provides information for general knowledge and informational purposes only, and does not constitute medical advice. It is essential to consult with a qualified healthcare professional for any health concerns or before making any decisions related to your health or treatment.
What are your thoughts on the potential of integrated sequencing to revolutionize cancer diagnostics? Share your comments below!