Full Spectrum CRISPR Analysis: Rapidly Verify On-Target and Off-Target Edits with PED

Full Spectrum CRISPR Analysis: Rapidly Verify On-Target and Off-Target Edits with PED
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

BioMed Central
BioMed Central BioMed Central

Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data - BMC Bioinformatics

Background Accurate detection of polymorphisms with a next generation sequencer data is an important element of current genetic analysis. However, there is still no detection pipeline that is completely reliable. Result We demonstrate two new detection methods of polymorphisms focusing on the Polymorphic Edge (PED). In the matching between two homologous sequences, the first mismatched base to appear is the SNP, or the edge of the structural variation. The first method is based on k-mers from short reads and detects polymorphic edges with k-mers for which there is no match between target and control, making it possible to detect SNPs by direct comparison of short-reads in two datasets (target and control) without a reference genome sequence. The second method is based on bidirectional alignment to detect polymorphic edges, not only SNPs but also insertions, deletions, inversions and translocations. Using these two methods, we succeed in making a high-quality comparison map between rice cultivars showing good match to the theoretical value of introgression, and in detecting specific large deletions across cultivars. Conclusions Using Polymorphic Edge Detection (PED), the k-mer method is able to detect SNPs by direct comparison of short-reads in two datasets without genomic alignment step, and the bidirectional alignment method is able to detect SNPs and structural variations from even single-end short-reads. The PED is an efficient tool to obtain accurate data for both SNPs and structural variations. Availability The PED software is available at: https://github.com/akiomiyao/ped .

My previous blog post introduced bidirectional alignment algorithm PED (Polymorphic Edge Detection), a method that aligns a reference genome with Next-Generation Sequencing (NGS) reads from both directions and detects the edges of genome portions where the mutation has occurred.


A key advantage of this approach is its ability to identify large deletion mutations often missed by other programs. Furthermore, it can detect various other mutations, including single-base substitutions, insertions, translocations, and inversions.


This blog post demonstrates how PED can be used for sequence analysis of organisms that have undergone genome editing with CRISPR/Cas9.


To illustrate, I began by searching for "CRISPR" on NCBI's Sequence Read Archive (SRA) to find relevant sequence data. I found a dataset from Umeå University titled "Genotyping of C. elegans mutants - CRISPR/Cas9 of all GPCR and neuropeptide genes" and downloaded it from NCBI.

The sequence data for a specific sample, ERR11472167, was downloaded using the fastq-dump command from the SRA Toolkit provided by NCBI:

fastq-dump ERR11472167

After saving the downloaded files to the ERR11472167/read directory, the PED program was run using the following command:

perl ped.pl target=ERR11472167,ref=WBcel235

Here, WBcel235 refers to the reference genome sequence for the nematode C. elegans. Subsequently, the snpEff program was used to identify the affected genes and the types of mutations, generating a list of these findings (Figure 1).

Figure 1. Mutation sites detected from ERR11472167 NGS sequence data

According to NCBI's BioSample database, the ERR11472167 sample was reported to have intended mutations in genes WBGene00005318 and WBGene00005319. As highlighted in red in Figure 1, the PED program confirmed mutations in the targeted genes (smg-10/WBGene00005318 and dsh-2/WBGene00000102), demonstrating its ability to verify successful genome editing.


Importantly, PED analysis also revealed numerous off-target mutations in unintended genomic locations. A total of sixty-two off-target mutations were identified in this specific C. elegans line.

Figure 2. Mutation sites detected from ERR11472179 NGS sequence data

Similarly, for sample ERR11472179, the SRA database indicated that gene WBGene00005641 (the sro-1 gene) was the target for genome editing.

As shown in Figure 2 (with sro-1/WBGene00005641 highlighted in red), PED confirmed a frameshift mutation in the sro-1 gene. However, it also detected 60 additional off-target mutations in this sample.


These examples demonstrate that the PED program is a valuable tool not only for verifying intended edits but also for comprehensively checking for off-target mutations in genome-edited organisms. We encourage researchers to try PED for their analyses.

References

Miyao, A., Kiyomiya, J.S., Iida, K. et al. Polymorphic edge detection (PED): two efficient methods of polymorphism detection from next-generation sequencing data. BMC Bioinformatics 20, 362 (2019). https://doi.org/10.1186/s12859-019-2955-6

https://github.com/akiomiyao/ped

Cingolani, P., Platts, A., Wang, leL., Coon, M., Nguyen, T., Wang, L., Land, S. J., Lu, X., & Ruden, D. M. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly6(2), 80–92. https://doi.org/10.4161/fly.19695

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Go to the profile of Ilkka Havukkala
about 2 months ago

Worth having a look. PED has many other applications, too.

Go to the profile of Akio Miyao
about 2 months ago

Hi Ilkka! Thank you for your comment. I will write blogs for other applications.

Follow the Topic

Genetics and Genomics
Life Sciences > Biological Sciences > Genetics and Genomics
Cancer Genetics and Genomics
Life Sciences > Biological Sciences > Cancer Biology > Cancer Genetics and Genomics
Biotechnology
Life Sciences > Biological Sciences > Biotechnology
  • BMC Bioinformatics BMC Bioinformatics

    This is an open access, peer-reviewed journal that considers articles describing novel computational algorithms and software, models and tools, including statistical methods, machine learning and artificial intelligence, as well as systems biology.

Related Collections

With collections, you can get published faster and increase your visibility.

Extracellular vesicle research

BMC Bioinformatics is welcoming submissions to our Collection on Extracellular vesicles research.

BMC Bioinformatics is welcoming submissions to our Collection on Extracellular vesicles research. Extracellular vesicles (EVs) are are small lipid bilayer-delimited particles released by cells that play crucial roles in intercellular communication and various physiological processes. The study of EVs has gained significant attention due to their potential as biomarkers for disease diagnosis, therapeutic targets and drug delivery systems. Advanced bioinformatics tools are essential for analyzing EV data, identifying EV-associated molecules, and understanding their biological functions.

This Collection welcomes submissions on the development of new computational and/or statistical approaches for the study of extracellular vesicles. We encourage contributions that highlight innovative methods for detecting and characterizing EVs and elucidating the molecular mechanisms underlying EV biogenesis and function.

All manuscripts submitted to this journal, including those submitted to collections and special issues, are assessed in line with our editorial policies and the journal’s peer-review process. Reviewers and editors are required to declare competing interests and can be excluded from the peer review process if a competing interest exists.

Publishing Model: Open Access

Deadline: Mar 30, 2026

Bioinformatics platforms for the study and detection of anti-microbial resistance

BMC Bioinformatics is welcoming submissions to our Collection on Bioinformatics platforms for the study and detection of antimicrobial resistance.

Antimicrobial resistance (AMR) poses a significant threat to global health, necessitating advanced tools to detect and analyze resistance mechanisms. Bioinformatics platforms can help researchers identify AMR determinants within genomic and metagenomic data, providing insights into the spread and evolution of resistance genes. These platforms are important for public health surveillance, guiding the development of targeted interventions and informing policy decisions.

This Collection welcomes submissions on the development of new computational and/or statistical approaches for the study of antimicrobial resistance. We encourage contributions that highlight innovative methods for detecting resistance genes and integrating genomic data with phenotypic information.

All manuscripts submitted to this journal, including those submitted to collections and special issues, are assessed in line with our editorial policies and the journal’s peer-review process. Reviewers and editors are required to declare competing interests and can be excluded from the peer review process if a competing interest exists.

Publishing Model: Open Access

Deadline: Nov 17, 2025