Unmasking Cancer’s Microbial Allies: A Mega-Analysis of the Vaginal Microbiome in Cervical Cancer

Cervical cancer is almost always linked to HPV, but not all infections become cancer. Our mega-analysis reveals how shifts in the vaginal microbiome may influence this trajectory, uncovering microbial biomarkers and functional pathways with diagnostic potential (link to paper: https://rdcu.be/eAuwO).

Published in Microbiology

Explore the Research

BioMed Central

Insights into the tripartite relationship between cervical cancer, human papillomavirus, and the vaginal microbiome: a mega-analysis - Human Genomics

Background: Cervical cancer (CC) is the fourth most prevalent malignancy among women worldwide, where 99.7% of the cases are linked to persistent human papillomavirus (HPV) infections. While emerging evidence suggests a role for vaginal microbiome dysbiosis in HPV-driven CC, the specific microbial alterations and their functional implications remain unclear. Inconsistencies in identifying specific microbial signatures (largely due to heterogeneous study designs, targeted 16S rRNA regions, and data processing methods) have limited the generalizability of existing findings. To address these challenges, we conducted a standardized mega-analysis using a compositionality-aware approach to ensure consistency and minimize technical bias across studies.

Results: Our mega-analysis consolidates findings from five case–control 16S rRNA amplicon sequencing studies, encompassing 215 samples. Compared to healthy controls, CC patients exhibited significantly higher alpha diversity (Shannon index, p < 0.005) and a shift from a Lactobacillus-dominant to a polymicrobial vaginal microbiome. This microbial dysbiosis was characterized by an increased abundance of Porphyromonadaceae, particularly Porphyromonas asaccharolytica, and other anaerobic bacterial species such as Campylobacter ureolyticus, Peptococcus niger, and Anaerococcus obesiensis (FDR < 0.05). Functional profiling of the altered microbiome revealed enrichment in pathways associated with chronic inflammation, fatty acid biosynthesis, amino acid metabolism, cellular proliferation, invasion, and metastasis.

Conclusions: This mega-analysis presents the most methodologically homogeneous study to date of CC–associated vaginal microbiome using publicly available 16S datasets. Our findings not only deepen our understanding of microbial influences on CC but also pave the way for novel diagnostic and therapeutic approaches, potentially enhancing patient outcomes in CC care. These insights open new avenues for clinical interventions that extend beyond conventional HPV-centric strategies.

Introduction

Cervical cancer (CC) remains the fourth most common malignancy among women worldwide, with over 600,000 new cases and more than 340,000 deaths each year. Persistent infection with high-risk human papillomavirus (HPV) is the primary driver of CC development, yet infection alone is insufficient to cause malignancy. The majority of HPV infections resolve spontaneously, suggesting that additional biological factors modulate progression to high-grade lesions and invasive cancer.

Recent research has pointed to the vaginal microbiome as one such factor. A healthy vaginal ecosystem is typically dominated by Lactobacillus species, which maintain a low pH, produce antimicrobial compounds, and contribute to mucosal immune defense. In contrast, dysbiotic states—characterized by reduced Lactobacillus abundance and increased anaerobic diversity—have been associated with higher HPV persistence, chronic inflammation, and epithelial barrier disruption. These microenvironmental changes may facilitate viral integration, immune evasion, and carcinogenesis.

However, the literature on CC-associated microbiome shifts is fragmented. Studies vary in their sampling strategies, sequencing platforms, targeted 16S rRNA regions, and analytical approaches, leading to inconsistent and sometimes contradictory findings. Without harmonized data analysis, it is difficult to distinguish genuine biological patterns from methodological noise.

To overcome these challenges, we performed a compositionality-aware mega-analysis of all publicly available CC microbiome datasets meeting strict inclusion criteria. By reprocessing raw sequence data from multiple studies through a unified bioinformatic pipeline, we minimized technical bias and maximized comparability. Our goal was to identify reproducible microbial signatures and functional pathways linked to CC and its HPV-positive subsets—findings that could lay the groundwork for novel diagnostic and preventive strategies.
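For readers unfamiliar with the term, "compositionality-aware" refers to treating sequencing counts as relative rather than absolute quantities. One widely used transform of this kind is the centered log-ratio (CLR). The sketch below is only an illustration of that general idea, not the pipeline used in the paper; the function name `clr`, the pseudocount of 0.5, and the sample counts are all assumptions made for the example.

```python
import math

def clr(counts, pseudocount=0.5):
    """Centered log-ratio transform of one sample's taxon counts.

    A pseudocount is added so that zero counts do not break the logarithm
    (0.5 is an illustrative choice, not a value taken from the paper).
    """
    logs = [math.log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)  # log of the geometric mean
    return [v - mean_log for v in logs]

# Hypothetical counts: the same community sequenced at two different depths.
sample_a = [900, 50, 30, 20]
sample_b = [9000, 500, 300, 200]  # identical proportions, 10x more reads

# Without a pseudocount, CLR values depend only on the proportions,
# so sequencing depth drops out of the comparison.
clr_a = clr(sample_a, pseudocount=0)
clr_b = clr(sample_b, pseudocount=0)
print(clr_a)
print(clr_b)
```

Because CLR values are log-ratios against the sample's own geometric mean, they always sum to zero within a sample, which is one reason the transform is popular when pooling studies with very different read depths.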

Key Findings 

Our standardized analysis revealed a consistent shift in CC microbiota from a Lactobacillus-dominated community to a more diverse, anaerobe-rich profile. Alpha diversity was significantly higher in CC, and enriched taxa included Porphyromonas asaccharolytica, Campylobacter ureolyticus, Peptococcus niger, and Anaerococcus obesiensis. Concurrently, protective Lactobacillus species, particularly L. crispatus, were markedly depleted, especially in HPV-positive CC.
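To make the alpha diversity result more concrete, here is a minimal sketch of the Shannon index, H = -Σ p_i ln(p_i), where p_i is the proportion of reads assigned to taxon i. The two count vectors are hypothetical and chosen only to contrast a community dominated by a single taxon with a more even one; they are not data from the study.

```python
import math

def shannon_index(counts):
    """Shannon diversity (natural log) from a list of taxon counts."""
    total = sum(counts)
    # Taxa with zero reads contribute nothing to the sum and are skipped.
    props = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p) for p in props)

# Hypothetical profiles with five taxa each (illustrative numbers only).
lactobacillus_dominant = [950, 20, 15, 10, 5]   # one taxon holds 95% of reads
polymicrobial = [220, 210, 200, 190, 180]       # reads spread almost evenly

h_dominant = shannon_index(lactobacillus_dominant)
h_poly = shannon_index(polymicrobial)
print(h_dominant, h_poly)
```

The even community scores higher because the index rewards both the number of taxa and how evenly reads are distributed among them, which is the pattern the shift away from Lactobacillus dominance would be expected to produce.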

Functional predictions indicated enrichment in pathways related to fatty acid biosynthesis, oxidative phosphorylation, and altered amino acid metabolism—changes consistent with known cancer biology. Several of these pathways mirrored host transcriptomic profiles from independent CC datasets, pointing to possible microbial–host metabolic convergence.

Machine learning models trained on these microbial profiles achieved high predictive performance (up to 93% accuracy with XGBoost), underscoring the translational potential for microbiome-based diagnostics.

Future Directions

We faced limitations in geographic diversity, sample size, and clinical metadata. Our next steps involve expanding to more diverse populations, integrating shotgun metagenomics and metabolomics, and exploring causality through experimental models. Ultimately, we aim to extend this analytical framework to other female-related cancers, seeking shared microbial “fingerprints” and actionable biomarkers.

Final Remark

This first paper in our female cancer microbiome project establishes a reproducible foundation for studying microbial influences on HPV-driven cancers. By harmonizing data and revealing robust microbial and functional signatures, we open pathways for targeted diagnostics and prevention strategies—not only in cervical cancer, but across the spectrum of female malignancies. Read the full paper here: https://rdcu.be/eAuwO
