Behind the Paper

A high-quality dataset featuring classified and annotated cervical spine X-ray atlas

The Cervical Spine X-ray Atlas (CSXA) is a dataset intended to help research communities replicate experiments and advance the field of medical imaging of the cervical spine. It includes 4963 raw PNG images and 4963 annotated images in JSON format, each enriched with supporting clinical and measurement information.

Published in Computational Sciences

Jun 13, 2024

Yu Ran and Yu Ran

Background and Summary

Recent research in computational imaging largely focuses on developing machine learning (ML) techniques for image recognition in the medical field, which requires large-scale and high-quality training datasets consisting of raw images and annotated images. However, such datasets for cervical spine X-rays are notably scarce. To fill the gap, we have developed the Cervical Spine X-ray Atlas (CSXA), an open-access dataset containing 4963 raw PNG images and an equal number of annotated images in JSON format. Each image is meticulously annotated with details including gender, age, pixel equivalent, and classifications for symptomatic and asymptomatic cases, as well as cervical curvature categorization and 118 quantitative parameters.

Research Methods and Behind-the-Scenes

Image Collection and Annotation Process

The raw images were collected from multiple sources, including routine clinical visits and health checkups at Dongzhimen Hospital, Beijing University of Chinese Medicine. Image annotation was a collaborative effort involving four orthopedic spine surgeons with an average of six years of experience. The process involved three rounds of cross-checking to ensure accuracy and consistency. The annotated keypoints included critical anatomical landmarks such as the corners of the vertebral bodies and the centroid of each vertebra. These points were selected for their importance in diagnosing and assessing cervical spine diseases.

Algorithm Development

An advanced algorithm was developed to convert 23 keypoints from the annotated images into 77 quantitative parameters. These parameters are essential for diagnosing and treating cervical spine diseases. The algorithm uses Python scripts to compute the pixel equivalent for each image, ensuring that the quantitative measurements are accurate and reliable.
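As a rough illustration of how keypoints can become quantitative parameters, the sketch below computes one distance and one angle from pixel coordinates. It is not the CSXA algorithm itself: the function names are hypothetical, and it assumes the pixel equivalent is expressed in millimetres per pixel and that keypoints are `(x, y)` pairs in pixel units.

```python
import math


def distance_mm(p, q, pixel_equivalent):
    """Euclidean distance between two keypoints, scaled from pixels to mm.

    Assumes pixel_equivalent is given in millimetres per pixel.
    """
    return math.hypot(q[0] - p[0], q[1] - p[1]) * pixel_equivalent


def angle_between_segments_deg(a1, a2, b1, b2):
    """Angle in degrees (0 to 180) between segment a1->a2 and segment b1->b2."""
    ang_a = math.atan2(a2[1] - a1[1], a2[0] - a1[0])
    ang_b = math.atan2(b2[1] - b1[1], b2[0] - b1[0])
    diff = abs(math.degrees(ang_a - ang_b)) % 360.0
    # Fold reflex angles back into the 0-180 range.
    return 360.0 - diff if diff > 180.0 else diff


# Hypothetical keypoints, purely for illustration.
width = distance_mm((0.0, 0.0), (3.0, 4.0), pixel_equivalent=0.5)
tilt = angle_between_segments_deg((0, 0), (1, 0), (0, 0), (0, 1))
```

Scaling every distance by a per-image pixel equivalent is what makes measurements comparable across images with different resolutions; angles need no scaling because they are independent of pixel size.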

Data Validation and Quality Assurance

The annotated data underwent rigorous validation, including manual measurements and cross-checks to ensure consistency. The pixel equivalents were verified using ImageJ software to confirm the accuracy of the computed distances and angles.

Access and Use

The CSXA dataset, algorithm, and related documentation are openly available on GitHub and the Science Data Bank. Researchers are encouraged to use these resources in their studies, contributing to the advancement of computational imaging and machine learning in the medical field.
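For readers starting to work with the downloaded files, a minimal sketch of pairing each raw PNG with its JSON annotation might look like the following. The directory layout, the shared-base-name convention, and the helper names are assumptions made for illustration; the repository documentation is the authority on the actual file structure and JSON fields.

```python
import json
from pathlib import Path


def pair_images_with_annotations(image_dir, annotation_dir):
    """Return {stem: (png_path, json_path)} for files sharing a base name.

    Assumes (hypothetically) that an image and its annotation have the
    same file name apart from the extension.
    """
    images = {p.stem: p for p in Path(image_dir).glob("*.png")}
    annotations = {p.stem: p for p in Path(annotation_dir).glob("*.json")}
    shared = sorted(images.keys() & annotations.keys())
    return {stem: (images[stem], annotations[stem]) for stem in shared}


def load_annotation(json_path):
    """Read one annotation file into a Python object."""
    with open(json_path, encoding="utf-8") as fh:
        return json.load(fh)
```

Files without a counterpart are skipped rather than raising an error, which makes it easy to compare the number of pairs against the expected 4963 as a quick completeness check.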

Relevance to the Community

The CSXA dataset and accompanying algorithm have significant implications for the research community. By providing a comprehensive, high-quality dataset, we aim to facilitate the replication of experiments and the development of new ML techniques for medical imaging. This resource is particularly valuable for researchers focusing on cervical spine diseases, offering a robust foundation for future studies and algorithmic advancements.

Scientific Data

A peer-reviewed, open-access journal for descriptions of datasets, and research that advances the sharing and reuse of scientific data.
