VISEM-Tracking, a human spermatozoa tracking dataset

Manual sperm motility assessment is challenging; hence, computer-assisted sperm analysis (CASA) is used. We introduce VISEM-Tracking dataset with expert annotations, enabling deep learning models like YOLOv5 for accurate sperm analysis, improving accuracy and reliability in sperm motility evaluation

Manual sperm motility assessment using microscopy is challenging. It requires extensive training, making computer-assisted sperm analysis (CASA) a popular alternative. However, supervised machine learning needs more data to enhance accuracy and reliability. To address this,  researchers have introduced the VISEM-Tracking dataset, containing 20 video recordings (29,196 frames) of wet sperm preparations with manually annotated bounding-box coordinates and expert-analyzed sperm characteristics.

The VISEM-Tracking dataset stands out as a unique and invaluable resource for advancing computer-assisted sperm analysis by providing a comprehensive collection of diverse, high-quality sperm video recordings with expert annotations. This addresses the current lack of data and facilitates the development of more accurate and reliable machine-learning models in sperm motility assessment.

The YOLOv5 deep learning model, trained on the VISEM-Tracking dataset, has shown promising baseline sperm detection performance, indicating the dataset's potential for training complex models to analyze spermatozoa.

VISEM-Tracking is available on Zenodo under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, featuring 30-second videos from 20 different patients with annotated bounding boxes. Additional 30-second video clips from both the annotated and unlabelled portions of the VISEM dataset are also provided, making it suitable for future research in semi- or self-supervised learning.

Sample frames from the VISEM-Tracking dataset  with corresponding bounding boxes

Access the dataset here:

Read the full paper: 

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Subscribe to the Topic

Research Data
Research Communities > Community > Research Data

Related Collections

With collections, you can get published faster and increase your visibility.

Medical imaging data for digital diagnostics

This Collection presents a series of articles describing annotated datasets of medical images and video. All medical specialities are considered and data can be derived from study participants, tissue samples, electronic health records (EHRs) or other sources.

Publishing Model: Open Access

Deadline: Dec 20, 2023

Meteorology and hydroclimate observations and models

This Collection presents a series of articles describing hydroclimate datasets, including data sourced from remote sensing, primary measurements or theoretical models. Datasets are presented without analyses in order to support policy development and further research, with Data Descriptors providing full details of data sources, modelling, and any associated code.

Publishing Model: Open Access

Deadline: Dec 15, 2023