Behind the Paper

DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 40 Subclasses

We introduce a large-scale, richly annotated dermatoscopic image dataset of 12,345 images across 40 skin lesion subclasses, designed to enhance AI-based diagnostics, support clinical insights, and advance research in dermatology.

Published in Computational Sciences and General & Internal Medicine

Apr 08, 2025

Abdurrahim Yilmaz

PhD Student, Imperial College London

DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 40 Subclasses

Liked by Yijia Li and 2 others

Explore the Research

In this blog post, we share the story behind our recently published paper in Nature Scientific Data, titled “DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 40 Subclasses” This post walks through our journey — from the initial idea to the final publication — highlighting the motivations, unexpected turns, and challenges we encountered while building one of the most detailed dermatoscopic datasets to date.

What led us to pursue this study?

In dermatology, diagnosing a skin lesion isn’t just about looking at a single image — it’s about context, experience, and a structured clinical decision-making process. Dermatologists often rely on a hierarchical approach: Is the lesion melanocytic or not? Benign or malignant? Which subtype could it be?

Higher-level classes, such as distinguishing between melanocytic and non-melanocytic lesions or benign versus malignant categories, play a critical role in AI-based skin lesion analysis. They mirror the stepwise decision-making process used by dermatologists and help AI models learn more efficiently by providing a structured learning path. This hierarchical approach not only improves classification accuracy but also supports more interpretable and clinically relevant predictions, making AI tools safer and more reliable in real-world medical settings.

As researchers working at the intersection of clinical dermatology and AI, we saw an opportunity to replicate this clinical reasoning in a machine-learning-compatible format. Existing public datasets were valuable, but they didn’t reflect the way decisions are made in practice. Many lacked subclass annotations or the nuanced distinctions that affect real-world diagnoses.

So, we created our own: a large-scale dataset annotated using a three-level taxonomy tree, carefully designed to support both human learning and AI training. Our goal was simple — to bridge the gap between the way clinicians think and the way algorithms learn.

What surprised us along the way?

We began with a clear taxonomy structure, aiming to standardize annotation across the dataset. But as we dove into labeling, we quickly realized that theory and practice don’t always align. Some categories were too broad, others too specific — and certain distinctions didn’t translate well between clinical and AI perspectives.

This led to a dynamic, iterative process: updating the taxonomy during annotation, balancing clinical accuracy with computational utility. It reminded us how different — and yet complementary — the worlds of medicine and AI can be. What started as a rigid tree became a living structure, shaped by both domain knowledge and dataset realities.

What did we do?

This study was all about data: collecting it, structuring it, and making it meaningful.

We gathered 12,345 high-resolution dermatoscopic images from three dermatology centers in Türkiye, using a mix of professional dermatoscopy systems and mobile-based tools. These images were collected over more than a decade and include a diverse set of lesions across 40 subclasses.

After collection, we focused on preprocessing, standardization, and most importantly — annotation. Two expert dermatologists reviewed the images, using our evolving taxonomy to label each case. All malignant cases were biopsy-confirmed, and benign cases were verified through follow-up records.

To make the dataset more useful for the community, we also included image embeddings from Google’s Derm Foundation Model — providing a ready-to-use feature representation for researchers. And we uploaded the full dataset to the ISIC archive, ensuring long-term open access and visibility.

What are the broader implications?

DERM12345 isn’t just another image dataset — it’s a foundation for building smarter, more clinically aligned AI systems.

By offering a structured hierarchy of skin lesions, the dataset encourages the development of models that reflect the way dermatologists actually work. Hierarchical classification — where models learn not just what a lesion is, but how it relates to broader diagnostic categories — is a promising direction for making AI more interpretable and clinically reliable.

Another key contribution is the representation of the Turkish population, which has historically been underrepresented in open-access dermatological datasets. This makes DERM12345 a valuable tool for evaluating how well existing AI models generalize to this demographic. Researchers can use the dataset to test algorithm performance on data from Türkiye and potentially identify population-specific limitations, biases, or misclassifications — a crucial step toward more inclusive, fair, and globally applicable diagnostic tools.

Additionally, the inclusion of rare and visually similar lesion types provides a robust challenge for model development, while the dataset’s public availability through the ISIC archive supports reproducibility, benchmarking, and ongoing research in dermatology and medical AI.

We hope this work helps move the field toward AI tools that are not just accurate, but trustworthy, explainable, and clinically meaningful.

Abdurrahim Yilmaz

PhD Student, Imperial College London

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Dermatology

Life Sciences > Health Sciences > Clinical Medicine > Dermatology

Artificial Intelligence

Mathematics and Computing > Computer Science > Artificial Intelligence

Computer Imaging, Vision, Pattern Recognition and Graphics

Mathematics and Computing > Computer Science > Computer Imaging, Vision, Pattern Recognition and Graphics

Skin Cancer

Life Sciences > Health Sciences > Clinical Medicine > Diseases > Cancers > Skin Cancer

Scientific Data

Scientific Data

A peer-reviewed, open-access journal for descriptions of datasets, and research that advances the sharing and reuse of scientific data.

More about the journal

Related Collections

With Collections, you can get published faster and increase your visibility.

Genomics in freshwater and marine science

This Scientific Data collection of articles focuses on transcriptomic datasets and genome assemblies from freshwater and marine taxa.

Publishing Model: Open Access

Deadline: Jul 23, 2026

Explore this Collection

Genomes of endangered species

This Scientific Data Collection of articles focuses on genome assemblies of endangered or threatened species.

Publishing Model: Open Access

Deadline: Jul 01, 2026

Explore this Collection

Uncovering unpublished radiocarbon data from Late Quaternary megafauna fossils

News and Opinion

March Highlights from Mathematics, Physical and Applied Sciences Communities

Behind the Paper

Analyzing the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) with a novel CITES COPs attendee-level data

Behind the Paper

New Collectivism Index Offers Dataset for Exploring Regional Differences and Change Over Time

Behind the Paper

Human neuronal differentiation under Aβ exposure: a single-cell transcriptomic and epigenomic dataset

Cookies

We use cookies to ensure the functionality of our website, to personalize content and advertising, to provide social media features, and to analyze our traffic. If you allow us to do so, we also inform our social media, advertising and analysis partners about your use of our website. You can decide for yourself which categories you want to deny or allow. Please note that based on your settings not all functionalities of the site are available.

Further information can be found in our privacy policy.

DERM12345: A Large, Multisource Dermatoscopic Skin Lesion Dataset with 40 Subclasses

Share this post

Share with...

...or copy the link