Teaching AI to Spot Brain Damage in Real-World MRI Scans

Clinical brain scans vary widely, making automated detection of white matter damage unreliable. Using real-world MRI data, we built and tested new AI models that learn to handle this variability, enabling more accurate and clinically useful brain lesion detection.
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

SpringerLink
SpringerLink SpringerLink

Benchmark White Matter Hyperintensity Segmentation Methods Fail on Heterogeneous Clinical MRI: A New Dataset and Deep Learning–Based Solutions - Journal of Imaging Informatics in Medicine

Existing automated methods for white matter hyperintensity (WMH) segmentation often generalize poorly to heterogeneous clinical MRI due to variability in scanner types, field strengths, and protocols. To address this challenge, we introduce a diverse clinical WMH dataset and evaluate two deep learning–based solutions: an nnU-Net model trained directly on the data and a foundation model adapted through fine-tuning. This retrospective study included 195 routine brain MRI scans acquired from 71 scanners between June 2006 and October 2022. Participants ranged in age from 46 to 87 years (median, 70 years; 94 females). WMHs were manually annotated by an experienced rater and reviewed under neuroradiologist supervision. Several benchmark segmentation methods were evaluated against these annotations. We then developed Robust-WMH-UNet by training nnU-Net on the dataset and Robust-WMH-SAM by fine-tuning MedSAM, a vision foundation model. Benchmark methods demonstrated poor generalization, frequently missing small lesions and producing false positives in anatomically complex regions such as the septum pellucidum. Robust-WMH-UNet achieved superior accuracy (median Dice similarity coefficient [DSC], 0.768) with improved specificity, while Robust-WMH-SAM attained competitive performance (median DSC up to 0.750) after only limited training, reaching acceptable accuracy within a single epoch. This new clinically representative dataset provides a strong foundation for developing robust WMH algorithms, enabling fair cross-method comparisons, and supporting the translation of segmentation models into routine clinical practice.

Brain scans from hospitals don’t all look the same—they come from different machines, settings, and years. This makes it hard for AI tools to reliably detect small areas of brain damage linked to aging and disease. In this work, we used real clinical MRI scans to train smarter AI models that can adapt to this variability, bringing automated brain analysis closer to everyday medical use.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Alzheimer's disease
Life Sciences > Health Sciences > Clinical Medicine > Neurology > Neurological Disorders > Neurodegenerative diseases > Alzheimer's disease
Magnetic Resonance Imaging
Life Sciences > Health Sciences > Radiology > Nuclear Medicine > Magnetic Resonance Imaging
Medical Imaging
Technology and Engineering > Biological and Physical Engineering > Biomedical Engineering and Bioengineering > Biomedical Devices and Instrumentation > Medical Imaging
Neuroradiology
Life Sciences > Health Sciences > Radiology > Neuroradiology
White matter injury
Life Sciences > Health Sciences > Clinical Medicine > Neurology > Neurological Disorders > Brain Injuries > White matter injury
Stroke
Life Sciences > Health Sciences > Clinical Medicine > Diseases > Cardiovascular Diseases > Vascular Diseases > Cerebrovascular Disorders > Stroke