From Simple Labels to Descriptive Sentences in Leukemia Image Classification
Published in Bioengineering & Biotechnology, Computational Sciences, and Immunology
The idea of this paper was very simple. In traditional leukemia classification, each cell image is usually assigned to only one class such as AML, CML, ALL, or CLL. I thought this method is limited because a single label cannot describe the full meaning of a medical image.
So, we tried a different idea. Instead of assigning only one class label, we connected each image to a short descriptive sentence.
For instance:
AML → “Large immature blood cells with abnormal structure.”
CML → “High number of abnormal white blood cells in blood cells.”
ALL → “Fast-growing immature lymphocyte cells in the blood.”
CLL → “Small abnormal lymphocyte cells with slow progression.”
And other similar sentence like that.
In this work, we used both text-to-image and image-to-text learning. The model tried to understand the relation between medical images and written descriptions at the same time. We also used GAN-based methods to help balance the dataset because some classes had fewer images than others.
One challenge in this project was creating meaningful text descriptions that were simple but still useful for the model. Another challenge was the limited and unbalanced medical dataset. Training vision-language models also needed careful tuning and many experiments.
The model performance was evaluated using the metrics discussed in the paper. The final results were around the 70% range. This is not considered perfect or extremely high accuracy. However, the method is more practical and informative than traditional classification systems that only assign simple labels like 0, 1, 2, or 3.
I think the interesting part of this project was trying to make AI understand medical images in a more human-like way using language. This project showed that combining text and image understanding can open new directions in medical AI research.
In the future, this idea can be improved using larger datasets, better text descriptions, and stronger multimodal models. It may also help doctors better understand AI decisions instead of only seeing simple class outputs.
Link of paper: https://link.springer.com/article/10.1007/s44163-026-01239-7
Also links of other researches I contributed:
1- Advanced deep learning framework for automated hematological malignancy classification: Integrating FCMAE V2-WPAT with ACDB-GAN for enhanced leukemia subtype detection
https://doi.org/10.1007/s11760-025-05096-2
2- Constructive learning: A high-performance framework for fetal head circumference estimation
https://doi.org/10.1007/s11227-026-08293-z
3- Multimodal vision-language framework for text-guided leukemia classification using advanced deep learning architectures
https://doi.org/10.1007/s44163-026-01239-7
4- Advancing WBC classification: A hybrid ConvNeXtV2–Swin Transformer framework with R3GAN data balancing and CLAHE preprocessing
Follow the Topic
-
Signal, Image and Video Processing
-
Discover Artificial Intelligence
This is a transdisciplinary, international journal that publishes papers on all aspects of the theory, the methodology and the applications of artificial intelligence (AI).
-
Journal of Imaging Informatics in Medicine
This journal enhances the exchange of knowledge encompassed by the general topic of Imaging Informatics in Medicine including, but not limited to, research and practice in clinical, engineering, information technologies and techniques in all medical imaging environments.
Related Collections
With Collections, you can get published faster and increase your visibility.
Enhancing Trust in Healthcare: Implementing Explainable AI
Healthcare increasingly relies on Artificial Intelligence (AI) to assist in various tasks, including decision-making, diagnosis, and treatment planning. However, integrating AI into healthcare presents challenges. These are primarily related to enhancing trust in its trustworthiness, which encompasses aspects such as transparency, fairness, privacy, safety, accountability, and effectiveness. Patients, doctors, stakeholders, and society need to have confidence in the ability of AI systems to deliver trustworthy healthcare. Explainable AI (XAI) is a critical tool that provides insights into AI decisions, making them more comprehensible (i.e., explainable/interpretable) and thus contributing to their trustworthiness. This topical collection explores the contribution of XAI in ensuring the trustworthiness of healthcare AI and enhancing the trust of all involved parties. In particular, the topical collection seeks to investigate the impact of trustworthiness on patient acceptance, clinician adoption, and system effectiveness. It also delves into recent advancements in making healthcare AI decisions trustworthy, especially in complex scenarios. Furthermore, it underscores the real-world applications of XAI in healthcare and addresses ethical considerations tied to diverse aspects such as transparency, fairness, and accountability.
We invite contributions to research into the theoretical underpinnings of XAI in healthcare and its applications. Specifically, we solicit original (interdisciplinary) research articles that present novel methods, share empirical studies, or present insightful case reports. We also welcome comprehensive reviews of the existing literature on XAI in healthcare, offering unique perspectives on the challenges, opportunities, and future trajectories. Furthermore, we are interested in practical implementations that showcase real-world, trustworthy AI-driven systems for healthcare delivery that highlight lessons learned.
We invite submissions related to the following topics (but not limited to):
- Theoretical foundations and practical applications of trustworthy healthcare AI: from design and development to deployment and integration.
- Transparency and responsibility of healthcare AI.
- Fairness and bias mitigation.
- Patient engagement.
- Clinical decision support.
- Patient safety.
- Privacy preservation.
- Clinical validation.
- Ethical, regulatory, and legal compliance.
Publishing Model: Open Access
Deadline: Sep 10, 2026
Transforming Education through Artificial Intelligence: Opportunities, Challenges, and Future Directions
Artificial Intelligence (AI) is rapidly changing the educational field by enabling personalized learning, intelligent tutoring systems, automated assessments, learning analytics, and administrative automation.
This collection invites original research, systematic reviews, and visionary perspectives on the transformative impact of AI in education. It aims to explore how AI technologies can enhance equity, inclusion, and efficiency in educational settings across different contexts, including higher education, K-12, vocational training, and lifelong learning. This collection will address technical, pedagogical, ethical, and policy aspects, fostering interdisciplinary perspectives and evidence-based insights.
This Collection supports and amplifies research related to SDG 4 and SDG 9.
Keywords: Artificial Intelligence, AI in Education, Educational Technology, Data Analytics, AI Ethics
Publishing Model: Open Access
Deadline: Nov 30, 2026
Please sign in or register for FREE
If you are a registered user on Research Communities by Springer Nature, please sign in