How Can AI Empower Disabled Communities? An Experimental Study of AI-Driven Audio Description for Visual Arts
Published in Computational Sciences and Arts & Humanities
Globally, vision impairment and blindness affect a substantial and growing proportion of the population. According to the World Health Organisation, at least 2.2 billion people worldwide live with some form of near- or distance-vision impairment, and in at least 1 billion of these cases, the condition could have been prevented or has yet to be addressed. As populations age, this number is expected to rise further, intensifying the need for inclusive social and cultural participation. To foster a more equitable global society, it is therefore imperative to address the needs of blind and visually impaired communities, ensuring meaningful access to the arts. Audio Description (AD) is a service that provides verbal narration of key visual elements in media, enabling blind and visually impaired audiences to engage more fully with visual content.
My research interest in AD emerged from my experience teaching a Digital Translation course, where I was introduced to this form of intersemiotic translation from the non-verbal visual channel to the verbal auditory channel. More importantly, I became increasingly interested in the potential of translation as a means of fostering accessibility and contributing to a more inclusive society. While based at the University of Auckland, I learned that a Computer Science Capstone Course was seeking project proposals. I recognised this as an ideal opportunity to explore the intersection of translation studies and accessibility studies, supported by emerging technologies. I therefore proposed a project to develop an application capable of providing automated AD services for visual artworks, particularly paintings. The proposal was selected for the Capstone Course group project in Semester 1, 2025. Together with a team of students, I developed Chromeco, a mobile application that uses AI to generate audio descriptions for paintings.
The app is designed to enhance accessibility to visual arts for blind and visually impaired users by enabling them to generate audio descriptions of paintings in real time. By simply capturing an image of an artwork using a mobile device, users can access automatically generated AD that conveys key visual elements of the painting. The system has been trained using carefully developed linguistic and content-oriented guidelines. In particular, the content framework incorporates genre-specific Artwork-Type Description Guidelines, which ensure that descriptions are sensitive to the stylistic and compositional features of different genres of paintings. To improve both accuracy and usability, the system adopts a human-in-the-loop approach, whereby generated descriptions have been reviewed and refined by our research team. This process allows the AI to produce descriptions that are not only clear and informative but also vivid and engaging across diverse artistic genres. Nevertheless, the application raises important ethical and legal considerations, particularly with regard to the copyright of the artworks being described. Addressing these concerns should be a key priority for future development. It will require collaborative partnerships with museums and art galleries, so that accessibility initiatives can be developed in a manner that is both sustainable and respectful of intellectual property rights.
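For readers curious how the workflow described above (capture an image, apply a genre-specific guideline, have a person review the result) might fit together, here is a minimal Python sketch. It is an illustration only and not Chromeco's actual code: the guideline texts, the function names `build_prompt`, `describe`, and `review`, and the stand-in `fake_model` are all hypothetical, and the real Artwork-Type Description Guidelines are not reproduced in this post.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical guideline texts; the project's real guidelines are not published here.
GENRE_GUIDELINES = {
    "portrait": "Describe the sitter's pose, expression, clothing, and the lighting.",
    "landscape": "Describe the scene from foreground to background, then colour and light.",
    "still life": "Name the objects, their arrangement, textures, and colours.",
}
DEFAULT_GUIDELINE = "Describe the main subject, composition, and colours."


@dataclass
class Description:
    genre: str
    text: str
    reviewed: bool = False  # becomes True once a person has checked the text


def build_prompt(genre: str) -> str:
    """Combine a general instruction with the guideline for this genre."""
    guideline = GENRE_GUIDELINES.get(genre.lower(), DEFAULT_GUIDELINE)
    return f"Write a clear, vivid audio description of this {genre} painting. {guideline}"


def describe(image: bytes, genre: str, model: Callable[[bytes, str], str]) -> Description:
    """Produce a draft description; `model` is any image-plus-prompt-to-text callable."""
    return Description(genre=genre, text=model(image, build_prompt(genre)))


def review(draft: Description, edited_text: Optional[str] = None) -> Description:
    """Human-in-the-loop step: a reviewer approves the draft or replaces its text."""
    return Description(draft.genre, edited_text or draft.text, reviewed=True)


def fake_model(image: bytes, prompt: str) -> str:
    # Stand-in for a real vision-language model call (an assumption for this sketch).
    return f"[draft based on prompt: {prompt}]"


draft = describe(b"...image bytes...", "landscape", fake_model)
final = review(draft, "Rolling green hills recede toward a pale morning sky.")
print(final.reviewed, final.text)
```

Keeping the guideline lookup separate from the model call means reviewers can refine the wording for one genre without touching the rest of the pipeline, which is one plausible way to support the kind of iterative review the research team describes.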