Name
A Fully AI Approach to Descriptive Video Accessibility
Date & Time
Wednesday, October 23, 2024, 3:30 PM - 4:00 PM
Description

Descriptive Video Service (DVS), also known as Audio Description (AD), enhances media accessibility by providing an additional audio track that narrates on-screen activities for individuals who are blind or have low vision. This service integrates detailed descriptions of visual elements—such as settings, character appearances, and text—into the original audio, including dialogue, music, and sound effects. Traditionally, AD involves manually scripting and recording descriptions by trained writers and voice actors. This paper explores advancements toward automating the AD process using generative AI for scriptwriting and description, text-to-speech technology for narration, and automated audio mixing. It evaluates these technologies' accuracy, contextual relevance, tone appropriateness, and overall quality compared to traditional methods. The study also examines the impact of automation on AD production efficiency, cost, and volume, potentially facilitating greater compliance with accessibility regulations and broadening the availability of accessible video content.

Technical Depth of Presentation
Intermediate
Take-Aways from this Presentation

Background on Audio Description (also called Descriptive Video) and why it's important for accessibility and compliance Demonstrating how AI workflows can successfully describe video for accessibility purposes Discussion of impact of fast turnaround and low cost described video on the availability of this service in the market