Friday, March 13, 2026
Back to All Stories
Production Technology

Project Gutenberg Tackles Accessibility with AI-Assisted Alt Text Tool

Project Gutenberg has deployed an AI-assisted tool to generate descriptive alt text for images in its vast catalogue of public domain e-books, making tens of thousands of titles more accessible to visually impaired readers.

Person using assistive technology at a computer

Analysis

Project Gutenberg's decision to deploy AI-assisted alt text generation across its catalogue represents one of the most consequential accessibility upgrades in the history of digital public domain publishing. The organisation's archive contains over 70,000 e-books, a significant proportion of which include illustrations, diagrams, maps, and decorative elements that have historically been invisible to screen readers. For the millions of visually impaired readers who rely on assistive technology, this has meant encountering unexplained gaps in texts that sighted readers experience as richly illustrated.

The timing of this initiative is notable. Accessibility legislation has been tightening across major markets — the European Accessibility Act came into full force in 2025, and the United States continues to see litigation under the Americans with Disabilities Act targeting digital content providers. While Project Gutenberg operates as a non-profit and is unlikely to face the same legal exposure as commercial publishers, the organisation's move signals a broader cultural shift: accessibility is no longer treated as an afterthought or a compliance burden, but as a core dimension of what it means to make a text genuinely available to the public.

The use of AI for alt text generation raises legitimate questions about quality. Automated image description has improved dramatically in recent years — modern vision-language models can produce contextually accurate descriptions of complex historical illustrations, maps, and portraits — but they are not infallible. Errors in alt text can be more disorienting for screen reader users than no description at all, since a wrong description actively misleads rather than simply omitting information. Project Gutenberg has indicated that its process includes human review for a sample of generated descriptions, though the scale of the catalogue makes comprehensive human oversight impractical.

What makes this development particularly interesting from a publishing technology perspective is the precedent it sets for the wider industry. Commercial publishers with large backlist catalogues face the same problem at far greater scale: millions of previously published titles contain images that were never described for accessibility purposes. The cost of human-written alt text at that scale is prohibitive, but AI-assisted generation — with appropriate quality controls — makes the problem tractable. Project Gutenberg's open-source approach to this tooling means that the methods it develops could be adopted by other digital libraries, academic repositories, and publishers who lack the resources to build their own solutions.

The broader implication is that AI is beginning to address accessibility gaps that the publishing industry has largely ignored for decades. The focus on AI in publishing has understandably centred on content generation, rights disputes, and the economics of authorship — but the technology's most unambiguously positive applications may be in making existing content accessible to audiences who have been systematically underserved. Project Gutenberg's initiative deserves recognition not just as a technical achievement but as a statement of values about who digital publishing is actually for.