Readability of Scanned Books in Digital Libraries & Enhancing Readability of Scanned Picture Books
Readability of Scanned Books in Digital Libraries
From the abstract:
Displaying scanned book pages in a web browser is difficult, due to an array of characteristics of the common user’s configuration that compound to yield text that is degraded and illegibly small. For books which contain only text, this can often be solved by using OCR or manual transcription to extract and present the text alone, or by magnifying the page and presenting it in a scrolling panel. Books with rich illustrations, especially children’s picture books, present a greater challenge because their enjoyment is dependent on reading the text in the context of the full page with its illustrations.
and
Enhancing Readability of Scanned Picture Books
We describe a system that enhances the readability of scanned picture books. Motivated by our website of children’s books in the International Children’s Digital Library, the system separates textual from visual content which decreases the size of the image files (since their quality can be lower) while increasing the quality of the text by displaying it as computer-generated text instead of an image.
Source: Human-Computer Interaction Lab, University of Maryland
