S. R. Klemmer, J. Graham, G. J. Wolff, and J. A. Landay, “Books with voices: paper transcripts as a physical interface to oral histories,” in CHI ’03: Proceedings of the SIGCHI conference on Human factors in computing systems, (New York, NY, USA), pp. 89–96, ACM, 2003. [PDF]
———–
This paper describes Books with Voices, an enhacement of paper transcripts enabling random access to digital video interviews on a PDA. Historians collect a huge amount of audio interviews that later are transcribed. However, the audio recording preserves some value as the original voice of the interviewee, the intonation of the words, etc. Unfortunately, because this material is mostly undedited and difficult to find, the textual trasncripts are the preferred source of infromation. Therefore, the authors proposed a prototype that could help relate a certain textual transcript to the original audio souce.
They performed a qualitative evaluation of the prototype with 13 participants. The video helped readers clarify the text and observe non-verbal cues.
The paper contains also a thorough literature review on the subject.

C. Duarte and L. Carriço, When You Can’t Read It, Listen to It! An Audio-Visual Interface for Book Reading, vol. 5616 of Lecture Notes in Computer Science, pp. 24–33. Springer Berlin / Heidelberg, 2009. [PDF]
——–
This paper describe the Digital Talking Book player for mobile devices, an application that was designed to allow people with visual impairments to learn from audiobooks. It allows the user to replicate the same functionalities of a paper book on digital content. The paper presents an usability evaluation of the prototype.

S. Zerachovitz and M. Zuker, “The book is talking to you – using an audio version of the course textbooks to support learning,” in Proceeding of ITHET07 conference, (Kumamoto, Japan), June 10-13 2007. [PDF]
————–
This paper presents subjective-reported data that support the thesis that audio-books of course material is helpful for students with learning disabilitites and for those whose main language is not the one the course is taught. The authors found that students like tghe convenience of learning whenever and wherever they choose. Many student listened to the audio books while reading the printed book. Learning by the audio book si more passive than learning by reading. Many students felt that learning while listening to the audio book increased their comprehension.
I. Trancoso, C. Duarte, A. Serralheiro, D. Caseiro, L. Carriço, and C. Viana, “Spoken language technologies applied to digital talking books,” in Proceedings of Interspeech, (Pittsburgh, PA, USA), September 17-21 2006. [PDF]
———
This paper presents the DTB player, which offer to visually impaired users an evolution of paper books. The prototype offers a multimodal interface that presents the textual content of the book synchronized with an audio narration, either pre-recorded by a human speaker or constructed using a text-to-speech synthetizer. Speech recognition allows further the user to add bookmarks or annotation to the book. This paper summarized the different language technologies that may be integrated in spoken books and the different application domains in which spoken books might be used.

Recent Comments