This paper addresses the problem that a listener experiences when attempting to capture information presented during a lecture, meeting, or interview. Listeners must divide their attention between the talker and their notetaking activity. We propose a new device-the Audio Notebook-for taking notes and interacting with a speech recording. The Audio Notebook is a combination of a digital audio recorder and paper notebook, all in one device. Audio recordings are structured using two techniques: user structuring based on notetaking activity, and acoustic structuring based on a talker's changes in pitch, pausing, and energy. A field study showed that the interaction techniques enabled a range of usage styles, from detailed review to high speed skimming. The study motivated the addition of phrase detection and topic suggestions to improve access to the audio recordings. Through these audio interaction techniques, the Audio Notebook defines a new approach for navigation in the audio domain.
Speech technology promises to enable 'natural' experiences that remove the translation between a user's desires and a system's actions. While speech systems can translate speech into text, they cannot yet understand the vast range of users' intentions. The use of speech is also expanding from mobile contexts to environments like the living room. Given these shifts, key design questions emerge. How do we encourage natural speech while conveying scope? How do we provide feedback of understanding? What is the role of text-to-speech in this context? This paper presents findings from studying a prototype speech system enabling users to find movies and TV shows. We show the influence of a novel approach for conveying examples, a method for seamlessly integrating text and audio feedback, and share guidance on the use of textto-speech in the living room.
VoiceNotes is an application for a voice-controlled handheld computer that allows the creation, management and retrievat of user-authored voice notes-small segments of digitized speech containing thoughts, ideas, reminders, or things to do. Iterative design and user testing helped to refine the initial user interface design. VoiceNotes explores the problem of capturing and retrieving spontaneous ideas, the use of speech as data, and the use of speech input and output in the user interface for a hand-held computer without a visual display. In addition, VoiceNotes serves as a step toward new uses of voice technology and interfaces for future portable &vices.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.