Text versus speech : a comparison of tagging input modalities for camera phones

Details

Serval ID
serval:BIB_8749971B012A
Type
Inproceedings: an article in a conference proceedings.
Collection
Publications
Title
Text versus speech : a comparison of tagging input modalities for camera phones
Title of the conference
Proceedings of the 11th International Conference on Human-Computer Interaction with Mobile Devices and Services - MobileHCI '09
Author(s)
Cherubini M., Anguera X., Oliver N., de Oliveira R.
Publisher
Association for Computing Machinery (ACM)
Address
Bonn, Germany
ISBN
978-1-60558-281-8
Publication state
Published
Issued date
2009
Language
english
Abstract
Speech and typed text are two common input modalities for mobile phones. However, little research has compared them in their ability to support annotation and retrieval of digital pictures on mobile devices. In this paper, we report the results of a month-long field study in which participants took pictures with their camera phones and had the choice of adding annotations using speech, typed text, or both. Subsequently, the same subjects participated in a controlled experiment where they were asked to retrieve images based on annotations as well as retrieve annotations based on images in order to study the ability of each modality to effectively support users' recall of the previously captured pictures. Results demonstrate that each modality has advantages and shortcomings for the production of tags and retrieval of pictures. Several guidelines are suggested when designing tagging applications for portable devices.
Keywords
Photo tagging, text tagging, audio tagging, camera phones, personal image search
Create date
29/11/2016 14:16
Last modification date
20/08/2019 14:46
Usage data