Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility 2013
DOI: 10.1145/2513383.2513443
|View full text |Cite
|
Sign up to set email alerts
|

Real time object scanning using a mobile phone and cloud-based visual search engine

Abstract: Computer vision and human-powered services can provide blind people access to visual information in the world around them, but their efficacy is dependent on high-quality photo inputs. Blind people often have difficulty capturing the information necessary for these applications to work because they cannot see what they are taking a picture of. In this paper, we present Scan Search, a mobile application that offers a new way for blind people to take high-quality photos to support recognition tasks. To support r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 40 publications
(23 citation statements)
references
References 11 publications
0
23
0
Order By: Relevance
“…As seen in both related work [24] and our experiments, blind people usually have worse phototography skills than sighted people, so an accessible camera interface with few restric tions and rich guidance is crucial to help them take better photos. Existing camera interfaces often fail to provide assis tance, resulting in poor performance of photo-based assistive applications.…”
Section: Discussionmentioning
confidence: 62%
See 1 more Smart Citation
“…As seen in both related work [24] and our experiments, blind people usually have worse phototography skills than sighted people, so an accessible camera interface with few restric tions and rich guidance is crucial to help them take better photos. Existing camera interfaces often fail to provide assis tance, resulting in poor performance of photo-based assistive applications.…”
Section: Discussionmentioning
confidence: 62%
“…al's key frame extraction algorithm [24], we created an panorama interface for RegionSpeak which has no restriction that needs visual inspection. Users of RegionSpeak can move the camera in any direction, and the key frame extraction al gorithm will detect substantial changes in view port and alert users to hold their position to capture a new image.…”
Section: Interface Detailsmentioning
confidence: 99%
“…These include ThirdEye [5], VIZWIZ [3] and LendAnEye [13], and others [6,[13][14][15]. These solutions make use of the integration between human resource and the information technology.…”
Section: Related Workmentioning
confidence: 99%
“…Indeed, they can ask anyone for the unknown object but there were some confidential data and situations that he can't share with any strangers except close friend or a family's member [6].…”
Section: Introductionmentioning
confidence: 99%
“…These descriptors are invariant to translation, scaling and rotation of objects and partially invariant to changes in illumination. Relatively quick calculation of the image features allows the development of systems for object recognition that work nearly in real-time [11]. Computer-vision-based techniques have a higher functionality, but also drawbacks such as: (1) high cost of server-side hardware and software; (2) still low recognition accuracy of such techniques resulting in safety concerns for use by BVI persons [9,12]-recognition depends not only on the descriptor used but also on the training data, training algorithm and type of classifier; (3) such systems are sensitive to the illumination; (4) intensive camera use of mobile devices quickly shortens battery life; (5) intensive network traffic, especially in systems where images are processed entirely by software from the server side, involves paying a higher price for mobile data transfer; and (6) it is still hard to extract detailed descriptions of objects from images.…”
Section: Introductionmentioning
confidence: 99%