2009 IEEE Workshop on Automatic Speech Recognition &Amp; Understanding 2009
DOI: 10.1109/asru.2009.5372952
|View full text |Cite
|
Sign up to set email alerts
|

Voice-based information retrieval — how far are we from the text-based information retrieval ?

Abstract: Although network content access is primarily text-based today, almost all roles of text can be accomplished by voice. Voice-based information retrieval refers to the situation that the user query and/or the content to be retried are in form of voice. This paper tries to compare the voice-based information retrieval with the currently very successful text-based information retrieval, and identifies two major issues in which voice-based information retrieval is far behind: retrieval accuracy and user-system inte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 85 publications
0
4
0
Order By: Relevance
“…With a 20% expected annual growth rate and projected sales of more than 500 million units worldwide in 2024 (Wadhwani and Gankar, 2018), the potential influence of smart speakers recommendations is huge. However, customers cannot process voice-based information as efficiently as visual or even text-based information, mostly because of a lack of accuracy and user-system interaction (Lee and Pan, 2010), so smart speakers need to offer engaging recommendations that generate favorable attitudes toward the recommended product or service as well as purchase or visiting intentions.…”
Section: Introductionmentioning
confidence: 99%
“…With a 20% expected annual growth rate and projected sales of more than 500 million units worldwide in 2024 (Wadhwani and Gankar, 2018), the potential influence of smart speakers recommendations is huge. However, customers cannot process voice-based information as efficiently as visual or even text-based information, mostly because of a lack of accuracy and user-system interaction (Lee and Pan, 2010), so smart speakers need to offer engaging recommendations that generate favorable attitudes toward the recommended product or service as well as purchase or visiting intentions.…”
Section: Introductionmentioning
confidence: 99%
“…Traditionally, the spoken query detection is performed by cascading an automatic speech recognition (ASR) system with text based retrieval techniques [1], [2], [3], [4]. In this approach, the spoken queries as well as the test utterances are first converted into a sequence of words or symbols.…”
Section: Introductionmentioning
confidence: 99%
“…The objective function in (4) can be the sum of the differences between all positive and negative example pairs here, 4 With the new acoustic models to update in (6), only in (6) have to be changed without generating new lattices, so updating on-line is not computation-intensive [129].…”
Section: ) Retrieval-oriented Acoustic Modeling Under Relevancementioning
confidence: 99%