Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429)
DOI: 10.1109/icip.2003.1246942

Text to visual synthesis with appearance models

Cited by 4 publications (14 citation statements)
References 8 publications
“…The present work is our first approach to automatic emotional speech synthesis in Catalan with the purpose of including emotional expressivity in the output channel of an HCI system [7] [8]. Catalan is the native language of Catalonia, the Valencian Country and the Balearic Islands (central east and north east part of Spain), which is spoken by more than 6 million people.…”
Section: Introduction (mentioning)
confidence: 99%
“…The short sequence consists of 316 images and it has been used to compare the results obtained from our On-the-fly Training Algorithm and its previous non-causal version [7]. Achieving the same quality in the results (see Fig.…”
Section: On-the-fly Training Algorithm (mentioning)
confidence: 85%
“…First of all, the four masks π^r are manually extracted from the first image the corresponding alignment coefficients a_1 are set to 0; they represent the affine transformation used to fit the masks onto the face on each frame [5]. Using the tracking algorithm presented in [7] (Figure 3) can be executed. Besides, only those columns of U^r_{t+1} and V^r_{t+1} whose values of Σ^r_{t+1} exceed a threshold τ are considered, keeping only those eigenvectors with enough information.…”
Section: Training Process (mentioning)
confidence: 99%
“…The visual information is extracted from the recorded image sequence using the registration algorithm presented in [18]. This algorithm takes as input the recorded image sequence and a set of masks and returns a set of orthonormal bases B (PSFAM) and a matrix of coefficients C with columns c i .…”
Section: Visual Information (mentioning)
confidence: 99%