2011
DOI: 10.1007/s12046-011-0047-z
|View full text |Cite
|
Sign up to set email alerts
|

Role of neural network models for developing speech systems

Abstract: This paper discusses the application of neural networks for developing different speech systems. Prosodic parameters of speech at syllable level depend on positional, contextual and phonological features of the syllables. In this paper, neural networks are explored to model the prosodic parameters of the syllables from their positional, contextual and phonological features. The prosodic parameters considered in this work are duration and sequence of pitch (F 0 ) values of the syllables. These prosody models ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(13 citation statements)
references
References 97 publications
0
13
0
Order By: Relevance
“…This result indicated that G2 had an inability to identify the duration of sounds, the non-verbal sounds in the presence of other sounds, and the acquisition of information about the tone of the language (8,9) , Such temporal aspects of speech are fundamental to understanding the information, and act as a resource linked to the processing of language subsystems, phonological aspects, and syntactic and semantic cues. Research in the field of psycholinguistics provide enough data for such findings (23,24,25,26) . The literature indicates that emphasis should be placed on paralinguistic aspects (non-verbal) and on the language pragmatics, as they facilitate the enunciation of the speaker, and allow for different interpretations of the message's meanings.…”
Section: Discussionmentioning
confidence: 95%
See 2 more Smart Citations
“…This result indicated that G2 had an inability to identify the duration of sounds, the non-verbal sounds in the presence of other sounds, and the acquisition of information about the tone of the language (8,9) , Such temporal aspects of speech are fundamental to understanding the information, and act as a resource linked to the processing of language subsystems, phonological aspects, and syntactic and semantic cues. Research in the field of psycholinguistics provide enough data for such findings (23,24,25,26) . The literature indicates that emphasis should be placed on paralinguistic aspects (non-verbal) and on the language pragmatics, as they facilitate the enunciation of the speaker, and allow for different interpretations of the message's meanings.…”
Section: Discussionmentioning
confidence: 95%
“…It can be divided into: its linguistic aspects, which include speech, as well as its extralinguistic aspects, or paralinguistic elements (stress/loudness, pitch and speech intonation); non-verbal communication (gestures, facial and body language and eye contact); and metalinguistic aspects (ability to use language to analyze language) (23) . The combination of paralinguistic aspects constitutes the term oral/auditory speech prosody, which is responsible for conveying the emotional and inferential perceptions of speech that contribute largely towards speech intelligibility for the listener (24,25) . The interpreters of G1 and G2 went through a process of understanding the spoken language of the proposed audio, for five minutes.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…These features have been proven to be the key feature in human perception of speech. In [9] it is shown that combination of spectral and prosodic features can improve the system performance. Few prosodic features have been investigated here in combination with spectral features for their capability to give dialect specific information.…”
Section: Hindi Dialect Speech Corpusmentioning
confidence: 99%
“…The second layer (first hidden layer) of the network has more units than the input layer, and it can be interpreted as capturing some local features in the input space. The third layer (second hidden layer) has fewer units than the first layer, and can be interpreted as capturing some global features (Rao 2011). Thus the activation function for the units at the input layer is linear, and for the units at the hidden layers, it is non-linear.…”
Section: Multilayer Perceptronsmentioning
confidence: 99%