2019 International Conference on Communication and Signal Processing (ICCSP), 2019
DOI: 10.1109/iccsp.2019.8698039

Speech Recognition in Kannada using HTK and Julius: A Comparative Study

Citations: cited by 11 publications (2 citation statements)
References: 7 publications
“…Speaking vocal commands directly to applications, instead of relying on a mouse and keyboard to manipulate text, accelerates communication in human-computer interactions. However, achieving accurate automatic speech recognition (ASR) remains a major challenge due to factors such as speaker and language variability, vocabulary size, and noise interference [10,11].…”
Section: Introduction (mentioning)
confidence: 99%
“…Some of the human speech (6) production terminologies are respiration that is inhaling and exhaling air to control vocal intensity and loudness, phonation is the determination of how voiced sounds are produced, articulation is the restriction of airflow in the vocal tract producing a word, oral/nasal resonance is the sound produced as it goes through the mouth/nose, prosody is the tone of speaker utterance that may be a question/command/irony/emphasis/inference, phoneme (7) smallest component that may cause a change in meaning, phone (8) is a basic unit of sound utterance, phonological responsiveness is capability of the audience to perceive words, phonemic responsiveness is the capability of the audience to recognize phonemes, phonemic transcripts use very few symbols of phonetics, a single phoneme for each, phonetic spelling is the confirmation of pronunciation of each single letter as a word, phonetic transcriptions are the illustrations of spoken speech sounds, phonetics (9) involves detailed analysis of human speech and its perception, acoustic phonetics (10) is the analysis of transmission of sounds from narrator to listener, phonology (11) is a subdivision of phonetics that analyzes phoneme pronunciation, articulatory phonetics (12) is a study of building spoken speech sounds by narrator, auditory phonetics (13) is the treatment and sensitivity of verbal communication, stress may be given to a syllable in a word for some words in a sentence, phonics is a subdivision of linguistics that is concerned with the spoken oral sound, phonotactics (14) is a branch for specifying the rules for a phoneme and articulators (15) are the movable speech organs in the speech production.…”
Section: Introduction (mentioning)
confidence: 99%