DOI: 10.14711/thesis-b1514768
|View full text |Cite
|
Sign up to set email alerts
|

Multi-task learning deep neural networks for automatic speech recognition

Abstract: Chapter 1 Introduction 1 1.1 Why Multi-task Learning (MTL) for ASR? 3 1.2 How to Apply MTL to ASR? 5 1.3 Thesis Outline 6 Chapter 2 Review of Automatic Speech Recognition 8 2.1 Automatic Speech Recognition 8 2.2 Language Model 9 2.3 Acoustic Model 2.3.1 Hidden Markov Model (HMM) 2.3.2 Gaussian Mixture Model (GMM) 2.3.3 Deep Neural Network (DNN) 2.4 Phonetic Unit Selection 2.4.1 Context-independent (CI) Units 2.4.2 Context-dependent (CD) Units 2.5 Context-Dependent Acoustic Modeling vi 2.6 Data Scarcity Problem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 89 publications
(110 reference statements)
0
9
0
Order By: Relevance
“…Research efforts have been made in counting mitosis cells in breast cancer, lymphocytes in breast cancer WSIs, centroblasts on follicular lymphoma WSIs, and plasma cells in bone marrow image patches 41,64,83,164 . Counting cells typically requires cell segmentation; a variety of attempts have been made for cell segmentation on breast cancer, 165–168 colon cancer, 146,169 and bladder cancer 170 . Research attempts have also been made on counting neuroendocrine tumor (NET) cells within the gastrointestinal tract and pancreas 171 .…”
Section: Clinical Tasks In Computational Histopathologymentioning
confidence: 99%
“…Research efforts have been made in counting mitosis cells in breast cancer, lymphocytes in breast cancer WSIs, centroblasts on follicular lymphoma WSIs, and plasma cells in bone marrow image patches 41,64,83,164 . Counting cells typically requires cell segmentation; a variety of attempts have been made for cell segmentation on breast cancer, 165–168 colon cancer, 146,169 and bladder cancer 170 . Research attempts have also been made on counting neuroendocrine tumor (NET) cells within the gastrointestinal tract and pancreas 171 .…”
Section: Clinical Tasks In Computational Histopathologymentioning
confidence: 99%
“…Alternatively, the second modality is considered as privileged information given at training time but not at test time. Most works on this direction focused on joint monocular depth estimation and semantic segmentation showing that joint training allows improving the performance of both tasks (Wang et al, 2015;Mousavian et al, 2016;Zhang et al, 2018b;Kendall et al, 2018;Chen et al, 2018d;He et al, 2021b). A multi-task guided Prediction-and-Distillation Network was designed by Xu et al (2018), where the model first predicts a set of intermediate auxiliary tasks ranging from low to high level, and then such predictions are used as multi-modal input to a multi-modal distillation module, opted at learning the final tasks.…”
Section: Sis With Additional Modalitiesmentioning
confidence: 99%
“…Moreover, the domain mismatch scenario has a low-resource problem if the target domain has only fewer data compared to the scale of the source domain data. There are also several works that tried to solve this problem, such as (Chen and Mak, 2015;Zoph et al, 2016;. Our jointly trained model is also based on this low-resource scenario.…”
Section: Joint Training and Transfer Learningmentioning
confidence: 99%