The development of a speech recognition system requires at least three resources: a large labeled speech corpus to build the acoustic model, a pronunciation lexicon to map words to phone sequences, and a large text corpus to build the language model. For many languages such as dialects or minority languages, these resources are limited or even unavailable-we label these languages as under-resourced. In this thesis, the focus is to develop reliable acoustic models for under-resourced languages. The following three works have been proposed. I would like to express my sincere thanks and appreciation to my supervisor, Dr. Chng Eng Siong (NTU), and co-supervisor, Dr. Li Haizhou (I 2 R) for their invaluable guidance, support and suggestions. Their encouragement also helps me to overcome the difficulties encountered in my research. My thanks also go to Dr. Xiao Xiong (NTU) for his help, discussions during my PhD study time. My thanks also go to my colleagues in the speech group of the International Computer Science Institute (ICSI) including Prof. Nelson Morgan, Dr. Steven Wegmann, Dr. Adam Janin, Arlo Faria for their generous help and fruitful discussions during my internship at ICSI. I also want to thank my colleagues in the speech group in NTU, for their help. I am very comfortable to collaborate with my team mates Guangpu,