Machine translation is a sub-field of Natural Language Processing and Artificial Intelligence that investigates the use of computers to translate a given text from one human language to another. More specifically, Statistical Machine Translation is an approach used to build these translation systems. The quality of these systems depends mostly, on the example translations used to train or adapt the models. Corpora can come from a variety of sources, many of which are not optimal for common specific domains. Hence, the primary purpose of this thesis is to find out the right data to train or adapt models from, applied to a particular domain or task. This thesis proposes different Data Selection methods to identify task-relevant translation training data from a general data pool.