“…It has been the focus of a range of corpus-based studies employing different terminologies, (e.g., pattern, collocation, colligation, multi-word units, lexical bundles, n-gram, construction, among others), but all emphasise the inter-dependence of form and meaning (Biber, 2006;Biber et al, 1999Hoey, 2005;Hunston and Francis, 2000;Hyland, 2008;and Goldberg, 2006). Crossley and Louwerse (2007) classify registers using the frequency of bigrams shared among nine spoken and two written corpora, the findings of which demonstrate that the phrasal units and grammatical constructions can function as a powerful approach to MD analysis. Indeed, as Gries et al (2011) observe, 'a pure n-gram-based approach can be used as an initial, computationally cheap, way of classifying corpus registers that produces useful results.…”