Lexical frequency is one of the strongest predictors of word processing time. The frequencies are often calculated from book-based corpora, or more recently from subtitlebased corpora. We present new frequencies based on Twitter, blog posts, or newspapers for 66 languages. We show that these frequencies predict lexical decision reaction times similar to the already existing frequencies, or even better than them. These new frequencies are freely available and may be downloaded from http://worldlex.lexique.org.Keywords Word frequency . Cross-language frequency . Twitter . BlogsThe number of occurrences of a word within a corpus is one of the best predictor of word processing time (Howes & Solomon, 1951). High-frequency words are processed more accurately and more rapidly than low-frequency words, both in comprehension and in production (Baayen, Feldman, & Schreuder, 2006;Balota, Cortese, Sergent-Marshall, Spieler, & Yap, 2004;Monsell, 1991;Yap & Balota, 2009). This word frequency effect was observed in different tasks such as lexical decision tasks (Andrews & Heathcote, 2001;Balota et al., 2004), perceptual identification tasks (Grainger & Jacobs, 1996;Howes & Solomon, 1951), pronunciation tasks (Balota & Chumbley, 1985;Forster & Chambers, 1973), and semantic categorization tasks (Andrews & Heathcote, 2001;Taft & van Graan, 1998). This word frequency effect is a robust effect, since it was found in many languages.For a long time, corpora were compiled from written texts, principally books (Kučera & Francis, 1967;Thorndike & Lorge, 1944 (Kučera & Francis, 1967). Some corpora were used despite their age and despite some critics (see Brysbaert & New, 2009, for a discussion of the Kučera & Francis corpus).More recently, another source of corpora was found to be reliable: movie subtitles. The subtitle-based frequencies were first computed in French by New, Brysbaert, Véronis, and Pallier (2007). The authors showed two main results. First, they showed that the subtitle-based frequencies were a better predictor of reaction times than the book-based frequencies. Second, the subtitle-based frequencies were complementary to book-based frequencies. For instance, typical words from spoken language in everyday life were much more frequent in the subtitle-based than in the book-based corpora. Because the book-based and subtitle-based frequencies were shown to be complementary in the analyses (they explained more variance together than separately), the authors concluded that bookbased frequencies could be good estimates of written language and that subtitle-based frequencies could be good estimates of spoken language. The subtitle-based frequencies were then