The World Wide Web evolved so rapidly that it is no longer considered a luxury, but a necessity. That is why currently the most popular infection vectors used by cybercriminals are either web pages or commonly used documents (such as pdf files). In both of these cases, the malicious actions performed are written in JavaScript. Because of this, JavaScript has become the preferred language for spreading malware. In order to be able to stop malicious content from executing, detection of its infection vector is crucial. In this paper we propose various methods for detecting JavaScript-based attack vectors.For achieving our goal we first need to fight metamorphism techniques usually used in JavaScript malicious code, which are by no means trivial: garbage instruction insertion, variable renaming, equivalent instruction substitution, function permutation, instruction reordering, and so on. Our approach to deal with metamorphism starts with splitting the JavaScript content in components and filtering the insignificant ones. We then use a data set, consisting in over one million JavaScript files in order to test several machine learning algorithms such as Hidden Markov Models, linear classifiers and hybrid approaches for malware detection.Finally, we analyze these detection methods from a practical point of view, emphasizing the need for a very low false positive rate and the ability to be trained on large datasets.